textflint.generation_layer.transformation.NER.swap_ent

Substitute short entities to longer ones

class textflint.generation_layer.transformation.NER.swap_ent.SwapEnt(swap_type='OOV', res_path=None, **kwargs)[source]

Bases: textflint.generation_layer.transformation.transformation.Transformation

Swap entities which shorter than threshold to longer ones.

__init__(swap_type='OOV', res_path=None, **kwargs)[source]
Parameters

swap_type (string) – the swap type in

[‘CrossCategory’, ‘OOV’, ‘SwapLonger’]

Parameters

res_path (string) – dir for vocab/dict

class textflint.generation_layer.transformation.NER.swap_ent.Transformation(**kwargs)[source]

Bases: abc.ABC

An abstract class for transforming a sequence of text to produce a list of potential adversarial example.

processor = <textflint.common.preprocess.en_processor.EnProcessor object>
transform(sample, n=1, field='x', **kwargs)[source]

Transform data sample to a list of Sample.

Parameters
  • sample (Sample) – Data sample for augmentation.

  • n (int) – Max number of unique augmented output, default is 5.

  • field (str|list) – Indicate which fields to apply transformations.

  • **kwargs (dict) –

    other auxiliary params.

Returns

list of Sample

classmethod sample_num(x, num)[source]

Get ‘num’ samples from x.

Parameters
  • x (list) – list to sample

  • num (int) – sample number

Returns

max ‘num’ unique samples.

textflint.generation_layer.transformation.NER.swap_ent.download_if_needed(folder_name)[source]

Folder name will be saved as .cache/textflint/[folder_name]. If it doesn’t exist on disk, the zip file will be downloaded and extracted.

Parameters

folder_name (str) – path to folder or file in cache

Returns

path to the downloaded folder or file on disk