textflint.generation_layer.transformation.CWS.swap_contraction

Replace abbreviations with full names.

class textflint.generation_layer.transformation.CWS.swap_contraction.SwapContraction(**kwargs)[source]

Bases: textflint.generation_layer.transformation.transformation.Transformation

Replace abbreviations with full names.

Example:

央视 -> 中央电视台
__init__(**kwargs)[source]
Parameters
  • abbreviation_dict (dict) – the dictionary of abbreviation

  • **kwargs

static make_dict(path)[source]

read data

Parameters

path (str) – the path of data

Returns

the dic of data

class textflint.generation_layer.transformation.CWS.swap_contraction.Transformation(**kwargs)[source]

Bases: abc.ABC

An abstract class for transforming a sequence of text to produce a list of potential adversarial example.

processor = <textflint.common.preprocess.en_processor.EnProcessor object>
transform(sample, n=1, field='x', **kwargs)[source]

Transform data sample to a list of Sample.

Parameters
  • sample (Sample) – Data sample for augmentation.

  • n (int) – Max number of unique augmented output, default is 5.

  • field (str|list) – Indicate which fields to apply transformations.

  • **kwargs (dict) –

    other auxiliary params.

Returns

list of Sample

classmethod sample_num(x, num)[source]

Get ‘num’ samples from x.

Parameters
  • x (list) – list to sample

  • num (int) – sample number

Returns

max ‘num’ unique samples.

textflint.generation_layer.transformation.CWS.swap_contraction.download_if_needed(folder_name)[source]

Folder name will be saved as .cache/textflint/[folder_name]. If it doesn’t exist on disk, the zip file will be downloaded and extracted.

Parameters

folder_name (str) – path to folder or file in cache

Returns

path to the downloaded folder or file on disk

textflint.generation_layer.transformation.CWS.swap_contraction.plain_lines_loader(path)[source]

read data