textflint.generation_layer.subpopulation.UT.length¶
Extract samples with long text or short text¶
-
class
textflint.generation_layer.subpopulation.UT.length.LengthSubPopulation(intervals=['0%', '20%'])[source]¶ Bases:
textflint.generation_layer.subpopulation.subpopulation.SubPopulationFilter samples based on text length
Example:
sample 1: "I love textflint", score: 3 sample 2: "I love textflint very much", score: 5
-
class
textflint.generation_layer.subpopulation.UT.length.SubPopulation(intervals=None, **kwargs)[source]¶ Bases:
abc.ABCAn abstract class for extracting subset of examples.
-
text_processor= <textflint.common.preprocess.en_processor.EnProcessor object>¶
-
score(sample, field, **kwargs)[source]¶ Score the sample
- Parameters
sample – data sample
field (str|list) – field str
kwargs –
- Return int
score for sample
-
get_slice(scores, dataset)[source]¶ Pick up samples based on scores
- Parameters
scores (list) – list of int
dataset – Dataset
- Returns
subset samples
-