Get tokenization schemes API
Learn how to retrieve detailed information about the tokenization schemes of match rules.
Name | Required | Description | Default value |
---|---|---|---|
entityType | Yes | The entity type from which the tokenization schemes are retrieved. | - |
scopes | No | The scopes of the match groups to consider for the result. | ALL,INTERNAL,EXTERNAL,NONE |
rules | No | The URIs of the match groups to consider for the result. | All |
types | No | The types of match groups to consider for the result. | All |
includeRules | No | The match group URIs to include in the output when the tokenization schemes are retrieved. This filters the schemesAnalysisExplanations response to return only the match groups you want. | null (Enter a value to filter the response.) |
The scopes, types, and rules parameters are applied with AND logic.

Request
Returns the details of tokenization schemes for each match group.
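
As an illustration of how the request parameters listed above might be assembled, here is a minimal Python sketch. It only builds a query string; the endpoint path is not shown on this page, so it is left out. The function name, the comma-separated encoding of multiple values, and the sample values (`configuration/entityTypes/Individual`, `automatic`) are assumptions made for this example, not confirmed API behavior.

```python
from urllib.parse import urlencode


def build_tokenization_schemes_query(entity_type, scopes=None, rules=None,
                                     types=None, include_rules=None):
    """Build a URL query string from the documented request parameters.

    Hypothetical helper: parameter names come from the table above, but
    the way lists are serialized is an assumption.
    """
    if not entity_type:
        # entityType is the only parameter marked as required.
        raise ValueError("entityType is required")

    params = {"entityType": entity_type}
    optional = {
        "scopes": scopes,
        "rules": rules,
        "types": types,
        "includeRules": include_rules,
    }
    for name, values in optional.items():
        if values:
            # Assumption: multiple values travel as one comma-separated string.
            params[name] = ",".join(values)
        # Omitted parameters fall back to the documented defaults.

    return urlencode(params)


# Sample values are placeholders, not taken from a real tenant.
query = build_tokenization_schemes_query(
    "configuration/entityTypes/Individual",
    scopes=["ALL", "INTERNAL"],
    types=["automatic"],
)
print(query)
```

Because scopes, types, and rules are combined with AND logic, each additional filter passed here can only narrow the set of match groups that are analyzed.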
Example
Response
Field | Description |
---|---|
matchRulesTotal | The total number of match rules analyzed. The number depends on the request parameters. |
distinctTokenizationSchemesTotal | The number of distinct tokenization schemes. |
attributeStats | Overall statistics of attributes participating in tokenization schemes. |
attributeStats.participants | A list of attributes participating in the analysis. |
operandStats | Overall statistics of operands participating in tokenization schemes. |
cleanserStats | Overall statistics for cleansers. |
cleanserStats.NameDictionaryCleanser | A list of match rules where the NameDictionaryCleanser is applied. |
cleanserStats.RegexpReplaceCleanser | A list of match rules where the RegexpReplaceCleanser is applied. |
cleanserStats.TransliterateCleanser | A list of match rules where the TransliterateCleanser is applied. |
tokenizationSchemes | Describes the tokenization schemes. |
tokenizationSchemes.id | Autogenerated identifier of a tokenization scheme. |
tokenizationSchemes.humanReadableTokenizationScheme | A human-readable description of the tokenization scheme, made up of operands, attributes, and match token classes separated by colons. |
tokenizationSchemes.matchGroups | A list of match groups that use the particular tokenization scheme. |
matchTokenClasses | Provides information about match token classes. |
matchTokenClasses.id | Autogenerated ID of the match token class. The ID is unique and a new ID is generated for match token classes with different parameters. |
matchTokenClasses.class | Name of the match token class. |
matchTokenClasses.parameters | A list of parameters in the match token class. |
matchTokenClasses.matchGroups | A list of match groups in the match token class. |
matchTokenClasses.attributes | A list of attributes that use the match token class. |
matchGroups | Provides information about the match groups. |
matchGroups.uri | URI of the match group. |
matchGroups.label | Label of the match group. |
matchGroups.type | Type of match group. |
matchGroups.tokenizationSchemes | A list of tokenization schemes generated by the match group. |
matchGroups.ignoreInToken | A list of attributes that are ignored in a token for the match group. |
schemesAnalysis | The analysis of the tokenization schemes. |
schemesAnalysis.schemesWithTheSameAttributes | The tokenization schemes that have the same attributes. |
schemesAnalysis.sameSchemesWithDifferentClasses | The tokenization schemes that differ only by match token classes. |
schemesAnalysis.sameSchemesWithDifferentAttributesOrder | The tokenization schemes that differ by the order of attributes. |
schemesAnalysis.nonSchemeGeneratingRules | A list of the match groups that do not generate any schemes. |
schemesAnalysis.excessStrictSchemes | The tokenization schemes that have additional attributes compared to other schemes. |
schemesAnalysis.excessStrictSchemes.byAttributes | The tokenization schemes that have additional attributes compared to other schemes, compared by attributes only. |
schemesAnalysis.excessStrictSchemes.byAttributes.loseScheme | The tokenization schemes that have fewer attributes. |
schemesAnalysis.excessStrictSchemes.byAttributes.strictSchemes | The tokenization schemes that have more attributes. |
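
To show how the response fields above could be consumed, the following Python sketch condenses a parsed response into a short summary. The field names come from the table; the exact JSON shape is an assumption (matchGroups as a list of objects, nonSchemeGeneratingRules as a list of URIs), and the sample payload is hand-written placeholder data, not real API output.

```python
def summarize_schemes(response):
    """Condense a parsed tokenization schemes response into a few key facts.

    Hypothetical helper: assumes the response is a dict whose keys match
    the field names documented in the table above.
    """
    analysis = response.get("schemesAnalysis", {})
    return {
        "matchRulesTotal": response.get("matchRulesTotal", 0),
        "distinctTokenizationSchemesTotal":
            response.get("distinctTokenizationSchemesTotal", 0),
        # Match groups that generate no tokenization schemes at all.
        "nonSchemeGeneratingRules":
            list(analysis.get("nonSchemeGeneratingRules", [])),
        # Number of tokenization schemes generated by each match group URI.
        "schemesPerMatchGroup": {
            group["uri"]: len(group.get("tokenizationSchemes", []))
            for group in response.get("matchGroups", [])
        },
    }


# Placeholder payload with made-up URIs and scheme IDs.
sample_response = {
    "matchRulesTotal": 2,
    "distinctTokenizationSchemesTotal": 1,
    "matchGroups": [
        {"uri": "matchGroups/ByName", "tokenizationSchemes": ["scheme-1"]},
        {"uri": "matchGroups/ByPhone", "tokenizationSchemes": []},
    ],
    "schemesAnalysis": {
        "nonSchemeGeneratingRules": ["matchGroups/ByPhone"],
    },
}

summary = summarize_schemes(sample_response)
print(summary["schemesPerMatchGroup"])
```

Using `.get` with defaults keeps the sketch tolerant of sections that are absent from a given response, which seems prudent given that the shape here is assumed rather than documented.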