Set NGramOrientalOnly
to True
to apply N-Gram tokenization to Chinese, Japanese, and Korean characters but to ignore multi-byte characters in other languages.
Type: | Boolean |
Default: | False |
Required: | No |
Configuration Section: | LanguageTypes or MyLanguage |
Example: | NGram=2
|
See Also: | NGram
NGramMultiByteOnly |
|