NGramSentenceBrokenScriptOnly
Set NGramSentenceBrokenScriptOnly to True to apply N-Gram tokenization to Chinese, Japanese, and Korean characters but to ignore multi-byte characters in other languages.
Type: | Boolean |
Default: | False |
Required: | No |
Configuration Section: | LanguageTypes or MyLanguage |
Example: | NGram=2 |
See Also: | NGram
NGramMultiByteOnly |
NOTE: If you change this setting after you have indexed content into IDOL Server, the new setting applies only to new content, and the server logs a warning. To clear the warning and ensure that your change applies to all your content, you must initialize your index and reindex the content.
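As an illustration, the following configuration fragment sketches how this parameter might be set alongside NGram. The NGram value of 2 comes from the example above. Placing both parameters in the [LanguageTypes] section is an assumption made for this sketch; they could equally go in an individual language section (MyLanguage).

```
[LanguageTypes]
// Illustrative values only: adjust to match your own language configuration.
// Tokenize in N-Grams of 2 characters.
NGram=2
// Restrict N-Gram tokenization to Chinese, Japanese, and Korean characters.
NGramSentenceBrokenScriptOnly=True
```

With the default value of False, this restriction is not applied.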