The VocabSize
parameter sets the maximum size of the vocabulary to include in the custom language model. If the training text contains more distinct words than the specified number, words are selected based on their frequency in the text. High frequency words are selected over low frequency words.
Type: | Integer |
Default: | 65532 |
Required: | No |
Configuration Section: | lmbuild module |
Example: | VocabSize=100000
|
See Also: | VocabSize (action parameter) DropList KeepList |
|