This section lists the languages that IDOL Server supports, and the most common encodings for each language.
You can set all IDOL Server Encodings
settings to UTF8
or UCS2
. The internal IDOL Server storage encoding is UTF8
.
Acehnese | ||
---|---|---|
Script: | UTF8 | |
[MyLanguage] section name: |
ACEHNESE | |
For encoding: | Set Encodings parameter to: |
|
UTF-8 | UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
AFRIKAANS |
|
For encoding: |
|
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
ALBANIAN |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
AMHARIC |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
Arabic 1A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Arabic |
|
|
ARABIC |
|
For encoding: |
Set |
|
Windows-CP1256 ISO-8859-6 UTF-8 |
ARABIC ARABIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
ARMENIAN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Cyrillic |
|
|
AZERI |
|
For encoding: |
Set |
|
Windows-CP1251 KOI8-R ISO-8859-5 UTF-8 |
CYRILLIC CYRILLIC_KOI8 CYRILLIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
BASQUE |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
Cyrillic |
|
|
BELORUSSIAN |
|
For encoding: |
Set |
|
Windows-CP1251 KOI8-R ISO-8859-5 UTF-8 |
CYRILLIC CYRILLIC_KOI8 CYRILLIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
BENGALI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
BERBER |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
BIHARI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
BIKOL |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
BISHNUPRIYA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
BOSNIAN |
|
For encoding: |
Set |
|
Windows-CP1250 ISO-8859-2 UTF-8 |
EASTERNEUROPEAN EASTERNEUROPEAN_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
BRETON |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
Cyrillic |
|
|
BULGARIAN |
|
For encoding: |
Set |
|
Windows-CP1251 KOI8-R ISO-8859-5 UTF-8 |
CYRILLIC CYRILLIC_KOI8 CYRILLIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
BURMESE |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
Catalan 2A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
CATALAN |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
CEBUANO |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
CHEROKEE |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
Chinese traditional3The language has stemming embedded in sentence breaking. |
|
|
---|---|---|
|
Script: |
Big-5 |
|
|
CHINESE |
|
For encoding: |
Set |
|
Big-5 UTF-8 |
CHINESETRADITIONAL UTF8 |
Chinese simplified4The language has stemming embedded in sentence breaking. |
|
|
---|---|---|
|
Script: |
GB2312-80 |
|
|
CHINESE |
|
For encoding: |
Set |
|
gb2312 UTF-8 |
CHINESESIMPLIFIED UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
CHUVASH |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
CROATIAN |
|
For encoding: |
Set |
|
Windows-CP1250 ISO-8859-2 UTF-8 |
EASTERNEUROPEAN EASTERNEUROPEAN_ISO UTF8 |
Czech 5A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
CZECH |
|
For encoding: |
Set |
|
Windows-CP1250 ISO-8859-2 UTF-8 |
EASTERNEUROPEAN EASTERNEUROPEAN_ISO UTF8 |
Danish 6A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
DANISH |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
DIVEHI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
Dutch 7A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
DUTCH |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
English 8A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
ENGLISH |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
ERZYA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
ESPERANTO |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
ESTONIAN |
|
For encoding: |
Set |
|
Windows-CP1257 ISO-8859-4 UTF-8 |
NORTHERNEUROPEAN NORTHERNEUROPEAN_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
ETHIOPIC |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
FAROESE |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
Finnish 9A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
FINNISH |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
French 10A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
FRENCH |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
FRISIAN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
GAELIC |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
GALICIAN |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
GEORGIAN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
German 11A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
GERMAN |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
GILAKI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
Greek 12A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Greek |
|
|
GREEK |
|
For encoding: |
Set |
|
Windows-CP1253 ISO-8859-7 UTF-8 |
GREEK GREEK_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
GREENLANDIC |
|
For encoding: |
Set |
|
Windows-CP1257 ISO-8859-4 UTF-8 |
NORTHERNEUROPEAN NORTHERNEUROPEAN_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
GUARANI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
GUJARATI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
HAITIAN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
HAUSA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
HAWAIIAN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
Hebrew 13A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Hebrew |
|
|
HEBREW |
|
For encoding: |
Set |
|
Windows-CP1255 ISO-8859-8 UTF-8 |
HEBREW HEBREW_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
HINDI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
Hungarian 14A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
HUNGARIAN |
|
For encoding: |
Set |
|
Windows-CP1250 ISO-8859-2 UTF-8 |
EASTERNEUROPEAN EASTERNEUROPEAN_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
ICELANDIC |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
IGBO |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
ILOKANO |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
INDONESIAN |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
Italian 15A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
ITALIAN |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
Japanese 16The language has stemming embedded in sentence breaking. |
|
|
---|---|---|
|
Script: |
Japanese |
|
|
JAPANESE |
|
For encoding: |
Set |
|
Shift-JIS EUC JIS UTF-8 |
SHIFTJIS EUC JIS UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
JAVANESE |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
KALMYK |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
KANNADA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
KAPAMPANGAN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Cyrillic |
|
|
KAZAKH |
|
For encoding: |
Set |
|
Windows-CP1251 KOI8-R ISO-8859-5 UTF-8 |
CYRILLIC CYRILLIC_KOI8 CYRILLIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
KHMER |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
KIKONGO |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
KINYARWANDA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
KIRUNDI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
KOMI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
Korean 17The language has stemming embedded in sentence breaking. |
|
|
---|---|---|
|
Script: |
Hangul |
|
|
KOREAN |
|
For encoding: |
Set |
|
KS C 5601-1987 KS C 5601-1992 UTF-8 |
KOREAN KOREAN UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
KURDISH |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
Cyrillic |
|
|
KYRGYZ |
|
For encoding: |
Set |
|
Windows-CP1251 KOI8-R ISO-8859-5 UTF-8 |
CYRILLIC CYRILLIC_KOI8 CYRILLIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
LAO |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
LAPPISH |
|
For encoding: |
Set |
|
Windows-CP1257 ISO-8859-4 UTF-8 |
NORTHERNEUROPEAN NORTHERNEUROPEAN_ISO UTF8 |
Latin18A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
LATIN |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
LATVIAN |
|
For encoding: |
Set |
|
Windows-CP1257 ISO-8859-4 UTF-8 |
NORTHERNEUROPEAN NORTHERNEUROPEAN_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
LINGALA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
LITHUANIAN |
|
For encoding: |
Set |
|
Windows-CP1257 ISO-8859-4 UTF-8 |
NORTHERNEUROPEAN NORTHERNEUROPEAN_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
LUXEMBOURGISH |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
Cyrillic |
|
|
MACEDONIAN |
|
For encoding: |
Set |
|
Windows-CP1251 KOI8-R ISO-8859-5 UTF-8 |
CYRILLIC CYRILLIC_KOI8 CYRILLIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
MALAGASY |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
MALAY |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
MALAYALAM |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
MALTESE |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
MANIPURI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin1 |
|
|
MAORI |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
MARATHI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
MAZANDARANI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
MIRANDESE |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Cyrillic |
|
|
MONGOLIAN |
|
For encoding: |
Set |
|
Windows-CP1251 KOI8-R ISO-8859-5 UTF-8 |
CYRILLIC CYRILLIC_KOI8 CYRILLIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
NAHUATL |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
NAVAJO |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
NDEBELE |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
NEPALI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
NEWARI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
Norwegian 19A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
NORWEGIAN |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
ORIYA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
OSSETIAN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
PANJABI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
PAPIAMENTU |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
PERSIAN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
Polish 20A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
POLISH |
|
For encoding: |
Set |
|
Windows-CP1250 ISO-8859-2 UTF-8 |
EASTERNEUROPEAN EASTERNEUROPEAN_ISO UTF8 |
Portuguese 21 A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
PORTUGUESE |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
PUSHTO |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
QUECHUA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
RHAETO-ROMANCE |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
Romanian 22A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
ROMANIAN |
|
For encoding: |
Set |
|
Windows-CP1250 ISO-8859-2 UTF-8 |
EASTERNEUROPEAN EASTERNEUROPEAN_ISO UTF8 |
Russian 23A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Cyrillic |
|
|
RUSSIAN |
|
For encoding: |
Set |
|
Windows-CP1251 KOI8-R ISO-8859-5 UTF-8 |
CYRILLIC CYRILLIC_KOI8 CYRILLIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
SAKHA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
SAMI |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
SANSKRIT |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Cyrillic |
|
|
SERBIAN |
|
For encoding: |
Set |
|
Windows-CP1251 KOI8-R ISO-8859-5 UTF-8 |
CYRILLIC CYRILLIC_KOI8 CYRILLIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
SESOTHO |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
SESOTHOSALEBOA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
SINGHALESE |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
SISWANT |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
Slovak 24A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
SLOVAK |
|
For encoding: |
Set |
|
Windows-CP1250 ISO-8859-2 UTF-8 |
EASTERNEUROPEAN EASTERNEUROPEAN_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
SLOVENIAN |
|
For encoding: |
Set |
|
Windows-CP1250 ISO-8859-2 UTF-8 |
EASTERNEUROPEAN EASTERNEUROPEAN_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
SOMALI |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
SORBIAN |
|
For encoding: |
Set |
|
Windows-CP1250 ISO-8859-2 UTF-8 |
EASTERNEUROPEAN EASTERNEUROPEAN_ISO UTF8 |
Spanish 25A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
SPANISH |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
SRANAN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
SUNDANESE |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
SWAHILI |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
Swedish 26A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
SWEDISH |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
SYRIAC |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
TAGALOG |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
TAHITIAN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Cyrillic |
|
|
TAJIK |
|
For encoding: |
Set |
|
Windows-CP1251 KOI8-R ISO-8859-5 UTF-8 |
CYRILLIC CYRILLIC_KOI8 CYRILLIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
TAMIL |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Cyrillic |
|
|
TATAR |
|
For encoding: |
Set |
|
Windows-CP1251 KOI8-R ISO-8859-5 UTF-8 |
CYRILLIC CYRILLIC_KOI8 CYRILLIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
TELUGU |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Thai |
|
|
THAI |
|
For encoding: |
Set |
|
Windows-CP874/ISO-8859-11 UTF-8 |
THAI UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
TIBETAN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
TOKPISIN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
TONGAN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
TSONGA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
TSWANA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
TURKISH |
|
For encoding: |
Set |
|
Windows-CP1254/ISO-8859-9 UTF-8 |
TURKISH UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
TURKMEN |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Cyrillic |
|
|
UKRAINIAN |
|
For encoding: |
Set |
|
Windows-CP1251 KOI8-R ISO-8859-5 UTF-8 |
CYRILLIC CYRILLIC_KOI8 CYRILLIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
URDU |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
UYGHUR |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Cyrillic |
|
|
UZBEK |
|
For encoding: |
Set |
|
Windows-CP1251 KOI8-R ISO-8859-5 UTF-8 |
CYRILLIC CYRILLIC_KOI8 CYRILLIC_ISO UTF8 |
|
|
|
---|---|---|
|
Script: |
Latin |
|
|
VALENCIAN |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
VENDA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
Vietnamese |
|
|
VIETNAMESE |
|
For encoding: |
Set |
|
Windows-CP1258 UTF-8 |
VIETNAMESE UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
WARAYWARAY |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
Welsh 27A stemming algorithm is available for this language and is applied by default. If you do not want to apply stemming to this language, set Stemming to False for this language. |
|
|
---|---|---|
|
Script: |
Latin |
|
|
WELSH |
|
For encoding: |
Set |
|
Windows-CP1252/ISO-8859-1 UTF-8 |
ASCII UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
WOLOF |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
XHOSA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
YIDDISH |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
YORUBA |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|
|
|
---|---|---|
|
Script: |
UTF8 |
|
|
ZULU |
|
For encoding: |
Set |
|
UTF-8 |
UTF8 |
|