-
Notifications
You must be signed in to change notification settings - Fork 10.3k
Data Files in different versions
Shreeshrii edited this page Jul 3, 2018
·
10 revisions
| Lang Code | Language | 3.00 | 3.02 | 3.04 | 4.00alpha |
|---|---|---|---|---|---|
| afr | Afrikaans | x | |||
| amh | Amharic | x | |||
| ara | Arabic | x | |||
| asm | Assamese | x | |||
| aze | Azerbaijani | x | |||
| aze_cyrl | Azerbaijani - Cyrilic | x | |||
| bel | Belarusian | x | |||
| ben | Bengali | x | |||
| bod | Tibetan | x | |||
| bos | Bosnian | x | |||
| bre | Breton | x | |||
| bul | Bulgarian | x | |||
| cat | Catalan; Valencian | x | |||
| ceb | Cebuano | x | |||
| ces | Czech | x | |||
| chi_sim | Chinese - Simplified | x | |||
| chi_tra | Chinese - Traditional | x | |||
| chr | Cherokee | x | |||
| cym | Welsh | x | |||
| dan | Danish | x | |||
| deu | German | x | |||
| dzo | Dzongkha | x | |||
| ell | Greek, Modern (1453-) | x | |||
| eng | English | x | x | x | x |
| enm | English, Middle (1100-1500) | x | |||
| epo | Esperanto | x | |||
| equ | Math / equation detection module | x | |||
| est | Estonian | x | |||
| eus | Basque | x | |||
| fas | Persian | x | |||
| fin | Finnish | x | |||
| fra | French | x | |||
| frk | Frankish | x | |||
| frm | French, Middle (ca.1400-1600) | x | |||
| gle | Irish | x | |||
| glg | Galician | x | |||
| grc | Greek, Ancient (to 1453) | x | |||
| guj | Gujarati | x | |||
| hat | Haitian; Haitian Creole | x | |||
| heb | Hebrew | x | |||
| hin | Hindi | x | |||
| hrv | Croatian | x | |||
| hun | Hungarian | x | |||
| iku | Inuktitut | x | |||
| ind | Indonesian | x | |||
| isl | Icelandic | x | |||
| ita | Italian | x | |||
| ita_old | Italian - Old | x | |||
| jav | Javanese | x | |||
| jpn | Japanese | x | |||
| kan | Kannada | x | |||
| kat | Georgian | x | |||
| kat_old | Georgian - Old | x | |||
| kaz | Kazakh | x | |||
| khm | Central Khmer | x | |||
| kir | Kirghiz; Kyrgyz | x | |||
| kor | Korean | x | |||
| kor_vert | Korean (vertical) | x | |||
| kur | Kurdish | x | |||
| kur_ara | Kurdish (Arabic) | x | |||
| lao | Lao | x | |||
| lat | Latin | x | |||
| lav | Latvian | x | |||
| lit | Lithuanian | x | |||
| ltz | Luxembourgish | x | |||
| mal | Malayalam | x | |||
| mar | Marathi | x | |||
| mkd | Macedonian | x | |||
| mlt | Maltese | x | |||
| mon | Mongolian | x | |||
| mri | Maori | x | |||
| msa | Malay | x | |||
| mya | Burmese | x | |||
| nep | Nepali | x | |||
| nld | Dutch; Flemish | x | |||
| nor | Norwegian | x | |||
| oci | Occitan (post 1500) | x | |||
| ori | Oriya | x | |||
| osd | Orientation and script detection module | x | x | x | x |
| pan | Panjabi; Punjabi | x | |||
| pol | Polish | x | |||
| por | Portuguese | x | |||
| pus | Pushto; Pashto | x | |||
| que | Quechua | x | |||
| ron | Romanian; Moldavian; Moldovan | x | |||
| rus | Russian | x | |||
| san | Sanskrit | x | |||
| sin | Sinhala; Sinhalese | x | |||
| slk | Slovak | x | |||
| slv | Slovenian | x | |||
| snd | Sindhi | x | |||
| spa | Spanish; Castilian | x | |||
| spa_old | Spanish; Castilian - Old | x | |||
| sqi | Albanian | x | |||
| srp | Serbian | x | |||
| srp_latn | Serbian - Latin | x | |||
| sun | Sundanese | x | |||
| swa | Swahili | x | |||
| swe | Swedish | x | |||
| syr | Syriac | x | |||
| tam | Tamil | x | |||
| tat | Tatar | x | |||
| tel | Telugu | x | |||
| tgk | Tajik | x | |||
| tgl | Tagalog | x | |||
| tha | Thai | x | |||
| tir | Tigrinya | x | |||
| ton | Tonga | x | |||
| tur | Turkish | x | |||
| uig | Uighur; Uyghur | x | |||
| ukr | Ukrainian | x | |||
| urd | Urdu | x | |||
| uzb | Uzbek | x | |||
| uzb_cyrl | Uzbek - Cyrilic | x | |||
| vie | Vietnamese | x | |||
| yid | Yiddish | x | |||
| yor | Yoruba | x |
Old wiki - no longer maintained. The pages were moved, see the new documentation.
These wiki pages are no longer maintained.
All pages were moved to tesseract-ocr/tessdoc.
The latest documentation is available at https://tesseract-ocr.github.io/.