OCR Language Support

The Optical Character Recognition (OCR) engine in Eggplant Functional can read text in many languages. See Supported Language Properties for the list of language properties that OCR supports. You can pass any of these properties as the Language parameter of an OCR search in an Eggplant Functional script. For the languages that have full built-in dictionary support, the OCR engine provides its own system dictionaries.

Examples:

log ReadText(("TLImage","BRImage"), Language:"French") -- "TLImage" and "BRImage" are captured images that mark the top-left and bottom-right corners of the search rectangle.

Click (Text:"Aubergine", Language:"French")

Supported Language Properties

Abkhaz Faeroese Lak Rundi
Adyghe Fijian Lappish Russian *
Afrikaans Finnish * Latin * RussianOldSPelling *
Agul French * Latvian * RussianWithAccent *
Albanian Frisian Lezgin Samoan
Altaic Friulian Lithuanian * Selkup
ArmenianEastern * GaelicScottish Luba SerbianCyrillic
ArmenianGrabar * Gagauz Macedonian SerbianLatin
ArmenianWestern * Galician Malagasy Shona
Awar Ganda Malay Sioux (Dakota)
Aymara German * Malinke Slovak *
AzeriCyrillic GermanNewSpelling * Maltese Slovenian *
AzeriLatin * GermanLuxembourg Mansi Somali
Bashkir * Greek * Maori Sorbian
Basque Guarani Mari Sotho
Belarusian Hani Maya Spanish *
Bemba Hausa Miao Sunda
Blackfoot Hawaiian Minankabaw Swahili
Breton Hungarian * Mixed (Russian and English) * Swazi
Bugotu Icelandic Mohawk Swedish *
Bulgarian * Ido Moldavian Tabassaran
Buryat Indonesian * Mongol Tagalog
Catalan * Ingush Mordvin Tahitian
Chamorro Interlingua Nahuatl Tajik
Chechen Irish Nenets Tatar *
ChinesePRC Italian * Nivkh Tinpo (Jingpo)
ChineseTaiwan Japanese * Nogay Tongan
Chukcha Japanese+English * Norwegian (NorwegianNynorsk and NorwegianBokmal) * Tswana
Chuvash Kabardian NorwegianBokmal * Tun
Corsican Kalmyk NorwegianNynorsk * Turkish *
CrimeanTatar KarachayBalkar Nyanja Turkmen
Croatian * Karakalpak Occidental TurkmenLatin
Crow Kasub Ojibway Tuvin
Czech * Kawa Ossetic Udmurt
Danish * Kazakh Papiamento UighurCyrillic
Dargwa Khakas PidginEnglish (Tok Pisin language) UighurLatin
Dungan Khanty Polish * Ukrainian *
Dutch * Kikuyu PortugueseBrazilian * UzbekCyrillic
DutchBelgian Kirgiz PortugueseStandard * UzbekLatin
English * Kongo Provencal Visayan (Cebuano)
EskimoCyrillic Korean * Quechua Welsh
EskimoLatin KoreanHangul * RhaetoRomanic Wolof
Esperanto Koryak Romanian * Xhosa
Estonian * Kpelle RomanianMoldavia Yakut
Even Kumyk Romany Zapotec
Evenki Kurdish Ruanda Zulu

* Denotes Full Dictionary Support

Note: These predefined language properties are case-sensitive.

Eggplant Functional scripts also recognize the keywords listed in Other Supported Keywords as predefined language properties.

Other Supported Keywords

Basic CMC7 E13B Pascal
C++ Cobol Fortran OCRA
Chemistry Digits Java OCRB
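
These keywords go in the same Language parameter as the language names. A minimal sketch, assuming the search rectangle contains only numerals and that "TLImage" and "BRImage" are captured images like those in the earlier example:

```sensetalk
-- Read a numeric field using the Digits keyword in place of a language name.
-- "TLImage" and "BRImage" are assumed captured images that mark the
-- top-left and bottom-right corners of the search rectangle.
log ReadText(("TLImage","BRImage"), Language:"Digits")
```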

Custom OCR Dictionaries

In addition to selecting specific languages, you can use SenseTalk properties to customize the OCR engine dictionary. You can add specific words that you want text searches to recognize, and you can list words that you want to prohibit the OCR engine from recognizing.

For complete information about creating a custom dictionary, see Customize the OCR Engine Dictionary.
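
As a rough sketch of what such customization can look like, the lines below assume the dictionary properties are named ExtraWords and ProhibitedWords; confirm the exact property names and value formats in Customize the OCR Engine Dictionary before relying on them. The words and the "TLImage" and "BRImage" captured images are placeholders.

```sensetalk
-- Assumed property: ExtraWords adds words that the OCR engine should recognize.
log ReadText(("TLImage","BRImage"), Language:"French", ExtraWords:"Aubergine")

-- Assumed property: ProhibitedWords lists words that the OCR engine should not return.
log ReadText(("TLImage","BRImage"), Language:"French", ProhibitedWords:"Auberge")
```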

This topic was last updated on October 11, 2019, at 11:18:55 AM.

Copyright © 2019 Eggplant