Language Detection (Google Service)
Detecting language used in any field of given inputs
About
In this language detection, Google Translate Service is utilized to detect the language used in any field from the given input. It is possible to use this component both in real-time and batch processing.
Adding to your Dynamic Pipeline
This Component can be added to your Dynamic pipelines through the "Language Detection (Google Service)" component. It requires the following fields for configuration:
Destination Path (Required)
: the metadata field in which the ISO-3166-1 alpha-2 code will be outputted to. This will be the detected language. The field can be an existing field, or the component can create a new field for this functionality.Main (Required)
: This is the input field that the Language Detection will use as a source. By default, the content.body field will be used, however any field can be chosen as an output.
Compatible Languages
The language coverage is continuously improved as this component uses Google Translate API in the back end. Referring to https://cloud.google.com/translate/docs/languages the language coverage is:
Language | Language ID (ISO 3166 code) |
---|---|
Afrikaans | AF |
Albanian | SQ |
Amharic | AM |
Arabic | AR |
Armenian | HY |
Assamese | AS |
Aymara | AY |
Azerbaijani | AZ |
Bambara | BM |
Basque | EU |
Belarusian | BE |
Bengali | BN |
Bhojpuri | HO |
Bosnian | BS |
Bulgarian | BG |
Catalan | CA |
Cebuano | EB |
Chinese | ZH |
Corsican | CO |
Croatian | HR |
Czech | CS |
Danish | DA |
Dhivehi | DV |
Dogri | OI |
Dutch | NL |
English | EN |
Esperanto | EO |
Estonian | ET |
Ewe | EE |
Filipino | IL |
Finnish | FI |
French | FR |
Frisian | FY |
Galician | GL |
Georgian | KA |
German | DE |
Greek | EL |
Guarani | GN |
Gujarati | GU |
Haitian | HT |
Hausa | HA |
Hawaiian | AW |
Hebrew | IW |
Hindi | HI |
Hmong | MN |
Hungarian | HU |
Icelandic | IS |
Igbo | IG |
Ilocano | LO |
Indonesian | ID |
Irish | GA |
Italian | IT |
Japanese | JA |
Javanese | JW |
Kannada | KN |
Kazakh | KK |
Khmer | KM |
Kinyarwanda | RW |
Konkani | OM |
Korean | KO |
Krio | RI |
Kurdish | KU |
Kurdish | KB |
Kyrgyz | KY |
Lao | LO |
Latin | LA |
Latvian | LV |
Lingala | LN |
Lithuanian | LT |
Luganda | LG |
Luxembourgish | LB |
Macedonian | MK |
Maithili | AI |
Malagasy | MG |
Malay | MS |
Malayalam | ML |
Maltese | MT |
Maori | MI |
Marathi | MR |
Meiteilon | EI |
Mizo | US |
Mongolian | MN |
Myanmar | MY |
Nepali | NE |
Norwegian | NO |
Nyanja | NY |
Odia | OR |
Oromo | OM |
Pashto | PS |
Persian | FA |
Polish | PL |
Portuguese | PT |
Punjabi | PA |
Quechua | QU |
Romanian | RO |
Russian | RU |
Samoan | SM |
Sanskrit | SA |
Scots | GD |
Sepedi | SO |
Serbian | SR |
Sesotho | ST |
Shona | SN |
Sindhi | SD |
Sinhala | SI |
Slovak | SK |
Slovenian | SL |
Somali | SO |
Spanish | ES |
Sundanese | SU |
Swahili | SW |
Swedish | SV |
Tagalog | TL |
Tajik | TG |
Tamil | TA |
Tatar | TT |
Telugu | TE |
Thai | TH |
Tigrinya | TI |
Tsonga | TS |
Turkish | TR |
Turkmen | TK |
Twi | AK |
Ukrainian | UK |
Urdu | UR |
Uyghur | UG |
Uzbek | UZ |
Vietnamese | VI |
Welsh | CY |
Xhosa | XH |
Yiddish | YI |
Yoruba | YO |
Zulu | ZU |
Usage in Search API
This Operation allows a user to specify the destination field, source fields, and separator.
{
"query": {
...
},
"operations": [
{
"name": "detect_language",
"destination_path": "operations.language",
"parameters": {
"main": "content.body"
}
}
]
}
Updated 5 months ago