MULTI-LINGUAL AND CROSS-LINGUAL TEXT ANALYSIS

multi-lingualWithin any single language, Content Analyst technology can be applied to any topic, vocabulary or language that can be represented in the Unicode® encoding system.

Content Analyst technology can also work across the languages most frequently encountered in security applications (e.g., users can submit queries in English while searching documents in other languages). Languages currently supported for cross-lingual retrieval and analysis are:

Arabic
Chinese
Danish
English
French
German

Greek
Finnish
Italian
Japanese
Korean

Dutch
Portuguese
Russian
Spanish
Swedish

Other text variables accommodated by Content Analyst technology include:

• Synonyms
• Name variations
• Transliteration differences
• Nicknames
• Aliases

 

spotlight