TECHNOLOGIES WE PROVIDE

Applying Text Analytics To Make Sense of Unstructured Data

technologies we provideThe sheer volume of data now available to companies and government organizations challenges their ability to act quickly in a “sense and respond” environment.
Content Analyst technology is a powerful example of a new class of technologies known as “Text Analytics” designed to transform large volumes of unstructured data into relevant actionable information. Text Analytics products automate most of the human activity traditionally associated with understanding, organizing, prioritizing and retrieving information from large sources of unstructured data.

While a variety of companies have developed applications with limited Text Analytics features, Content Analyst technology is fulfilling the promise of Text Analytics, giving organizations the ability to organize, access, and share information across multiple languages without the need for extensive human intervention.

Content Analyst Technology Provides Powerful Text Analytics Capabilities:

Integrated Search—Addresses the question: “How do I know what I don’t know?” Content Analyst technology applies text queries to document repositories to find information that has been overlooked.

Categorization—Allows users to define categories by means of examples. Based on the exemplars, Content Analyst technology automatically assigns incoming documents to categories.

Contextual Explanation—Helps users understand unfamiliar terminology. The user clicks on an unfamiliar term, and Content Analyst technology highlights similar terms found in related text.

Conceptual Comparison—Automatically maps text (ranging from a single word to an entire book) into an appropriate point in a conceptual representation space, providing a direct measure of the similarity of any two items represented in that space. All Content Analyst technology functions hinge on this capability.

Multi-Lingual and Cross-Lingual Text Analysis—Within any single language, Content Analyst technology can be applied to any topic, vocabulary or language that can be represented in the Unicode encoding system. Users can submit queries in English while searching documents in other languages.

Name Tracking and Disambiguation—Track an individual who may use multiple names, nicknames or aliases.

Relationship Discovery—Provides unique capabilities for the discovery of subtle relationships. For example, Content Analyst technology can aid in identifying the use of aliases by individuals.

Summarization—Automatically identifies sentences in a document that best represent key concepts, and uses those sentences to give users a quick summary of the entire document.

Taxonomy Generation—Allows a user to select a set of documents and then create a taxonomy for those documents in a totally data-driven fashion.

Software Developers KitContent Analyst’s Software Developers Kit (SDK) has been designed to support powerful integrated applications dealing with unstructured information – from today’s text requirements to tomorrow’s composite applications – in a set of powerful tools and services.

Content Analyst’ Patented Latent Semantic Indexing (LSI) Technology

 

spotlight