• home contact us
  • who we are
  • chanllenges we solve
  • markets we serve
  • technologies we provide
  • who we are
  • Home
  • | Main
  • | Value
  • | Technology
  • | Support

Technology Overview

Latent Semantic Indexing (LSI)

Concept Search

Concept-Based Categorization

Dynamic Clustering

Email Thread Analysis

Near-Duplicate Document Identification

Difference Highlighting

Instant Context

Language Analytics

Automatic Summarization

CAAT Software Developers Kit

CONCEPT SEARCH

Just what is "Concept Search?" We are asked that question a lot, particularly as people become more frustrated with the limitations of traditional search engines. Concept Search is:

Search Based on Meaning, not spelling.

Boolean search finds documents based on specific shared character strings – if your search term doesn't exist in a document or search engine dictionaries, you never see it. Concept Search is different – it returns relevant documents regardless of shared terms or even a common language. It gracefully handles degraded documents like those processed by OCR where Boolean search quickly breaks down.

Organic not Artificial

Most search engines today enhance their search with spelling, thesaurus and ontology-based dictionaries and NLP modules to improve finding documents. These methods based on human-coded mappings of terms can be subjective, narrow, and/or incomplete because they are not up to date or do not understand the context of the subject manner. Concept Search derives meaning from documents organically through a rigorous mathematical analysis of the relationship between terms across documents. It requires no outside assistance from people or dictionaries.

Dynamic not Static

As the meaning of terms change, Concept Search changes with them. Human-coded dictionaries usually can't keep up or require constant costly maintenance which for most applications is not done.

This capability is at the heart of CAAT. CAAT can automatically associate any search string – from a few words to an entire book – to all conceptually similar documents contained in a given index. It can then provide a direct measure of the conceptual similarity of this original search string to all the other documents in that index – from most similar to most dissimilar. It accomplishes this via advanced vector geometry and mathematics, so CAAT never needs lists of synonyms, keywords, or dictionaries to find relevant documents.

Many of Content Analyst technology functions leverage conceptual analysis and comparison capabilities. A wide variety of experiments have demonstrated that the technology in CAAT accurately captures conceptual meaning and that proximity of objects or documents in that geometric "conceptual space" is a remarkably valid measure for conceptual similarity without any additional human intervention.

 



Copyright 2012 Content Analyst Company, LLC All rights reserved.

  • COMPANY
  • About Us
  • Press Releases
  • Careers
  • Contact Us
  • PARTNERS
  • Partners
  • Approach
  • Get Started
  • Content Care
  • MARKETS
  • Legal
  • Intelligence
  • Brand Research
  • Compliance
  • Data LP
  • Forensics
  • SOLUTIONS
  • Value
  • Technology
  • Support
  • References