• home contact us
  • who we are
  • chanllenges we solve
  • markets we serve
  • technologies we provide
  • who we are
  • Home
  • | Main
  • | Value
  • | Technology
  • | Support

Technology Overview

Latent Semantic Indexing (LSI)

Concept Search

Concept-Based Categorization

Dynamic Clustering

Email Thread Analysis

Near-Duplicate Document Identification

Difference Highlighting

Instant Context

Language Analytics

Automatic Summarization

CAAT Software Developers Kit

LATENT SEMANTIC INDEXING (LSI) TECHNOLOGY

Latent Semantic Indexing (LSI) is the machine-learning technique that enables CAAT technology to identify, represent, and compare concepts existing within a collection of documents or data.

LSI is a mathematical approach to text analytics. It is designed to extract every contextual relation among every term in every text object within a collection. It then generates a vector space representation of all terms based on those relations. Within that space, proximity is a good indicator of conceptual similarity. The result: similarities can be identified based on concepts within the material. LSI is mathematics-based, and there are no word lists, taxonomies, or thesauruses required for CAAT to accurately identify conceptually similar text.

The Content Analyst team has evolved the original concept of LSI to create major extensions and refinements to the technology and the company now holds a number of patents related to text analytics.

Refer to our White Papers Section to read more about LSI.

 

 



Copyright 2012 Content Analyst Company, LLC All rights reserved.

  • COMPANY
  • About Us
  • Press Releases
  • Careers
  • Contact Us
  • PARTNERS
  • Partners
  • Approach
  • Get Started
  • Content Care
  • MARKETS
  • Legal
  • Intelligence
  • Brand Research
  • Compliance
  • Data LP
  • Forensics
  • SOLUTIONS
  • Value
  • Technology
  • Support
  • References