Building Search Applications: Lucene, Lingpipe, and Gate

  • Authors:
  • Manu Konchady

  • Affiliations:
  • -

  • Venue:
  • Building Search Applications: Lucene, Lingpipe, and Gate
  • Year:
  • 2008

Abstract

Lucene, LingPipe, and Gate are popular open source tools for building powerful search applications. Building Search Applications describes the Lucene functions used to build search engines, including indexing, searching, ranking, and spelling correction. It also shows how to use LingPipe and Gate to extract meaning from text and make search applications more useful. With this book you will learn to:

  • Extract tokens from text using custom tokenizers and analyzers from Lucene, LingPipe, and Gate.
  • Construct a search engine index with an optional backend database to manage large document collections.
  • Explore the wide range of Lucene queries to search an index, understand the ranking algorithm for a query, and suggest spelling corrections.
  • Find the names of people, places, and other entities in text using LingPipe and Gate.
  • Categorize documents by topic using classifiers and build groups of self-organized documents using clustering algorithms from LingPipe.
  • Create a Web crawler with Nutch to scan the Web, an intranet, or a desktop.
  • Track the sentiment of articles published on the Web with LingPipe.
  • Detect plagiarism of documents using a registered document collection.