Another stemmer

Authors: Chris D. Paice
Venue: ACM SIGIR Forum
Year: 1990

Citing 0
Cited 52

An evaluation method for stemming algorithms

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Efficient searching in distributed digital libraries

Proceedings of the third ACM conference on Digital libraries
Hierarchical indexing and document matching in BoW

Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
An algorithm for term conflation based on tree structures

Journal of the American Society for Information Science and Technology
Efficient stemmer generation

Information Processing and Management: an International Journal
The Effectiveness of a Graph-Based Algorithm for Stemming

ICADL '02 Proceedings of the 5th International Conference on Asian Digital Libraries: Digital Libraries: People, Knowledge, and Technology
Automatic Language-Specific Stemming in Information Retrieval

CLEF '00 Revised Papers from the Workshop of Cross-Language Evaluation Forum on Cross-Language Information Retrieval and Evaluation
NLPIR: a theoretical framework for applying natural language processing to information retrieval

Journal of the American Society for Information Science and Technology
Strength and similarity of affix removal stemming algorithms

ACM SIGIR Forum
Building an inflectional stemmer for Bulgarian

CompSysTech '03 Proceedings of the 4th international conference on Computer systems and technologies: e-Learning
Arabic morphological analysis techniques: a comprehensive survey

Journal of the American Society for Information Science and Technology
A probabilistic model for stemmer generation

Information Processing and Management: an International Journal - Special issue: An Asian digital libraries perspective
Unisys: description of the Unisys system used for MUC-3

MUC3 '91 Proceedings of the 3rd conference on Message understanding
Utterance classification in AutoTutor

HLT-NAACL-EDUC '03 Proceedings of the HLT-NAACL 03 workshop on Building educational applications using natural language processing - Volume 2
Clustering web images using association rules, interestingness measures, and hypergraph partitions

ICWE '06 Proceedings of the 6th international conference on Web engineering
Ontology based text indexing and querying for the semantic web

Knowledge-Based Systems
Design, implementation, and evaluation of a methodology for automatic stemmer generation

Journal of the American Society for Information Science and Technology
EXTRACTING RELATIONS AMONG EMBEDDED SOFTWARE DESIGN PATTERNS

Journal of Integrated Design & Process Science
Autonomous authoring tools for hypertext

ACM Computing Surveys (CSUR)
Comparing tagging vocabularies among four enterprise tag-based services

Proceedings of the 2007 international ACM conference on Supporting group work
Non-linear correlation of content and metadata information extracted from biomedical article datasets

Journal of Biomedical Informatics
An Architecture for Hybrid P2P Free-Text Search

CIA '07 Proceedings of the 11th international workshop on Cooperative Information Agents XI
Discovering Knowledge in a Large Organization through Support Vector Machines

ICCS '08 Proceedings of the 8th international conference on Computational Science, Part III
Information retrieval from digital libraries in SQL

Proceedings of the 10th ACM workshop on Web information and data management
Towards an error-free Arabic stemming

Proceedings of the 2nd ACM workshop on Improving non english web searching
PHIRST: A distributed architecture for P2P information retrieval

Information Systems
A document classification and retrieval system for R&D in semiconductor industry - A hybrid approach

Expert Systems with Applications: An International Journal
Using Stemming Algorithms on a Grid Environment

High Performance Computing for Computational Science - VECPAR 2008
A Case Study of Using Domain Engineering for the Conflation Algorithms Domain

ICSR '09 Proceedings of the 11th International Conference on Software Reuse: Formal Foundations of Reuse and Domain Engineering
An information extraction approach to reorganizing and summarizing specifications

Information and Software Technology
A lexicon-based stemming procedure

PROPOR'03 Proceedings of the 6th international conference on Computational processing of the Portuguese language
Hopfilter: an agent for filtering web pages based on the hopfield artificial neural network model

BNCOD'07 Proceedings of the 24th British national conference on Databases
Personalized web page filtering using a hopfield neural network

ICANN'07 Proceedings of the 17th international conference on Artificial neural networks
Digitization of Indian literature: problem and solution

Proceedings of the 1st Amrita ACM-W Celebration on Women in Computing in India
An efficient mechanism for stemming and tagging: the case of Greek language

KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part III
Merging domain ontologies based on the WordNet system and Fuzzy Formal Concept Analysis techniques

Applied Soft Computing
Classifying with co-stems: a new representation for information filtering

ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
An unsupervised method to improve Spanish stemmer

NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
Methods and algorithms for automatic text analysis

Automatic Documentation and Mathematical Linguistics
Assessing the impact of stemming accuracy on information retrieval

PROPOR'10 Proceedings of the 9th international conference on Computational Processing of the Portuguese Language
Term graph model for text classification

ADMA'05 Proceedings of the First international conference on Advanced Data Mining and Applications
Relation analysis among patterns on software development process

PROFES'05 Proceedings of the 6th international conference on Product Focused Software Process Improvement
A generalization of the method for evaluation of stemming algorithms based on error counting

SPIRE'05 Proceedings of the 12th international conference on String Processing and Information Retrieval
Mining writeprints from anonymous e-mails for forensic investigation

Digital Investigation: The International Journal of Digital Forensics & Incident Response
Selecting corpus-semantic models for neurolinguistic decoding

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Combining vector space model and multi word term extraction for semantic query expansion

NLDB'07 Proceedings of the 12th international conference on Applications of Natural Language to Information Systems
Enhancing malay stemming algorithm with background knowledge

PRICAI'12 Proceedings of the 12th Pacific Rim international conference on Trends in Artificial Intelligence
Mining Criminal Networks from Chat Log

WI-IAT '12 Proceedings of the 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
A graph-based approach to commonsense concept extraction and semantic similarity detection

Proceedings of the 22nd international conference on World Wide Web companion
Commonsense-based topic modeling

Proceedings of the Second International Workshop on Issues of Sentiment Discovery and Opinion Mining
An ontology-based architecture for natural language access to relational databases

UAHCI'13 Proceedings of the 7th international conference on Universal Access in Human-Computer Interaction: design methods, tools, and interaction techniques for eInclusion - Volume Part I
Gesture-based control of the 3D visual representation of document collections for exploration and search

Information Services and Use - Mining the Digital Information Networks

Abstract

In natural language processing, conflation is the process of merging or lumping together nonidentical words which refer to the same principal concept. This can relate both to words which are entirely different in form (e.g., "group" and "collection"), and to words which share some common root (e.g., "group", "grouping", "subgroups"). In the former case the words can only be mapped by referring to a dictionary or thesaurus, but in the latter case use can be made of the orthographic similarities between the forms. One popular approach is to remove affixes from the input words, thus reducing them to a stem; if this could be done correctly, all the variant forms of a word would be converted to the same standard form. Since the process is aimed at mapping for retrieval purposes, the stem need not be a linguistically correct lemma or root (see also Frakes 1982).
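
The affix-removal idea described in the abstract can be illustrated with a minimal sketch. This is not the rule table or procedure from the paper: the affix lists, the `MIN_STEM` guard, and the function names `stem` and `conflate` are assumptions chosen only to cover the abstract's own example words.

```python
# Toy affix-removal conflation. The affix lists below are illustrative
# assumptions, not the rules used by any published stemmer.
PREFIXES = ("sub",)
SUFFIXES = ("ings", "ing", "s")  # checked longest first
MIN_STEM = 3  # hypothetical guard against stripping very short words


def stem(word: str) -> str:
    """Strip at most one listed prefix and one listed suffix from a word."""
    w = word.lower()
    for prefix in PREFIXES:
        if w.startswith(prefix) and len(w) - len(prefix) >= MIN_STEM:
            w = w[len(prefix):]
            break
    for suffix in SUFFIXES:
        if w.endswith(suffix) and len(w) - len(suffix) >= MIN_STEM:
            w = w[: -len(suffix)]
            break
    return w


def conflate(words):
    """Group words that reduce to the same stem."""
    groups = {}
    for word in words:
        groups.setdefault(stem(word), []).append(word)
    return groups


if __name__ == "__main__":
    print(conflate(["group", "grouping", "subgroups", "collection"]))
```

Under these assumed rules, "group", "grouping", and "subgroups" fall into one class, while "collection" stays separate, since words that differ entirely in form can only be merged through a dictionary or thesaurus. The sketch also shows why the stem need not be a real word: the same rules reduce "subject" to "ject", which is acceptable for matching only if every variant maps to the same string.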