Automatic building of new field association word candidates using search engine

Authors:
El-Sayed Atlam;Ghada Elmarhomy;Kazuhiro Morita;Masao Fuketa;Jun-ichi Aoe
Affiliations:
Department of Statistics and Computer Science, Tanta University, Egypt and Department of Information Science and Intelligent Systems, University of Tokushima, Tokushima, Japan;Department of Information Science and Intelligent Systems, University of Tokushima, Tokushima, Japan;Department of Information Science and Intelligent Systems, University of Tokushima, Tokushima, Japan;Department of Information Science and Intelligent Systems, University of Tokushima, Tokushima, Japan;Department of Information Science and Intelligent Systems, University of Tokushima, Tokushima, Japan
Venue:
Information Processing and Management: an International Journal
Year:
2006

Citing 8

Models for retrieval with probabilistic indexing
Information Processing and Management: an International Journal - Modeling data, information and knowledge

Passage-level evidence in document retrieval
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval

Passage retrieval revisited
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval

Passage retrieval: a probabilistic technique
Information Processing and Management: an International Journal

Introduction to Modern Information Retrieval

A new method for selecting English field association terms of compound words and its knowledge representation
Information Processing and Management: an International Journal

Documents similarity measurement using field association terms
Information Processing and Management: an International Journal

An automatic clustering of articles using dictionary definitions
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1

Cited 12

Improvement of building field association term dictionary using passage retrieval
Information Processing and Management: an International Journal

Ranking of field association terms using Co-word analysis
Information Processing and Management: an International Journal

Estimation of FAQ knowledge bases by using semantic expressions for questions and answers
International Journal of Computer Applications in Technology

Accuracy improvement for a voice recognition using field association knowledge
International Journal of Computer Applications in Technology

An automatic extraction method of word tendency judgement for specific subjects
International Journal of Computer Applications in Technology

Relevant estimation among fields using field association words
International Journal of Computer Applications in Technology

Intelligent QA Systems Using Semantic Expressions
KES '09 Proceedings of the 13th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems: Part II

Building of field association terms based on links
International Journal of Computer Applications in Technology

New approach for field association term dictionary with passage retrieval
ACMOS'07 Proceedings of the 9th WSEAS international conference on Automatic control, modelling and simulation

Estimation of FAQ knowledge bases by introducing measurements
KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II

A new approach for improving field association term dictionary using passage retrieval
KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II

A new approach for automatic building field association words using selective passage retrieval
KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II

Abstract

With the increasing popularity of the Internet and the tremendous amount of on-line text, automatic document classification is important for organizing huge amounts of data. Readers can identify the subject field of many documents by reading only some specific Field Association (FA) words. Document fields can be decided efficiently if there are many FA words and if their frequency rate is high. This paper proposes a method for automatically building new FA words. A WWW search engine is used to extract FA word candidates from document corpora. New FA word candidates in each field are automatically compared with previously determined FA words. Then new FA words are appended to an FA word dictionary. In the experimental results, our new system automatically appends around 44% of new FA words to the existing FA word dictionary. Moreover, a concentration ratio of 0.9 is also effective for extracting the relevant FA words that are needed for the system design to build FA words automatically.
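The candidate-filtering step described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not define the concentration ratio, so the sketch assumes it is a word's frequency in its dominant field divided by its total frequency across all fields. The function names, the field names, and the sample counts are all hypothetical, and the counts stand in for whatever a search engine would return.

```python
from collections import Counter


def concentration_ratios(field_counts):
    """Map each word to (dominant_field, ratio).

    ASSUMPTION: ratio = frequency in the dominant field / total frequency
    over all fields. The abstract does not give the exact definition.
    """
    totals = Counter()
    for counts in field_counts.values():
        totals.update(counts)

    ratios = {}
    for word, total in totals.items():
        # A Counter returns 0 for missing words, so every field is comparable.
        dominant = max(field_counts, key=lambda f: field_counts[f][word])
        ratios[word] = (dominant, field_counts[dominant][word] / total)
    return ratios


def select_new_fa_words(field_counts, dictionary, threshold=0.9):
    """Keep candidates at or above the threshold that are not yet in the
    existing FA word dictionary for their dominant field."""
    new_words = {}
    for word, (field, ratio) in concentration_ratios(field_counts).items():
        if ratio >= threshold and word not in dictionary.get(field, set()):
            new_words.setdefault(field, set()).add(word)
    return new_words


# Hypothetical per-field word frequencies (illustrative data only).
field_counts = {
    "sports": Counter({"goalkeeper": 19, "match": 30, "score": 12}),
    "politics": Counter({"election": 25, "match": 5, "goalkeeper": 1}),
}
# Hypothetical existing FA word dictionary.
dictionary = {"sports": {"score"}, "politics": set()}

new_words = select_new_fa_words(field_counts, dictionary)
print(new_words)
```

With this sample data, "match" is rejected because it is spread across both fields (30 of 35 occurrences, below 0.9), and "score" is skipped because it is already in the dictionary.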