Evaluating text categorization
HLT '91 Proceedings of the workshop on Speech and Natural Language
Automated learning of decision rules for text categorization
ACM Transactions on Information Systems (TOIS)
The nature of statistical learning theory
The nature of statistical learning theory
Automatic condensation of electronic publications by sentence selection
Information Processing and Management: an International Journal - Special issue: summarizing text
Inductive learning algorithms and representations for text categorization
Proceedings of the seventh international conference on Information and knowledge management
CACTUS—clustering categorical data using summaries
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
A re-examination of text categorization methods
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
The decomposition of human-written summary sentences
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Finding out about: a cognitive perspective on search engine technology and the WWW
Finding out about: a cognitive perspective on search engine technology and the WWW
Lightweight Document Matching for Help-Desk Applications
IEEE Intelligent Systems
Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval
ECML '98 Proceedings of the 10th European Conference on Machine Learning
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
A text categorization based on summarization technique
RANLPIR '00 Proceedings of the ACL-2000 workshop on Recent advances in natural language processing and information retrieval: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 11
Web-page classification through summarization
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
A study on automatically extracted keywords in text categorization
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Cross-document event clustering using knowledge mining from co-reference chains
Information Processing and Management: an International Journal - Special issue: AIRS2005: Information retrieval research in Asia
Noise reduction through summarization for Web-page classification
Information Processing and Management: an International Journal
Just-in-time contextual advertising
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
A novel efficient classification algorithm for search engines
AIC'08 Proceedings of the 8th conference on Applied informatics and communications
Exploiting neighborhood knowledge for single document summarization and keyphrase extraction
ACM Transactions on Information Systems (TOIS)
Summarizing microblogs automatically
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
A risk minimization framework for extractive speech summarization
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Summarization as feature selection for document categorization on small datasets
IceTAL'10 Proceedings of the 7th international conference on Advances in natural language processing
A K-mixture connective-strength-based approach to automatic text summarisation
International Journal of Intelligent Systems Technologies and Applications
COMPUTE '11 Proceedings of the Fourth Annual ACM Bangalore Conference
Web Page Summarization for Just-in-Time Contextual Advertising
ACM Transactions on Intelligent Systems and Technology (TIST)
Cross document event clustering using knowledge mining from co-reference chains
AIRS'05 Proceedings of the Second Asia conference on Asia Information Retrieval Technology
Document similarity search based on generic summaries
AIRS'05 Proceedings of the Second Asia conference on Asia Information Retrieval Technology
Importance of HTML structural elements and metadata in automated subject classification
ECDL'05 Proceedings of the 9th European conference on Research and Advanced Technology for Digital Libraries
Hi-index | 0.00 |
We address the problem of evaluating the effectiveness of summarization techniques for the task of document categorization. It is argued that for a large class of automatic categorization algorithms, extraction-based document categorization can be viewed as a particular form of feature selection performed on the full text of the document and, in this context, its impact can be compared with state-of-the-art feature selection techniques especially devised to provide good categorization performance. Such a framework provides for a better assessment of the expected performance of a categorizer if the compression rate of the summarizer is known.