C4.5: programs for machine learning
C4.5: programs for machine learning
Some advances in transformation-based part of speech tagging
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Integrating automatic genre analysis into digital libraries
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
Learning Subjective Adjectives from Corpora
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Mining the peanut gallery: opinion extraction and semantic classification of product reviews
WWW '03 Proceedings of the 12th international conference on World Wide Web
Classifying racist texts using a support vector machine
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Mining and summarizing customer reviews
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Thumbs up?: sentiment classification using machine learning techniques
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Question terminology and representation for question type classification
COMPUTERM '02 COLING-02 on COMPUTERM 2002: second international workshop on computational terminology - Volume 14
Stylistic text classification using functional lexical features: Research Articles
Journal of the American Society for Information Science and Technology
Filtering product reviews from web search results
Proceedings of the 2007 ACM symposium on Document engineering
Ontology-supported polarity mining
Journal of the American Society for Information Science and Technology
Opinion Mining and Sentiment Analysis
Foundations and Trends in Information Retrieval
A survey on sentiment detection of reviews
Expert Systems with Applications: An International Journal
Improving product review search experiences on general search engines
Proceedings of the 11th International Conference on Electronic Commerce
Effectiveness of web search results for genre and sentiment classification
Journal of Information Science
Automatic classification of web search results: product review vs. non-review documents
ICADL'07 Proceedings of the 10th international conference on Asian digital libraries: looking back 10 years and forging new frontiers
Cuisine: Classification using stylistic feature sets and-or name-based feature sets
Journal of the American Society for Information Science and Technology
Locational relativity and domain constraints in spatial questions
Proceedings of the 20th International Conference on Advances in Geographic Information Systems
Hi-index | 0.00 |
The World Wide Web is a vast repository of information, but the sheer volume makes it difficult to identify useful documents. We identify document genre is an important factor in retrieving useful documents and focus on the novel document genre dimension of subjectivity. We investigate three approaches to automatically classifying documents by genre: traditional bag of words techniques, part-of-speech statistics, and hand-crafted shallow linguistic features. We are particularly interested in domain transfer: how well the learned classifiers generalize from the training corpus to a new document corpus. Our experiments demonstrate that the part-of-speech approach is better than traditional bag of words techniques, particularly in the domain transfer conditions.