An evaluation of phrasal and clustered representations on a text categorization task
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Little words can make a big difference for text classification
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
An interactive system for finding complementary literatures: a stimulus to scientific discovery
Artificial Intelligence - Special issue on scientific discovery
Inductive learning algorithms and representations for text categorization
Proceedings of the seventh international conference on Information and knowledge management
Foundations of statistical natural language processing
Foundations of statistical natural language processing
A re-examination of text categorization methods
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Analyzing the effectiveness and applicability of co-training
Proceedings of the ninth international conference on Information and knowledge management
A vector space model for automatic indexing
Communications of the ACM
Text databases & document management
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Information Retrieval
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Better Rules, Few Features: A Semantic Approach to Selecting Features from Text
ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
A Brief Introduction to Boosting
IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Text Mining: A New Frontier for Lossless Compression
DCC '99 Proceedings of the Conference on Data Compression
Machine learning for information extraction in informal domains
Machine learning for information extraction in informal domains
A Comparison of Word- and Sense-Based Text Categorization Using Several Classification Algorithms
Journal of Intelligent Information Systems
Measuring praise and criticism: Inference of semantic orientation from association
ACM Transactions on Information Systems (TOIS)
The Journal of Machine Learning Research
Predicting the semantic orientation of adjectives
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Text segmentation based on similarity between words
ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Unsupervised word sense disambiguation rivaling supervised methods
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Multi-paragraph segmentation of expository text
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Syntactic approaches to automatic book indexing
ACL '88 Proceedings of the 26th annual meeting on Association for Computational Linguistics
Towards automatic extraction of monolingual and bilingual terminology
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Surface grammatical analysis for the extraction of terminological noun phrases
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
Text Classification by Boosting Weak Learners based on Terms and Concepts
ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Report on KDD conference 2004 panel discussion can natural language processing help text mining?
ACM SIGKDD Explorations Newsletter
Extracting knowledge from evaluative text
Proceedings of the 3rd international conference on Knowledge capture
Text mining and natural language processing: introduction for the special issue
ACM SIGKDD Explorations Newsletter - Natural language processing and text mining
Mining knowledge from text using information extraction
ACM SIGKDD Explorations Newsletter - Natural language processing and text mining
Semantic similarity methods in wordNet and their application to information retrieval on the web
Proceedings of the 7th annual ACM international workshop on Web information and data management
Similarity measures for tracking information flow
Proceedings of the 14th ACM international conference on Information and knowledge management
Information Extraction: Distilling Structured Data from Unstructured Text
Queue - Social Computing
Thumbs up?: sentiment classification using machine learning techniques
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Integrating Unstructured Data into Relational Databases
ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Mining semantically related terms from biomedical literature
ACM Transactions on Asian Language Information Processing (TALIP)
Background knowledge for ontology construction
Proceedings of the 15th international conference on World Wide Web
Identifying comparative sentences in text documents
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Tapping the power of text mining
Communications of the ACM - Privacy and security in highly dynamic systems
Data integration: the teenage years
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
A high-performance semi-supervised learning method for text chunking
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Scalable semantic web data management using vertical partitioning
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Learning concept hierarchies from text corpora using formal concept analysis
Journal of Artificial Intelligence Research
Using information content to evaluate semantic similarity in a taxonomy
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Semi-automatic construction of topic ontologies
EWMF'05/KDO'05 Proceedings of the 2005 joint international conference on Semantics, Web and Mining
Finding an application-appropriate model for XML data warehouses
Information Systems
Topic tracking techniques for natural language processing
ACAI '11 Proceedings of the International Conference on Advances in Computing and Artificial Intelligence
Ranked neuro fuzzy inference system (RNFIS) for information retrieval
ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part I
Data representation in machine learning-based sentiment analysis of customer reviews
PReMI'11 Proceedings of the 4th international conference on Pattern recognition and machine intelligence
A pattern discovery model for effective text mining
MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
Roles in social networks: Methodologies and research issues
Web Intelligence and Agent Systems
Hi-index | 0.00 |
Text mining refers to the discovery of previously unknown knowledge that can be found in text collections. In recent years, the text mining field has received great attention due to the abundance of textual data. A researcher in this area is requested to cope with issues originating from the natural language particularities. This survey discusses such semantic issues along with the approaches and methodologies proposed in the existing literature. It covers syntactic matters, tokenization concerns and it focuses on the different text representation techniques, categorisation tasks and similarity measures suggested.