Adaptive signal processing
Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Cyc: toward programs with common sense
Communications of the ACM
Boolean Feature Discovery in Empirical Learning
Machine Learning
Term clustering of syntactic phrases
SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
An evaluation of phrasal and clustered representations on a text categorization task
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Numerical recipes in C (2nd ed.): the art of scientific computing
Numerical recipes in C (2nd ed.): the art of scientific computing
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Feature discovery for problem solving systems
Feature discovery for problem solving systems
OHSUMED: an interactive retrieval evaluation and new large test collection for research
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Improving text retrieval for the routing problem using latent semantic indexing
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
CYC: a large-scale investment in knowledge infrastructure
Communications of the ACM
Training algorithms for linear text classifiers
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Exploring the similarity space
ACM SIGIR Forum
Inductive learning algorithms and representations for text categorization
Proceedings of the seventh international conference on Information and knowledge management
Distributional clustering of words for text classification
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Feature generation for sequence categorization
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Making large-scale support vector machine learning practical
Advances in kernel methods
Similarity-Based Models of Word Cooccurrence Probabilities
Machine Learning - Special issue on natural language learning
Foundations of statistical natural language processing
Foundations of statistical natural language processing
A re-examination of text categorization methods
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Contextual correlates of synonymy
Communications of the ACM
Text databases & document management
A study of thresholding strategies for text categorization
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Using LSI for text classification in the presence of background text
Proceedings of the tenth international conference on Information and knowledge management
Placing search in context: the concept revisited
ACM Transactions on Information Systems (TOIS)
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Modern Information Retrieval
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
High-performing feature selection for text classification
Proceedings of the eleventh international conference on Information and knowledge management
Feature Generation Using General Constructor Functions
Machine Learning
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
An Information-Theoretic Definition of Similarity
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Feature Engineering for Text Classification
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Automatically Extracting Features for Concept Learning from the Web
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Improving Short-Text Classification using Unlabeled Data for Classification Problems
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
A comparative study for domain ontology guided feature extraction
ACSC '03 Proceedings of the 26th Australasian computer science conference - Volume 16
Evaluating the Utility of Statistical Phrases and Latent Semantic Indexing for Text Classification
ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
Text categorization by boosting automatically extracted concepts
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
A divisive information theoretic feature clustering algorithm for text classification
The Journal of Machine Learning Research
Supervised term weighting for automated text categorization
Proceedings of the 2003 ACM symposium on Applied computing
Augmenting Naive Bayes Classifiers with Statistical Language Models
Information Retrieval
Automatic retrieval and clustering of similar words
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Feature lattices for maximum entropy modelling
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
RCV1: A New Benchmark Collection for Text Categorization Research
The Journal of Machine Learning Research
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Text classification and named entities for new event detection
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Improving Text Classification using Local Latent Semantic Indexing
ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Measures of distributional similarity
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
The form is the substance: classification of genres in text
HLTKM '01 Proceedings of the workshop on Human Language Technology and Knowledge Management - Volume 2001
Thumbs up?: sentiment classification using machine learning techniques
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
NLP found helpful (at least for one text categorization task)
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Discovering missing links in Wikipedia
Proceedings of the 3rd international workshop on Link discovery
A web-based kernel function for measuring the similarity of short text snippets
Proceedings of the 15th international conference on World Wide Web
Evaluating WordNet-based Measures of Lexical Semantic Relatedness
Computational Linguistics
Similarity of Semantic Relations
Computational Linguistics
Statistical Comparisons of Classifiers over Multiple Data Sets
The Journal of Machine Learning Research
The Journal of Machine Learning Research
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
WikiRelate! computing semantic relatedness using wikipedia
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Corpus-based and knowledge-based measures of text semantic similarity
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Cheap and fast---but is it good?: evaluating non-expert annotations for natural language tasks
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Importance of semantic representation: dataless classification
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Using wiktionary for computing semantic relatedness
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Concept-based feature generation and selection for information retrieval
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Computing semantic relatedness using Wikipedia-based explicit semantic analysis
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Automatically creating datasets for measures of semantic relatedness
LD '06 Proceedings of the Workshop on Linguistic Distances
Feature generation for text categorization using world knowledge
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Measuring semantic similarity by latent relational analysis
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Combining naive bayes and n-gram language models for text classification
ECIR'03 Proceedings of the 25th European conference on IR research
Similarity measures for short segments of text
ECIR'07 Proceedings of the 29th European conference on IR research
A Wikipedia-based multilingual retrieval model
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
WikiWalk: random walks on Wikipedia for semantic relatedness
TextGraphs-4 Proceedings of the 2009 Workshop on Graph-based Methods for Natural Language Processing
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Joint process games: from ratings to wikis
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Text relatedness based on a word thesaurus
Journal of Artificial Intelligence Research
Efficient wikipedia-based semantic interpreter by exploiting top-k processing
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Semantics-based representation model for multi-layer text classification
KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part II
Cross-language information retrieval using meta-language index construction and structural queries
CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
Fast text categorization using concise semantic analysis
Pattern Recognition Letters
Blognoon: exploring a topic in the blogosphere
Proceedings of the 20th international conference companion on World wide web
Combining heterogeneous knowledge resources for improved distributional semantic models
CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Beyond the bag-of-words paradigm to enhance information retrieval applications
Proceedings of the Fourth International Conference on SImilarity Search and APplications
Discovering context: classifying tweets through a semantic transform based on wikipedia
FAC'11 Proceedings of the 6th international conference on Foundations of augmented cognition: directing the future of adaptive systems
A multi-layer text classification framework based on two-level representation model
Expert Systems with Applications: An International Journal
Proceedings of the 20th ACM international conference on Information and knowledge management
Topical clustering of search results
Proceedings of the fifth ACM international conference on Web search and data mining
Supporting collaboration in Wikipedia between language communities
Proceedings of the 4th international conference on Intercultural Collaboration
Term similarity and weighting framework for text representation
ICCBR'11 Proceedings of the 19th international conference on Case-Based Reasoning Research and Development
Music retagging using label propagation and robust principal component analysis
Proceedings of the 21st international conference companion on World Wide Web
Re-ranking bibliographic records for personalized library search
Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
Classification of short texts by deploying topical annotations
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Latent Geospatial Semantics of Social Media
ACM Transactions on Intelligent Systems and Technology (TIST)
Large-scale learning of word relatedness with constraints
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Learning a concept-based document similarity measure
Journal of the American Society for Information Science and Technology
Explanatory semantic relatedness and explicit spatialization for exploratory search
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Enhanced semantic TV-show representation for personalized electronic program guides
UMAP'12 Proceedings of the 20th international conference on User Modeling, Adaptation, and Personalization
A new document author representation for authorship attribution
MCPR'12 Proceedings of the 4th Mexican conference on Pattern Recognition
Query expansion using explicit semantic analysis
Proceedings of the 4th International Conference on Internet Multimedia Computing and Service
Selecting keywords to represent web pages using Wikipedia information
Proceedings of the 18th Brazilian symposium on Multimedia and the web
Domain-specific semantic relatedness from Wikipedia: can a course be transferred?
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop
SRIUBC: simple similarity features for semantic textual similarity
SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
The CQC algorithm: cycling in graphs to semantically enrich and enhance a bilingual dictionary
Journal of Artificial Intelligence Research
On the self-similarity of intertextual structures in Wikipedia
Proceedings of the First ACM International Workshop on Hot Topics on Interdisciplinary Social Networks Research
On the connections between explicit semantic analysis and latent semantic analysis
Proceedings of the 21st ACM international conference on Information and knowledge management
Supervised learning of semantic relatedness
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I
Collaboratively built semi-structured content and Artificial Intelligence: The story so far
Artificial Intelligence
Transforming Wikipedia into a large scale multilingual concept network
Artificial Intelligence
Automated query learning with Wikipedia and genetic programming
Artificial Intelligence
Computing text semantic relatedness using the contents and links of a hypertext encyclopedia
Artificial Intelligence
Content-based and collaborative techniques for tag recommendation: an empirical evaluation
Journal of Intelligent Information Systems
Semantic tagging of places based on user interest profiles from online social networks
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
A framework for benchmarking entity-annotation systems
Proceedings of the 22nd international conference on World Wide Web
Extracting Knowledge from Wikipedia Articles through Distributed Semantic Analysis
Proceedings of the 13th International Conference on Knowledge Management and Knowledge Technologies
Exploratory search with semantic transformations using collaborative knowledge bases
Proceedings of the 7th ACM international conference on Web search and data mining
Hi-index | 0.00 |
Adequate representation of natural language semantics requires access to vast amounts of common sense and domain-specific world knowledge. Prior work in the field was based on purely statistical techniques that did not make use of background knowledge, on limited lexicographic knowledge bases such as WordNet, or on huge manual efforts such as the CYC project. Here we propose a novel method, called Explicit Semantic Analysis (ESA), for fine-grained semantic interpretation of unrestricted natural language texts. Our method represents meaning in a high-dimensional space of concepts derived from Wikipedia, the largest encyclopedia in existence. We explicitly represent the meaning of any text in terms of Wikipedia-based concepts. We evaluate the effectiveness of our method on text categorization and on computing the degree of semantic relatedness between fragments of natural language text. Using ESA results in significant improvements over the previous state of the art in both tasks. Importantly, due to the use of natural concepts, the ESA model is easy to explain to human users.