C4.5: programs for machine learning
C4.5: programs for machine learning
The nature of statistical learning theory
The nature of statistical learning theory
A language modeling approach to information retrieval
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Fast training of support vector machines using sequential minimal optimization
Advances in kernel methods
Real life, real users, and real needs: a study and analysis of user queries on the web
Information Processing and Management: an International Journal
Variations in relevance judgments and the measurement of retrieval effectiveness
Information Processing and Management: an International Journal
Modern Information Retrieval
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Qualitative Evaluation of Thesaurus-Based Retrieval
ECDL '02 Proceedings of the 6th European Conference on Research and Advanced Technology for Digital Libraries
SemTag and seeker: bootstrapping the semantic web via automated semantic annotation
WWW '03 Proceedings of the 12th international conference on World Wide Web
DBXplorer: A System for Keyword-Based Search over Relational Databases
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Keyword Searching and Browsing in Databases using BANKS
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
A study of smoothing methods for language models applied to information retrieval
ACM Transactions on Information Systems (TOIS)
Hourly analysis of a very large topically categorized web query log
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Defining a session on Web search engines: Research Articles
Journal of the American Society for Information Science and Technology
A large-scale evaluation and analysis of personalized search strategies
Proceedings of the 16th international conference on World Wide Web
Query performance prediction in web search environments
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Discover: keyword search in relational databases
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Wikify!: linking documents to encyclopedic knowledge
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Linked data on the web (LDOW2008)
Proceedings of the 17th international conference on World Wide Web
SQAK: doing more with keywords
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Discovering key concepts in verbose queries
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
A longitudinal study of real-time search assistance adoption
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
YAGO: A Large Ontology from Wikipedia and WordNet
Web Semantics: Science, Services and Agents on the World Wide Web
Learning to link with wikipedia
Proceedings of the 17th ACM conference on Information and knowledge management
Inter-coder agreement for computational linguistics
Computational Linguistics
Modeling Documents by Combining Semantic Concepts with Unsupervised Statistical Learning
ISWC '08 Proceedings of the 7th International Conference on The Semantic Web
Learning Concept Mappings from Instance Similarity
ISWC '08 Proceedings of the 7th International Conference on The Semantic Web
Analysis of long queries in a large scale search log
Proceedings of the 2009 workshop on Web Search Click Data
NAGA: Searching and Ranking Knowledge
ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Named entity recognition in query
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
An evaluation of entity and frequency based query completion methods
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Learning to Disambiguate Search Queries from Short Sessions
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Open information extraction from the web
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Language-model-based ranking for queries on RDF-graphs
Proceedings of the 18th ACM conference on Information and knowledge management
Image annotation using clickthrough data
Proceedings of the ACM International Conference on Image and Video Retrieval
Learning Semantic Query Suggestions
ISWC '09 Proceedings of the 8th International Semantic Web Conference
Investigating the Semantic Gap through Query Log Analysis
ISWC '09 Proceedings of the 8th International Semantic Web Conference
Semantic annotation, indexing, and retrieval
Web Semantics: Science, Services and Agents on the World Wide Web
OntoGen: semi-automatic ontology editor
Proceedings of the 2007 conference on Human interface: Part II
DBpedia: a nucleus for a web of open data
ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference
Search behavior of media professionals at an audiovisual archive: A transaction log analysis
Journal of the American Society for Information Science and Technology
DivQ: diversification for keyword search over structured databases
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Supervised query modeling using wikipedia
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Entity search: building bridges between two worlds
Proceedings of the 3rd International Semantic Search Workshop
Web Semantics: Science, Services and Agents on the World Wide Web
Estimating continuous distributions in Bayesian classifiers
UAI'95 Proceedings of the Eleventh conference on Uncertainty in artificial intelligence
A string metric for ontology alignment
ISWC'05 Proceedings of the 4th international conference on The Semantic Web
A survey of schema-based matching approaches
Journal on Data Semantics IV
Matching unstructured vocabularies using a background ontology
EKAW'06 Proceedings of the 15th international conference on Managing Knowledge in a World of Networks
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Adding semantics to microblog posts
Proceedings of the fifth ACM international conference on Web search and data mining
Foundations and Trends in Information Retrieval
Evaluating semantic search query approaches with expert and casual users
ISWC'12 Proceedings of the 11th international conference on The Semantic Web - Volume Part II
Linking the kingdom: enriched access to a historiographical text
Proceedings of the seventh international conference on Knowledge capture
Web usage mining with semantic analysis
Proceedings of the 22nd international conference on World Wide Web
Feeding the second screen: semantic linking based on subtitles
Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
Multi-step classification approaches to cumulative citation recommendation
Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
Hi-index | 0.00 |
We introduce the task of mapping search engine queries to DBpedia, a major linking hub in the Linking Open Data cloud. We propose and compare various methods for addressing this task, using a mixture of information retrieval and machine learning techniques. Specifically, we present a supervised machine learning-based method to determine which concepts are intended by a user issuing a query. The concepts are obtained from an ontology and may be used to provide contextual information, related concepts, or navigational suggestions to the user submitting the query. Our approach first ranks candidate concepts using a language modeling for information retrieval framework. We then extract query, concept, and search-history feature vectors for these concepts. Using manual annotations we inform a machine learning algorithm that learns how to select concepts from the candidates given an input query. Simply performing a lexical match between the queries and concepts is found to perform poorly and so does using retrieval alone, i.e., omitting the concept selection stage. Our proposed method significantly improves upon these baselines and we find that support vector machines are able to achieve the best performance out of the machine learning algorithms evaluated.