The Web, the world's largest unstructured database, has greatly improved access to documents. However, documents on the Web are largely disorganized, and the distributed nature of the World Wide Web makes it difficult to use as a tool for information and knowledge management. Users who take on the difficult task of exploring the Web therefore need intelligent support. This paper proposes an approach to document discovery that builds on a comprehensive framework for ontology-focused crawling of Web documents. The framework includes means for using a complex ontology and its associated instance elements, and it defines several relevance computation strategies. An empirical evaluation shows promising results.
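The abstract does not spell out the relevance computation strategies, so the following is only a minimal sketch of the general idea of ontology-focused crawling, not the method of the paper. It assumes a toy ontology that maps concepts to lexical entries, scores a page by the share of its tokens that match any entry, and visits links best-first according to the score of the page that linked to them. The names `ONTOLOGY`, `relevance`, and `crawl_order`, and the in-memory `pages` mapping that stands in for the Web, are all hypothetical.

```python
import heapq
import re

# Hypothetical toy ontology: concept -> lexical entries (not taken from the paper).
ONTOLOGY = {
    "crawler": ["crawler", "spider"],
    "ontology": ["ontology", "taxonomy"],
    "search": ["search", "retrieval"],
}


def relevance(text, ontology=ONTOLOGY):
    """Return the fraction of tokens in `text` that match an ontology entry."""
    tokens = re.findall(r"[a-z]+", text.lower())
    if not tokens:
        return 0.0
    terms = {entry for entries in ontology.values() for entry in entries}
    hits = sum(1 for token in tokens if token in terms)
    return hits / len(tokens)


def crawl_order(pages, seed):
    """Visit pages best-first, ranking each link by its parent's relevance.

    `pages` is an in-memory stand-in for the Web: url -> (text, outlinks).
    Returns the visited (url, score) pairs in visiting order.
    """
    frontier = [(-1.0, seed)]  # heapq is a min-heap, so scores are negated
    seen = {seed}
    order = []
    while frontier:
        _, url = heapq.heappop(frontier)
        text, links = pages[url]
        score = relevance(text)
        order.append((url, score))
        for link in links:
            if link in pages and link not in seen:
                seen.add(link)
                heapq.heappush(frontier, (-score, link))
    return order
```

Under these assumptions, links found on pages that match the ontology well are fetched before links found on off-topic pages. A real framework would replace the token-matching score with strategies that also use the ontology's relations and instances.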