Focused crawling: a new approach to topic-specific Web resource discovery
WWW '99 Proceedings of the eighth international conference on World Wide Web
Adaptive Retrieval Agents: Internalizing Local Contextand Scaling up to the Web
Machine Learning - Special issue on information retrieval
Intelligent crawling on the World Wide Web with arbitrary predicates
Proceedings of the 10th international conference on World Wide Web
Machine Learning
Using Reinforcement Learning to Spider the Web Efficiently
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Focused Crawling Using Context Graphs
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Ontology Based Personalized Search
ICTAI '99 Proceedings of the 11th IEEE International Conference on Tools with Artificial Intelligence
Ontology-focused crawling of Web documents
Proceedings of the 2003 ACM symposium on Applied computing
Enhancing Focused Crawling with Genetic Algorithms
ITCC '05 Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II - Volume 02
User feedback based enhancement in web search quality
Information Sciences—Informatics and Computer Science: An International Journal
An Efficient Adaptive Focused Crawler Based on Ontology Learning
HIS '05 Proceedings of the Fifth International Conference on Hybrid Intelligent Systems
Geographically focused collaborative crawling
Proceedings of the 15th international conference on World Wide Web
Topic-specific crawling on the web with the measurements of the relevancy context graph
Information Systems - Special issue: The semantic web and web services
Dealing with semantic heterogeneity for improving web usage
Data & Knowledge Engineering - Special issue: ER 2004
Category ranking for personalized search
Data & Knowledge Engineering
Using HMM to learn user browsing patterns for focused web crawling
Data & Knowledge Engineering - Special issue: WIDM 2004
MedicoPort: A medical search engine for all
Computer Methods and Programs in Biomedicine
Information Sciences: an International Journal
Web search enhancement by mining user actions
Information Sciences: an International Journal
Learning and inferencing in user ontology for personalized Semantic Web search
Information Sciences: an International Journal
Combining information from multiple search engines-Preliminary comparison
Information Sciences: an International Journal
Adding semantics to software-as-a-service and cloud computing
WSEAS Transactions on Computers
Using ontologies to facilitate post-processing of association rules by domain experts
Information Sciences: an International Journal
An ontology-based approach to Chinese semantic advertising
Information Sciences: an International Journal
Discovery of environmental nodes in the web
IRFC'12 Proceedings of the 5th conference on Multidisciplinary Information Retrieval
Semantic ranking of web pages based on formal concept analysis
Journal of Systems and Software
ICSOC'12 Proceedings of the 10th international conference on Service-Oriented Computing
Focused crawling of tagged web resources using ontology
Computers and Electrical Engineering
Hi-index | 0.07 |
Focused crawling is aimed at selectively seeking out pages that are relevant to a predefined set of topics. Since an ontology is a well-formed knowledge representation, ontology-based focused crawling approaches have come into research. However, since these approaches utilize manually predefined concept weights to calculate the relevance scores of web pages, it is difficult to acquire the optimal concept weights to maintain a stable harvest rate during the crawling process. To address this issue, we proposed a learnable focused crawling framework based on ontology. An ANN (artificial neural network) was constructed using a domain-specific ontology and applied to the classification of web pages. Experimental results show that our approach outperforms the breadth-first search crawling approach, the simple keyword-based crawling approach, the ANN-based focused crawling approach, and the focused crawling approach that uses only a domain-specific ontology.