Editorial: A topic-specific crawling strategy based on semantics similarity
Data & Knowledge Engineering
The web crawler is a key component of a search engine. This paper proposes a new method for measuring the similarity of formal concept analysis (FCA) concepts and a new notion of web page rank, both built on an information content approach that draws on users' web logs. First, we propose an extension similarity and an intension similarity that analyze a user's browsing patterns and the associated hyperlinks. Second, the information content similarity between two nouns is computed automatically from their ISA and Part-Of hierarchies and the user's web log. Third, we propose a method for computing the semantic similarity between two concepts in two different concept lattices (the base concept lattice and the current concept lattice) and for deriving a semantic ranking of web pages. Finally, our experiments indicate that the crawler is better suited to crawling focused web pages, and that the semantic ranking of web pages is a useful and efficient guide for choosing which page the crawler should visit next.
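To make the idea of an information content similarity feeding a page ranking more concrete, the following is a minimal sketch and not the paper's own formulation. It assumes Lin's information-theoretic similarity as a stand-in measure, a tiny hypothetical ISA hierarchy (`PARENT`), and hypothetical noun counts (`COUNTS`) standing in for frequencies mined from a user's web log. The FCA extension and intension similarities and the Part-Of hierarchy are not modeled here, and all names, including `page_score` and `rank_frontier`, are illustrative.

```python
import heapq
import math

# Hypothetical ISA hierarchy (child -> parent); not taken from the paper.
PARENT = {
    "crawler": "software",
    "search_engine": "software",
    "software": "entity",
    "lattice": "structure",
    "structure": "entity",
}

# Hypothetical noun counts, standing in for frequencies mined from a web log.
COUNTS = {
    "crawler": 6,
    "search_engine": 3,
    "software": 1,
    "lattice": 2,
    "structure": 0,
    "entity": 0,
}


def ancestors(term):
    """Return the term followed by its ancestors, ending at the root."""
    chain = [term]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain


def information_content(term):
    """IC(c) = -log p(c), where p(c) counts c together with its descendants."""
    total = sum(COUNTS.values())
    subtree = sum(n for t, n in COUNTS.items() if term in ancestors(t))
    return math.log(total / subtree)


def lin_similarity(a, b):
    """Lin similarity: 2 * IC(lowest common subsumer) / (IC(a) + IC(b))."""
    if a == b:
        return 1.0
    anc_a = ancestors(a)
    # Walk upward from b; the first shared node is the lowest common subsumer.
    lcs = next(t for t in ancestors(b) if t in anc_a)
    denom = information_content(a) + information_content(b)
    return 2 * information_content(lcs) / denom if denom > 0 else 0.0


def page_score(page_terms, topic_terms):
    """Average, over the page's nouns, of the best similarity to a topic noun."""
    if not page_terms:
        return 0.0
    best = [max(lin_similarity(p, t) for t in topic_terms) for p in page_terms]
    return sum(best) / len(best)


def rank_frontier(pages, topic_terms):
    """Order candidate pages so the highest-scoring page is visited first."""
    heap = [(-page_score(terms, topic_terms), url) for url, terms in pages.items()]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[1] for _ in range(len(heap))]
```

As a usage example, calling `rank_frontier({"page_a": ["crawler", "search_engine"], "page_b": ["lattice"]}, ["crawler"])` places `page_a` ahead of `page_b`, because the nouns on `page_b` share only the root of the hierarchy with the topic. A priority queue is used here because a focused crawler repeatedly needs the single best candidate rather than a fully sorted list.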