Semantic ranking of web pages based on formal concept analysis

Authors:
Yajun Du;Yufeng Hai
Affiliations:
School of Mathematical and Computers Science, Xihua University, Chengdu 610039, Sichuan, China;School of Mathematical and Computers Science, Xihua University, Chengdu 610039, Sichuan, China
Venue:
Journal of Systems and Software
Year:
2013

Citing 28
Cited 2

Performance standards and evaluations in IR test collections: cluster-based retrieval models

Information Processing and Management: an International Journal
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Efficient crawling through URL ordering

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Authoritative sources in a hyperlinked environment

Journal of the ACM (JACM)
The stochastic approach for link-structure analysis (SALSA) and the TKC effect

Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
A vector space model for automatic indexing

Communications of the ACM
Finding authorities and hubs from link structures on the World Wide Web

Proceedings of the 10th international conference on World Wide Web
The semantic web: yet another hip?

Data & Knowledge Engineering - DKE 40
An Information-Theoretic Definition of Similarity

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Focused Crawling Using Context Graphs

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Learnable topic-specific web crawler

Journal of Network and Computer Applications - Special issue on computational intelligence on the internet
A subjective measure of web search quality

Information Sciences—Informatics and Computer Science: An International Journal
Using HMM to learn user browsing patterns for focused web crawling

Data & Knowledge Engineering - Special issue: WIDM 2004
Combining text and link analysis for focused crawling-An application for vertical search engines

Information Systems
Concept similarity in Formal Concept Analysis: An information content approach

Knowledge-Based Systems
Improving web-query processing through semantic knowledge

Data & Knowledge Engineering
A Topic-Specific Web Crawler with Concept Similarity Context Graph Based on FCA

ICIC '08 Proceedings of the 4th international conference on Intelligent Computing: Advanced Intelligent Computing Theories and Applications - with Aspects of Artificial Intelligence
An ontology-based approach to learnable focused crawling

Information Sciences: an International Journal
Towards open decision support systems based on semantic focused crawling

Expert Systems with Applications: An International Journal
Improving the performance of focused web crawlers

Data & Knowledge Engineering
Using information content to evaluate semantic similarity in a taxonomy

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 1
Topic-specific crawling on the Web with the measurements of the relevancy context graph

Information Systems
Strategy for mining association rules for web pages based on formal concept analysis

Applied Soft Computing
OntoCrawler: A focused crawler with ontology-supported website models for information agents

Expert Systems with Applications: An International Journal
New fast algorithm for constructing concept lattice

ICCSA'07 Proceedings of the 2007 international conference on Computational science and Its applications - Volume Part II
Ontology selection ranking model for knowledge reuse

Expert Systems with Applications: An International Journal
Optimal threshold control by the robots of web search engines with obsolescence of documents

Computer Networks: The International Journal of Computer and Telecommunications Networking
Ontology-based concept similarity in Formal Concept Analysis

Information Sciences: an International Journal

Formal concept analysis approach for data extraction from a limited deep web database

Journal of Intelligent Information Systems
Editorial: A topic-specific crawling strategy based on semantics similarity

Data & Knowledge Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

A web crawler is an important research component in a search engine. In this paper, a new method for measuring the similarity of formal concept analysis (FCA) concepts and a new notion of a web page's rank are proposed that use an information content approach based on users' web logs. First, an extension similarity and an intension similarity that analyze a user's browsing pattern and their hyperlinks are proposed. Second, the information content similarity between two nouns is computed automatically by examining their ISA and Part-Of hierarchy and using a user's web log. A method for computing the semantic similarity between two concepts in two different concept lattices (the base concept lattice and the current concept lattice) and finding the semantic ranking of web pages is proposed. Last, our experiment demonstrates that our crawler is more suitable for crawling focused web pages. It proves that the semantic ranking of web pages is useful and efficient for making a web crawler's choice of a web page for continuing work.