In this paper, we discuss the architecture and implementation of the Semantic Web Search Engine (SWSE). Following traditional search engine architecture, SWSE consists of crawling, data enhancing, indexing and a user interface for search, browsing and retrieval of information. Unlike traditional search engines, SWSE operates over RDF Web data (also loosely known as Linked Data), which poses unique challenges for the system design, architecture, algorithms, implementation and user interface. In particular, adopting Semantic Web technologies for Web data is difficult because the characteristic problems of the Web (scale, unreliability, inconsistency and noise) are largely overlooked by the current Semantic Web standards. Herein, we describe the current SWSE system, first detailing the architecture and then elaborating upon the function, design, implementation and performance of each individual component. In so doing, we also give an insight into how current Semantic Web standards can be tailored, in a best-effort manner, for use on Web data. Throughout, we offer evaluation and complementary argumentation to support our design choices, and discuss future directions and open research questions. Finally, we provide a candid discussion of the difficulties currently faced in bringing such a search engine into the mainstream, and of the lessons learnt from roughly six years of work on the Semantic Web Search Engine project.
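The staged architecture named above (crawling, data enhancing, indexing, search) can be illustrated with a toy sketch. This is not the SWSE implementation: the in-memory `FAKE_WEB` dictionary, the example URLs and identifiers, and the functions `crawl`, `enhance`, `build_index` and `search` are all hypothetical names introduced here, and the "enhancement" step is reduced to duplicate removal purely for illustration.

```python
from collections import defaultdict

# Hypothetical in-memory stand-in for the Web:
# URL -> list of (subject, predicate, object) triples found in that document.
FAKE_WEB = {
    "http://example.org/a": [
        ("ex:alice", "foaf:name", "Alice Smith"),
        ("ex:alice", "rdfs:seeAlso", "http://example.org/b"),
    ],
    "http://example.org/b": [
        ("ex:bob", "foaf:name", "Bob Jones"),
        ("ex:bob", "foaf:knows", "ex:alice"),
    ],
}


def crawl(seeds):
    """Breadth-first fetch of documents, following objects that are known URLs."""
    seen, queue, quads = set(), list(seeds), []
    while queue:
        url = queue.pop(0)
        if url in seen or url not in FAKE_WEB:
            continue
        seen.add(url)
        for s, p, o in FAKE_WEB[url]:
            quads.append((s, p, o, url))  # keep the source document as context
            if o in FAKE_WEB:
                queue.append(o)
    return quads


def enhance(quads):
    """Placeholder for data enhancement: here, only exact-duplicate removal."""
    return sorted(set(quads))


def build_index(quads):
    """Inverted index from lower-cased literal tokens to subject identifiers."""
    index = defaultdict(set)
    for s, _p, o, _src in quads:
        # Assumption of this sketch: anything not starting with these
        # prefixes is treated as a literal value.
        if not o.startswith(("ex:", "http://")):
            for token in o.lower().split():
                index[token].add(s)
    return index


def search(index, keyword):
    """Return the subjects whose literal values contain the keyword."""
    return sorted(index.get(keyword.lower(), set()))


quads = enhance(crawl(["http://example.org/a"]))
index = build_index(quads)
print(search(index, "alice"))  # ['ex:alice']
```

Each stage consumes only the output of the previous one, which mirrors the component-by-component structure the abstract describes; a real system would replace every stage with a distributed, fault-tolerant counterpart.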