ACM Computing Surveys (CSUR) - Annals of discrete mathematics, 24
Using latent semantic analysis to improve access to textual information
CHI '88 Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Automatic text processing
Suffix arrays: a new method for on-line string searches
SIAM Journal on Computing
In situ generation of compressed inverted files
Journal of the American Society for Information Science
Randomized algorithms
Dissemination of collection wide information in a distributed information retrieval system
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Resource scheduling for parallel database and scientific applications
Proceedings of the eighth annual ACM symposium on Parallel algorithms and architectures
Matrix computations (3rd ed.)
A design of a distributed full text retrieval system
Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval
Life, death, and lawfulness on the electronic frontier
Proceedings of the ACM SIGCHI Conference on Human factors in computing systems
Enhanced hypertext categorization using hyperlinks
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Inferring Web communities from link topology
Proceedings of the ninth ACM conference on Hypertext and hypermedia : links, objects, time and space---structure in hypermedia systems: links, objects, time and space---structure in hypermedia systems
Query performance for tightly coupled distributed digital libraries
Proceedings of the third ACM conference on Digital libraries
Compressed inverted files with reduced decoding overheads
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Automatic resource compilation by analyzing hyperlink structure and associated text
WWW7 Proceedings of the seventh international conference on World Wide Web 7
The anatomy of a large-scale hypertextual Web search engine
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Efficient crawling through URL ordering
WWW7 Proceedings of the seventh international conference on World Wide Web 7
The connectivity server: fast access to linkage information on the Web
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Towards a better understanding of Web resources and server responses for improved caching
WWW '99 Proceedings of the eighth international conference on World Wide Web
Finding related pages in the World Wide Web
WWW '99 Proceedings of the eighth international conference on World Wide Web
Trawling the Web for emerging cyber-communities
WWW '99 Proceedings of the eighth international conference on World Wide Web
Mirror, mirror on the Web: a study of host pairs with replicated content
WWW '99 Proceedings of the eighth international conference on World Wide Web
Focused crawling: a new approach to topic-specific Web resource discovery
WWW '99 Proceedings of the eighth international conference on World Wide Web
Managing gigabytes (2nd ed.): compressing and indexing documents and images
Managing gigabytes (2nd ed.): compressing and indexing documents and images
Authoritative sources in a hyperlinked environment
Journal of the ACM (JACM)
Synchronizing a database to improve freshness
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Does “authority” mean quality? predicting expert quality ratings of Web documents
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
WebBase: a repository of Web pages
Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Signature files: an access method for documents and its analytical performance evaluation
ACM Transactions on Information Systems (TOIS)
Building a distributed full-text index for the Web
Proceedings of the 10th international conference on World Wide Web
PDIS '93 Proceedings of the second international conference on Parallel and distributed information systems
Data Structures and Algorithms
Data Structures and Algorithms
The Evolution of the Web and Implications for an Incremental Crawler
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Focused Crawling Using Context Graphs
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Approximating Aggregate Queries about Web Pages via Random Walks
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Extracting Patterns and Relations from the World Wide Web
WebDB '98 Selected papers from the International Workshop on The World Wide Web and Databases
Optimal crawling strategies for web search engines
Proceedings of the 11th international conference on World Wide Web
Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
I/O-efficient techniques for computing pagerank
Proceedings of the eleventh international conference on Information and knowledge management
Text Retrieval Systems for the Web
Programming and Computing Software
Web Recency Maintenance Protocol
IWDC '02 Proceedings of the 4th International Workshop on Distributed Computing, Mobile and Wireless Computing
Design and Implementation of a Distributed Crawler and Filtering Processor
NGITS '02 Proceedings of the 5th International Workshop on Next Generation Information Technologies and Systems
Using PageRank to Characterize Web Structure
COCOON '02 Proceedings of the 8th Annual International Conference on Computing and Combinatorics
Agents, Crawlers, and Web Retrieval
CIA '02 Proceedings of the 6th International Workshop on Cooperative Information Agents VI
Journal of the American Society for Information Science and Technology
Winnowing: local algorithms for document fingerprinting
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Query expansion using associated queries
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Optimizing result prefetching in web search engines with segmented indices
ACM Transactions on Internet Technology (TOIT)
Do the Web sites of higher rated scholars have significantly more online impact?
Journal of the American Society for Information Science and Technology
Finding similar academic web sites with links, bibliometric couplings and colinks
Information Processing and Management: an International Journal
Guest Editors' Introduction: Web Engineering--The Evolution of New Technologies
Computing in Science and Engineering
Web Searching and Information Retrieval
Computing in Science and Engineering
Design of a crawler with bounded bandwidth
Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters
On the temporal dimension of search
Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters
Teaching key topics in computer science and information systems through a web search engine project
Journal on Educational Resources in Computing (JERIC)
Local methods for estimating pagerank values
Proceedings of the thirteenth ACM international conference on Information and knowledge management
EBizPort: collecting and analyzing business intelligence information
Journal of the American Society for Information Science and Technology
UbiCrawler: a scalable fully distributed web crawler
Software—Practice & Experience
Concept-based querying in mediator systems
The VLDB Journal — The International Journal on Very Large Data Bases
A modeling approach to uncover hyperlink patterns: the case of Canadian universities
Information Processing and Management: an International Journal
Three-level caching for efficient query processing in large Web search engines
WWW '05 Proceedings of the 14th international conference on World Wide Web
Crawling a country: better strategies than breadth-first for web page ordering
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
Lexical and semantic clustering by web links
Journal of the American Society for Information Science and Technology - Special issue: Webometrics
The impact of metadata implementation on webpage visibility in search engine results (part II)
Information Processing and Management: an International Journal - Special issue: Cross-language information retrieval
SpidersRUs: automated development of vertical search engines in different domains and languages
Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
Personalized e-learning system using Item Response Theory
Computers & Education
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Interpreting social science link analysis research: A theoretical framework
Journal of the American Society for Information Science and Technology
DGPort: a web portal for digital government
dg.o '03 Proceedings of the 2003 annual national conference on Digital government research
DGPort: a web portal for digital government
dg.o '03 Proceedings of the 2003 annual national conference on Digital government research
Multilingual Web retrieval: An experiment in English–Chinese business intelligence
Journal of the American Society for Information Science and Technology
Inverted files for text search engines
ACM Computing Surveys (CSUR)
WebKhoj: Indian language IR from multiple character encodings
Proceedings of the 15th international conference on World Wide Web
PageSim: a novel link-based measure of web page aimilarity
Proceedings of the 15th international conference on World Wide Web
Efficient query processing in geographic web search engines
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Generalizing PageRank: damping functions for link-based ranking algorithms
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Stanford WebBase components and applications
ACM Transactions on Internet Technology (TOIT)
Evaluation of crawling policies for a web-repository crawler
Proceedings of the seventeenth conference on Hypertext and hypermedia
Web crawling ethics revisited: Cost, privacy, and denial of service
Journal of the American Society for Information Science and Technology
Searching for experts on the Web: A review of contemporary expertise locator systems
ACM Transactions on Internet Technology (TOIT)
ACM Transactions on Internet Technology (TOIT)
Personalized mining of web documents using link structures and fuzzy concept networks
Applied Soft Computing
Architecture of a grid-enabled Web search engine
Information Processing and Management: an International Journal
Building a scientific knowledge web portal: the NanoPort experience
Decision Support Systems
User modeling for personalized Web search with self-organizing map: Research Articles
Journal of the American Society for Information Science and Technology
User modeling for personalized Web search with self-organizing map: Research Articles
Journal of the American Society for Information Science and Technology
CMedPort: an integrated approach to facilitating Chinese medical information seeking
Decision Support Systems
Efficient in-memory extensible inverted file
Information Systems
The quest to find the best pages on the web
Information Services and Use
Detecting near-duplicates for web crawling
Proceedings of the 16th international conference on World Wide Web
Efficient search in large textual collections with redundancy
Proceedings of the 16th international conference on World Wide Web
Decoding the structure of the WWW: A comparative analysis of Web crawls
ACM Transactions on the Web (TWEB)
Information discovery and retrieval tools
Information Services and Use
Genetic Programming-Based Discovery of Ranking Functions for Effective Web Search
Journal of Management Information Systems
Know your neighbors: web spam detection using the web topology
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Searching for logo and trademark images on the web
Proceedings of the 6th ACM international conference on Image and video retrieval
User-assisted similarity estimation for searching related web pages
Proceedings of the eighteenth conference on Hypertext and hypermedia
Optimizing result prefetching in web search engines with segmented indices
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
I/O-conscious data preparation for large-scale web search engines
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Dynamic role allocation for small search engine clusters
Proceedings of the 2007 annual research conference of the South African institute of computer scientists and information technologists on IT research in developing countries
Complex queries over web repositories
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Optimized query execution in large search engines with global page ordering
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Just in time indexing for up to the second search
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
A machine learning approach to web page filtering using content and structure analysis
Decision Support Systems
Extracting accurate and complete results from search engines: Case study windows live
Journal of the American Society for Information Science and Technology
DistanceRank: An intelligent ranking algorithm for web pages
Information Processing and Management: an International Journal
A study about browsers in the Web and the Desktop
EATIS '07 Proceedings of the 2007 Euro American conference on Telematics and information systems
Managing legal risks associated with intellectual property on the web
International Journal of Business Information Systems
IRLbot: scaling to 6 billion pages and beyond
Proceedings of the 17th international conference on World Wide Web
Fourth international workshop on adversarial information retrieval on the web (AIRWeb 2008)
Proceedings of the 17th international conference on World Wide Web
SpidersRUs: Creating specialized search engines in multiple languages
Decision Support Systems
Toward a theory of network gatekeeping: A framework for exploring information control
Journal of the American Society for Information Science and Technology
Quantitative comparisons of search engine results
Journal of the American Society for Information Science and Technology
Improving Web Search by Categorization, Clustering, and Personalization
ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
Exploiting Hybrid Parallelism in Web Search Engines
Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
Collection selection: ...now, with more documents!
Proceedings of the 3rd international conference on Scalable information systems
A Hybrid System: Neural Network with Data Mining in an e-Learning Environment
KES '07 Knowledge-Based Intelligent Information and Engineering Systems and the XVII Italian Workshop on Neural Networks on Proceedings of the 11th International Conference
Web robot detection: A probabilistic reasoning approach
Computer Networks: The International Journal of Computer and Telecommunications Networking
Improving Search Engines Performance on Multithreading Processors
High Performance Computing for Computational Science - VECPAR 2008
Ranking billions of web pages using diodes
Communications of the ACM - A Blind Person's Interaction with Technology
IRLbot: Scaling to 6 billion pages and beyond
ACM Transactions on the Web (TWEB)
Journal of Information Science
Hierarchical location and topic based query expansion
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Design and deployment of a digital forensics service platform for online videos
MiFor '09 Proceedings of the First ACM workshop on Multimedia in forensics
An investigation of web crawler behavior: characterization and metrics
Computer Communications
Leveraging a scalable row store to build a distributed text index
Proceedings of the first international workshop on Cloud data management
FICA: A novel intelligent crawling algorithm based on reinforcement learning
Web Intelligence and Agent Systems
Designing the user interface and functions of a search engine development tool
Decision Support Systems
Adaptive focused crawler based on tunneling and link analysis
ICACT'09 Proceedings of the 11th international conference on Advanced Communication Technology - Volume 3
Referral based expertise search system in a time evolving social network
Proceedings of the Third Annual ACM Bangalore Conference
An initial proposal for cooperative evaluation on information retrieval in Portuguese
PROPOR'03 Proceedings of the 6th international conference on Computational processing of the Portuguese language
Efficient indexing of versioned document sequences
ECIR'07 Proceedings of the 29th European conference on IR research
The adaptive web
Discovering implicit feedbacks from search engine log files
DS'07 Proceedings of the 10th international conference on Discovery science
ISVC'07 Proceedings of the 3rd international conference on Advances in visual computing - Volume Part II
Mining Query Logs: Turning Search Usage Data into Knowledge
Foundations and Trends in Information Retrieval
Design of SMACA: synthesis and its analysis through rule vector graph for web based application
International Journal of Intelligent Information and Database Systems
Mining the web with hierarchical crawlers – a resource sharing based crawling approach
International Journal of Intelligent Information and Database Systems
Caching search engine results over incremental indices
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Detecting spam bots in online social networking sites: a machine learning approach
DBSec'10 Proceedings of the 24th annual IFIP WG 11.3 working conference on Data and applications security and privacy
CAMEO: continuous analytics for massively multiplayer online games on cloud resources
Euro-Par'09 Proceedings of the 2009 international conference on Parallel processing
Searching the web with mobile images for location recognition
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Inverted index compression via online document routing
Proceedings of the 20th international conference on World wide web
Users search trends on WWW and their analysis
Proceedings of the First International Conference on Intelligent Interactive Technologies and Multimedia
On-line multi-threaded processing of web user-clicks on multi-core processors
VECPAR'10 Proceedings of the 9th international conference on High performance computing for computational science
Foundations and Trends in Information Retrieval
An approach to manage and search for software components
ACOS'06 Proceedings of the 5th WSEAS international conference on Applied computer science
A strategy for efficient crawling of rich internet applications
ICWE'11 Proceedings of the 11th international conference on Web engineering
Enhance web pages genre identification using neighboring pages
WISE'11 Proceedings of the 12th international conference on Web information system engineering
Researching Personal Information on the Public Web: Methods and Ethics
Social Science Computer Review
Combining text and link analysis for focused crawling
ICAPR'05 Proceedings of the Third international conference on Advances in Pattern Recognition - Volume Part I
Intellisearch: intelligent search for images and text on the web
ICIAR'06 Proceedings of the Third international conference on Image Analysis and Recognition - Volume Part I
Using hyperlink features to personalize web search
WebKDD'04 Proceedings of the 6th international conference on Knowledge Discovery on the Web: advances in Web Mining and Web Usage Analysis
Discriminating biased web manipulations in terms of link oriented measures
ISCIS'05 Proceedings of the 20th international conference on Computer and Information Sciences
Decomposition-Based optimization of reload strategies in the world wide web
WISE'05 Proceedings of the 6th international conference on Web Information Systems Engineering
Matching ontologies in open networked systems: techniques and applications
Journal on Data Semantics V
Focused crawling using latent semantic indexing – an application for vertical search engines
ECDL'05 Proceedings of the 9th European conference on Research and Advanced Technology for Digital Libraries
Learning the grammar of distant change in the world-wide web
AI'04 Proceedings of the 17th Australian joint conference on Advances in Artificial Intelligence
Searching moving objects in a spatio-temporal distributed database servers system
ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part II
CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images
Search engine indexing storage optimisation using Hamming distance
International Journal of Intelligent Information and Database Systems
Modelling web changes data recatched during A spread of internet virus
Mathematical and Computer Modelling: An International Journal
Semantic APIs: Scaling up towards the Semantic Web
International Journal of Information Management: The Journal for Information Professionals
International Journal of Web Based Communities
Sentimental Spidering: Leveraging Opinion Information in Focused Crawlers
ACM Transactions on Information Systems (TOIS)
Mobile search engine as a business model
Proceedings of the 12th International Conference on Electronic Commerce: Roadmap for the Future of Electronic Business
Generation of SMACA and its application in web services
PaCT'07 Proceedings of the 9th international conference on Parallel Computing Technologies
RetriBlog: An architecture-centered framework for developing blog crawlers
Expert Systems with Applications: An International Journal
Amharic-English bilingual web search engine
Proceedings of the International Conference on Management of Emergent Digital EcoSystems
Architecture specification of rule-based deep web crawler with indexer
International Journal of Knowledge and Web Intelligence
Topical crawling on the web through local site-searches
Journal of Web Engineering
Hi-index | 0.00 |
We offer an overview of current Web search engine design. After introducing a generic search engine architecture, we examine each engine component in turn. We cover crawling, local Web page storage, indexing, and the use of link analysis for boosting search performance. The most common design and implementation techniques for each of these components are presented. For this presentation we draw from the literature and from our own experimental search engine testbed. Emphasis is on introducing the fundamental concepts and the results of several performance analyses we conducted to compare different designs.