Indexing and retrieval of scientific literature
Proceedings of the eighth international conference on Information and knowledge management
Power browser: efficient Web browsing for PDAs
Proceedings of the SIGCHI conference on Human Factors in Computing Systems
Synchronizing a database to improve freshness
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Information retrieval on the web
ACM Computing Surveys (CSUR)
Intelligent crawling on the World Wide Web with arbitrary predicates
Proceedings of the 10th international conference on World Wide Web
An adaptive model for optimizing performance of an incremental web crawler
Proceedings of the 10th international conference on World Wide Web
Breadth-first crawling yields high-quality pages
Proceedings of the 10th international conference on World Wide Web
Retrieving and organizing web pages by “information unit”
Proceedings of the 10th international conference on World Wide Web
Personalized spiders for web search and analysis
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
ACM Transactions on Internet Technology (TOIT)
Evaluating topic-driven web crawlers
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
On the design of a learning crawler for topical resource discovery
ACM Transactions on Information Systems (TOIS)
Proceedings of the 11th international conference on World Wide Web
Accelerated focused crawling through online relevance feedback
Proceedings of the 11th international conference on World Wide Web
Using web structure for classifying and describing web pages
Proceedings of the 11th international conference on World Wide Web
Topic-oriented collaborative crawling
Proceedings of the eleventh international conference on Information and knowledge management
Categorizing information objects from user access patterns
Proceedings of the eleventh international conference on Information and knowledge management
I/O-efficient techniques for computing pagerank
Proceedings of the eleventh international conference on Information and knowledge management
ACM Computing Surveys (CSUR)
Automating the Construction of Internet Portals with Machine Learning
Information Retrieval
Text Retrieval Systems for the Web
Programming and Computing Software
Mercator: A scalable, extensible Web crawler
World Wide Web
Hyperlink Analysis for the Web
IEEE Internet Computing
Query Relaxation by Structure and Semantics for Retrieval of Logical Web Documents
IEEE Transactions on Knowledge and Data Engineering
Design and evaluation of a multi-agent collaborative Web mining system
Decision Support Systems - Web retrieval and mining
Living Hypertext - Web Retrieval Techniques for Traditional Database-Centric Information
IICS '02 Proceedings of the Second International Workshop on Innovative Internet Computing Systems
Optimal Allocation of Heterogeneous Robots in World Wide Web Search Engines
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Finding Similar Queries to Satisfy Searches Based on Query Traces
OOIS '02 Proceedings of the Workshops on Advances in Object-Oriented Information Systems
Distributed Hypertext Resource Discovery Through Examples
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
The Evolution of the Web and Implications for an Incremental Crawler
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Focused Crawling Using Context Graphs
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Proceedings of the 27th International Conference on Very Large Data Bases
Metadata Based Web Mining for Topic-Specific Information Gathering
EC-WEB '00 Proceedings of the First International Conference on Electronic Commerce and Web Technologies
Design and Implementation of a Distributed Crawler and Filtering Processor
NGITS '02 Proceedings of the 5th International Workshop on Next Generation Information Technologies and Systems
Crawling for Images on the WWW
VISUAL '99 Proceedings of the Third International Conference on Visual Information and Information Systems
Crawlets: Agents for High Performance Web Search Engines
MA '01 Proceedings of the 5th International Conference on Mobile Agents
Agents, Crawlers, and Web Retrieval
CIA '02 Proceedings of the 6th International Workshop on Cooperative Information Agents VI
Web Information Retrieval - an Algorithmic Perspective
ESA '00 Proceedings of the 8th Annual European Symposium on Algorithms
Building Topic-Specific Collections with Intelligent Agents
IS&N '99 Proceedings of the 6th International Conference on Intelligence and Services in Networks: Paving the Way for an Open Service Market
Holmes: a prototype for the targeted search of information about hi-tech companies
Second international workshop on Intelligent systems design and application
Adaptive on-line page importance computation
WWW '03 Proceedings of the 12th international conference on World Wide Web
Efficient URL caching for world wide web crawling
WWW '03 Proceedings of the 12th international conference on World Wide Web
Algorithmic aspects of information retrieval on the web
Handbook of massive data sets
Handbook of massive data sets
Searching large text collections
Handbook of massive data sets
Complementing search engines with online web mining agents
Decision Support Systems - Special issue: Web data mining
Journal of the American Society for Information Science and Technology
HelpfulMed: intelligent searching for medical information over the internet
Journal of the American Society for Information Science and Technology
Reinforcement learning based on local state feature learning and policy adjustment
Information Sciences—Informatics and Computer Science: An International Journal - Special issue: Introduction to multimedia and mobile agents
Improving web search by the identification of contextual information
Intelligent exploration of the web
Deriving and verifying statistical distribution of a hyperlink-based Web page quality metric
Data & Knowledge Engineering
Ontology-focused crawling of Web documents
Proceedings of the 2003 ACM symposium on Applied computing
Effective page refresh policies for Web crawlers
ACM Transactions on Database Systems (TODS)
Assigning document identifiers to enhance compressibility of Web Search Engines indexes
Proceedings of the 2004 ACM symposium on Applied computing
Proceedings of the 13th international conference on World Wide Web
Panorama: extending digital libraries with topical crawlers
Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries
Average-clicks: a new measure of distance on the World Wide Web
Journal of Intelligent Information Systems - Special issue on web intelligence
Discovery of ads web hosts through traffic data analysis
Proceedings of the 9th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
Scaling IR-system evaluation using term relevance sets
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Design of a crawler with bounded bandwidth
Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters
A Weighted Freshness Metric for Maintaining Search Engine Local Repository
WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
Researchexplorer: gaining insights through exploration in multimedia scientific data
Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval
High performance crawling system
Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval
Topical web crawlers: Evaluating adaptive algorithms
ACM Transactions on Internet Technology (TOIT)
Local methods for estimating pagerank values
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Exploiting Interclass Rules for Focused Crawling
IEEE Intelligent Systems
Learnable topic-specific web crawler
Journal of Network and Computer Applications - Special issue on computational intelligence on the internet
A General Evaluation Framework for Topical Crawlers
Information Retrieval
Crawling a country: better strategies than breadth-first for web page ordering
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
What's there and what's not?: focused crawling for missing documents in digital libraries
Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
A personal system for web image retrieval
WISICT '05 Proceedings of the 4th international symposium on Information and communication technologies
Using web structure and summarisation techniques for web content mining
Information Processing and Management: an International Journal
Hyperlink analysis on the world wide web
Proceedings of the sixteenth ACM conference on Hypertext and hypermedia
Characterizing a national community web
ACM Transactions on Internet Technology (TOIT)
Focused crawling for both topical relevance and quality of medical information
Proceedings of the 14th ACM international conference on Information and knowledge management
WebGuard: A Web Filtering Engine Combining Textual, Structural, and Visual Content-Based Analysis
IEEE Transactions on Knowledge and Data Engineering
MultiSumQA '02 proceedings of the 2002 conference on multilingual summarization and question answering - Volume 19
Efficient PageRank approximation via graph aggregation
Information Retrieval
Quality and relevance of domain-specific search: A case study in mental health
Information Retrieval
Geographically focused collaborative crawling
Proceedings of the 15th international conference on World Wide Web
Effective web-scale crawling through website analysis
Proceedings of the 15th international conference on World Wide Web
Topic-specific crawling on the web with the measurements of the relevancy context graph
Information Systems - Special issue: The semantic web and web services
Modelling information persistence on the web
ICWE '06 Proceedings of the 6th international conference on Web engineering
Stanford WebBase components and applications
ACM Transactions on Internet Technology (TOIT)
Estimating the global pagerank of web communities
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
A weighted ranking algorithm for facet-based component retrieval system
ACST'06 Proceedings of the 2nd IASTED international conference on Advances in computer science and technology
Efficient, automatic web resource harvesting
WIDM '06 Proceedings of the 8th annual ACM international workshop on Web information and data management
A Relation-Based Search Engine in Semantic Web
IEEE Transactions on Knowledge and Data Engineering
Exploiting structural similarity for effective Web information extraction
Data & Knowledge Engineering
Personalized mining of web documents using link structures and fuzzy concept networks
Applied Soft Computing
Mining communities and their relationships in blogs: A study of online hate groups
International Journal of Human-Computer Studies
Architecture of a grid-enabled Web search engine
Information Processing and Management: an International Journal
Using HMM to learn user browsing patterns for focused web crawling
Data & Knowledge Engineering - Special issue: WIDM 2004
Automatic classification of Web queries using very large unlabeled query logs
ACM Transactions on Information Systems (TOIS)
CMedPort: an integrated approach to facilitating Chinese medical information seeking
Decision Support Systems
Information categorization in web pages and sites
Web Intelligence and Agent Systems
Detecting near-duplicates for web crawling
Proceedings of the 16th international conference on World Wide Web
The discoverability of the web
Proceedings of the 16th international conference on World Wide Web
Implementation of a modern web search engine cluster
ATEC '03 Proceedings of the annual conference on USENIX Annual Technical Conference
Cha-Cha: a system for organizing intranet search results
USITS'99 Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2
I/O-conscious data preparation for large-scale web search engines
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Repeatable evaluation of search services in dynamic environments
ACM Transactions on Information Systems (TOIS)
Accurate and efficient crawling for relevant websites
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Computing pagerank in a distributed internet search system
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Designing clustering-based web crawling policies for search engine crawlers
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
A machine learning approach to web page filtering using content and structure analysis
Decision Support Systems
RankMass crawler: a crawler with high personalized pagerank coverage guarantee
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
The Viúva Negra crawler: an experience report
Software—Practice & Experience
Crawl ordering by search impact
WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
DistanceRank: An intelligent ranking algorithm for web pages
Information Processing and Management: an International Journal
CEA'07 Proceedings of the 2007 annual Conference on International Conference on Computer Engineering and Applications
A punishment/reward based approach to ranking
Proceedings of the 2nd international conference on Scalable information systems
Microscale evolution of web pages
Proceedings of the 17th international conference on World Wide Web
Predicting defects using network analysis on dependency graphs
Proceedings of the 30th international conference on Software engineering
BioCrawler: An intelligent crawler for the semantic web
Expert Systems with Applications: An International Journal
Focused web crawling in the acquisition of comparable corpora
Information Retrieval
Search effectiveness with a breadth-first crawl
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Exploiting Multiple Features with MEMMs for Focused Web Crawling
NLDB '08 Proceedings of the 13th international conference on Natural Language and Information Systems: Applications of Natural Language to Information Systems
Ant Focused Crawling Algorithm
ICAISC '08 Proceedings of the 9th international conference on Artificial Intelligence and Soft Computing
A Quantitative Evaluation of Dissemination-Time Preservation Metadata
ECDL '08 Proceedings of the 12th European conference on Research and Advanced Technology for Digital Libraries
Parallel crawler architecture and web page change detection
WSEAS Transactions on Computers
Local approximation of pagerank and reverse pagerank
Proceedings of the 17th ACM conference on Information and knowledge management
On the feasibility of geographically distributed web crawling
Proceedings of the 3rd international conference on Scalable information systems
Network structure mining: locating and isolating core members in covert terrorist networks
WSEAS Transactions on Information Science and Applications
Nuclear Threat Detection Via the Nuclear Web and Dark Web: Framework and Preliminary Study
EuroISI '08 Proceedings of the 1st European Conference on Intelligence and Security Informatics
Focused Crawling with Heterogeneous Semantic Information
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
An attentive self-organizing neural model for text mining
Expert Systems with Applications: An International Journal
Choose the Damping, Choose the Ranking?
WAW '09 Proceedings of the 6th International Workshop on Algorithms and Models for the Web-Graph
A cross-language focused crawling algorithm based on multiple relevance prediction strategies
Computers & Mathematics with Applications
Design of CORE: context ontology rule enhanced focused web crawler
Proceedings of the International Conference on Advances in Computing, Communication and Control
Proceedings of the 3rd workshop on Information credibility on the web
Measuring the Search Effectiveness of a Breadth-First Crawl
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Quantifying performance and quality gains in distributed web search engines
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
The impact of crawl policy on web search effectiveness
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
State of the Art in Semantic Focused Crawlers
ICCSA '09 Proceedings of the International Conference on Computational Science and Its Applications: Part II
HITS Can Converge Slowly, but Not Too Slowly, in Score and Rank
COCOON '09 Proceedings of the 15th Annual International Conference on Computing and Combinatorics
Multiple-goal heuristic search
Journal of Artificial Intelligence Research
Adaptive geospatially focused crawling
Proceedings of the 18th ACM conference on Information and knowledge management
The graph neural network model
IEEE Transactions on Neural Networks
FICA: A novel intelligent crawling algorithm based on reinforcement learning
Web Intelligence and Agent Systems
SHARC: framework for quality-conscious web archiving
Proceedings of the VLDB Endowment
Proceedings of the VLDB Endowment
Adaptive focused crawler based on tunneling and link analysis
ICACT'09 Proceedings of the 11th international conference on Advanced Communication Technology - Volume 3
Internet information broadcaster for China emerging rural market
Mobility '09 Proceedings of the 6th International Conference on Mobile Technology, Application & Systems
Using Web structure and summarisation techniques for Web content mining
Information Processing and Management: an International Journal
Application of rough ensemble classifier to web services categorization and focused crawling
Web Intelligence and Agent Systems
Foundations and Trends in Information Retrieval
Proceedings of the International Conference and Workshop on Emerging Trends in Technology
Choose the damping, choose the ranking?
Journal of Discrete Algorithms
Eliminate redundancy in parallel search: a multi-agent coordination approach
PRICAI'06 Proceedings of the 9th Pacific Rim international conference on Artificial intelligence
Implementation of a web robot and statistics on the Korean web
HSI'03 Proceedings of the 2nd international conference on Human.society@internet
Towards a next-generation search engine
PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
The adaptive web
New-web search with microblog annotations
Proceedings of the 19th international conference on World wide web
News page discovery policy for instant crawlers
AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
Connectivity of the Thai web graph
APWeb'08 Proceedings of the 10th Asia-Pacific web conference on Progress in WWW research and development
CRAYSE: design and implementation of efficient text search algorithm in a web crawler
ACM SIGSOFT Software Engineering Notes
A Collection of Comparable Corpora for Under-resourced Languages
Proceedings of the 2010 conference on Human Language Technologies -- The Baltic Perspective: Proceedings of the Fourth International Conference Baltic HLT 2010
Don't tread on me: moderating access to OSN data with spikestrip
WOSN'10 Proceedings of the 3rd conference on Online social networks
Wise search engine based on LSI
ADMI'10 Proceedings of the 6th international conference on Agents and data mining interaction
Research on the establishment of structural e-learning resources
Edutainment'10 Proceedings of the Entertainment for education, and 5th international conference on E-learning and games
Where to crawl next for focused crawlers
KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part IV
Application of structured document parsing to focused web crawling
Computer Standards & Interfaces
Scalable information extraction for web queries
International Journal of Computational Science and Engineering
Fixing the threshold for effective detection of near duplicate web documents in web crawling
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications: Part I
Design and implementation of contextual information portals
Proceedings of the 20th international conference companion on World wide web
The SHARC framework for data quality in Web archiving
The VLDB Journal — The International Journal on Very Large Data Bases
Focused web crawler with revisit policy
Proceedings of the International Conference & Workshop on Emerging Trends in Technology
Freshness tuning in focused crawler
Proceedings of the International Conference & Workshop on Emerging Trends in Technology
Architecture for a parallel focused crawler for clickstream analysis
ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
Archiving the web using page changes patterns: a case study
Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Benefits of bias: towards better characterization of network sampling
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Improving the quality of web archives through the importance of changes
DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part I
Coherence-oriented crawling and navigation using patterns for web archives
TPDL'11 Proceedings of the 15th international conference on Theory and practice of digital libraries: research and advanced technology for digital libraries
A vertical search engine for school information based on Heritrix and Lucene
ICHIT'11 Proceedings of the 5th international conference on Convergence and hybrid information technology
Mining popular menu items of a restaurant from web reviews
WISM'11 Proceedings of the 2011 international conference on Web information systems and mining - Volume Part II
A framework for incremental deep web crawler based on URL classification
WISM'11 Proceedings of the 2011 international conference on Web information systems and mining - Volume Part II
Learning regional transliteration variants
Information Processing and Management: an International Journal
Information Sciences: an International Journal
Babouk: Focused Web Crawling for Corpus Compilation and Automatic Terminology Extraction
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Discovering URLs through user feedback
Proceedings of the 20th ACM international conference on Information and knowledge management
User browsing behavior-driven web crawling
Proceedings of the 20th ACM international conference on Information and knowledge management
Local computation of PageRank: the ranking side
Proceedings of the 20th ACM international conference on Information and knowledge management
Characterization of evaluation metrics in topical web crawling based on genetic algorithm
ICNC'05 Proceedings of the First international conference on Advances in Natural Computation - Volume Part II
PaSE: locating online copy of scientific documents effectively
ICADL'04 Proceedings of the 7th international Conference on Digital Libraries: international collaboration and cross-fertilization
Using content-based and link-based analysis in building vertical search engines
ICADL'04 Proceedings of the 7th international Conference on Digital Libraries: international collaboration and cross-fertilization
wHunter: a focused web crawler – a tool for digital library
ICADL'04 Proceedings of the 7th international Conference on Digital Libraries: international collaboration and cross-fertilization
Combining text and link analysis for focused crawling
ICAPR'05 Proceedings of the Third international conference on Advances in Pattern Recognition - Volume Part I
Adaptive topical web crawling for domain-specific resource discovery guided by link-context
MICAI'06 Proceedings of the 5th Mexican international conference on Artificial Intelligence
A focused crawling for the web resource discovery using a modified proximal support vector machines
ICCSA'05 Proceedings of the 2005 international conference on Computational Science and its Applications - Volume Part I
A focused crawler with document segmentation
IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
Crawling Ajax-Based Web Applications through Dynamic Analysis of User Interface State Changes
ACM Transactions on the Web (TWEB)
Multi-modal services for web information collection based on multi-agent techniques
PRIMA'06 Proceedings of the 9th Pacific Rim international conference on Agent Computing and Multi-Agent Systems
LocalRank: ranking web pages considering geographical locality by integrating web and databases
DEXA'05 Proceedings of the 16th international conference on Database and Expert Systems Applications
An incremental approach to link evaluation in topic-driven web resource discovery
AAIM'05 Proceedings of the First international conference on Algorithmic Applications in Management
Ontology based web crawling – a novel approach
AWIC'05 Proceedings of the Third international conference on Advances in Web Intelligence
Focused crawling using latent semantic indexing – an application for vertical search engines
ECDL'05 Proceedings of the 9th European conference on Research and Advanced Technology for Digital Libraries
Analyzing terrorist networks: a case study of the global salafi jihad network
ISI'05 Proceedings of the 2005 IEEE international conference on Intelligence and Security Informatics
Algorithm for generating fuzzy rules for WWW document classification
ICAISC'06 Proceedings of the 8th international conference on Artificial Intelligence and Soft Computing
Online sampling of high centrality individuals in social networks
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
ARCOMEM: from collect-all ARchives to COmmunity MEMories
Proceedings of the 21st international conference companion on World Wide Web
A novel crawling algorithm for web pages
AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Lexical profiling of existing web directories to support fine-grained topic-focused web crawling
IRSG'08 Proceedings of the 2008 BCS-IRSG conference on Corpus Profiling
Availability of the OGC geoprocessing standard: March 2011 reality check
Computers & Geosciences
Proceedings of the 3rd Annual ACM Web Science Conference
Sentimental Spidering: Leveraging Opinion Information in Focused Crawlers
ACM Transactions on Information Systems (TOIS)
Semantic ranking of web pages based on formal concept analysis
Journal of Systems and Software
Computer Networks: The International Journal of Computer and Telecommunications Networking
Reprint of: The anatomy of a large-scale hypertextual web search engine
Computer Networks: The International Journal of Computer and Telecommunications Networking
Designing a fast file system crawler with incremental differencing
ACM SIGOPS Operating Systems Review
E-FFC: an enhanced form-focused crawler for domain-specific deep web databases
Journal of Intelligent Information Systems
Archival HTTP redirection retrieval policies
Proceedings of the 22nd international conference on World Wide Web companion
Researcher homepage classification using unlabeled data
Proceedings of the 22nd international conference on World Wide Web
Chronicle security against covert crawling
Proceedings of the First International Conference on Security of Internet of Things
Incorporating the surfing behavior of web users into pagerank
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
CUVIM: extracting fresh information from social network
WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
Hi-index | 0.01 |