The web has greatly improved access to scientific literature. However, scientific articles on the web are largely disorganized: research articles are spread across archive sites, institution sites, journal sites, and researcher homepages. No index covers all of the available literature, and the major web search engines typically do not index the content of PostScript/PDF documents at all. This paper discusses the creation of digital libraries of scientific literature on the web, including the efficient location of articles, full-text indexing of the articles, autonomous citation indexing, information extraction, display of query-sensitive summaries and citation context, hubs and authorities computation, similar document detection, user profiling, distributed error correction, graph analysis, and detection of overlapping documents. The software for the system is available at no cost for non-commercial use.
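The abstract names a hubs and authorities computation without saying how it is carried out. One well-known way to compute such scores over a citation graph is Kleinberg's HITS iteration; the sketch below illustrates that technique on the assumption that documents are nodes and citations are directed edges. It is not taken from the system itself, and the function name `hits`, the `links` input format, and the iteration count are all hypothetical choices made for illustration.

```python
from math import sqrt


def hits(links, iterations=50):
    """Illustrative HITS iteration (assumed technique, not the paper's code).

    links: dict mapping each document id to the list of ids it cites.
    Returns (hub, authority) dicts with L2-normalized scores.
    """
    # Collect every document that cites or is cited.
    nodes = set(links)
    for targets in links.values():
        nodes.update(targets)

    hub = {n: 1.0 for n in nodes}
    auth = {n: 1.0 for n in nodes}

    for _ in range(iterations):
        # Authority score: sum of hub scores of the documents citing it.
        new_auth = {n: 0.0 for n in nodes}
        for src, targets in links.items():
            for dst in targets:
                new_auth[dst] += hub[src]
        norm = sqrt(sum(v * v for v in new_auth.values())) or 1.0
        auth = {n: v / norm for n, v in new_auth.items()}

        # Hub score: sum of authority scores of the documents it cites.
        new_hub = {n: 0.0 for n in nodes}
        for src, targets in links.items():
            for dst in targets:
                new_hub[src] += auth[dst]
        norm = sqrt(sum(v * v for v in new_hub.values())) or 1.0
        hub = {n: v / norm for n, v in new_hub.items()}

    return hub, auth


if __name__ == "__main__":
    # Toy citation graph: papers "a" and "b" both cite paper "c".
    hub, auth = hits({"a": ["c"], "b": ["c"], "c": []})
    print(max(auth, key=auth.get))
```

In this toy graph the cited paper ends up with the highest authority score, while the two citing papers share equal hub scores. A production system would likely use sparse matrices and a convergence test rather than a fixed number of iterations.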