On the limited memory BFGS method for large scale optimization
Mathematical Programming: Series A and B
Fast discovery of association rules
Advances in knowledge discovery and data mining
Selective Sampling Using the Query by Committee Algorithm
Machine Learning
A scalable comparison-shopping agent for the World-Wide Web
AGENTS '97 Proceedings of the first international conference on Autonomous agents
A hierarchical approach to wrapper induction
Proceedings of the third annual conference on Autonomous Agents
Record-boundary discovery in Web documents
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Generating finite-state transducers for semi-structured data extraction from the Web
Information Systems - Special issue on semistructured data
Learning to Parse Natural Language with Maximum Entropy Models
Machine Learning - Special issue on natural language learning
Learning Information Extraction Rules for Semi-Structured and Free Text
Machine Learning - Special issue on natural language learning
Foundations of statistical natural language processing
Foundations of statistical natural language processing
MetaCost: a general method for making classifiers cost-sensitive
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Regression testing for wrapper maintenance
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Relational learning of pattern-match rules for information extraction
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Snowball: extracting relations from large plain-text collections
DL '00 Proceedings of the fifth ACM conference on Digital libraries
Automatic segmentation of text into structured records
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Learning and making decisions when costs and probabilities are both unknown
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Accelerated focused crawling through online relevance feedback
Proceedings of the 11th international conference on World Wide Web
A flexible learning system for wrapping tables and lists in HTML documents
Proceedings of the 11th international conference on World Wide Web
A machine learning based approach for table detection on the web
Proceedings of the 11th international conference on World Wide Web
Extracting query modifications from nonlinear SVMs
Proceedings of the 11th international conference on World Wide Web
Managing Gigabytes: Compressing and Indexing Documents and Images
Managing Gigabytes: Compressing and Indexing Documents and Images
Hierarchical Wrapper Induction for Semistructured Information Sources
Autonomous Agents and Multi-Agent Systems
Learning Logical Definitions from Relations
Machine Learning
Mining the Web: Discovering Knowledge from HyperText Data
Mining the Web: Discovering Knowledge from HyperText Data
FOCS '02 Proceedings of the 43rd Symposium on Foundations of Computer Science
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Active Learning for Natural Language Parsing and Information Extraction
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Maximum Entropy Markov Models for Information Extraction and Segmentation
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Information Extraction: Techniques and Challenges
SCIE '97 International Summer School on Information Extraction: A Multidisciplinary Approach to an Emerging Information Technology
Machine Learning for Sequential Data: A Review
Proceedings of the Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
Building Light-Weight Wrappers for Legacy Web Data-Sources Using W4F
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Visual Web Information Extraction with Lixto
Proceedings of the 27th International Conference on Very Large Data Bases
Selective Sampling with Redundant Views
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Interactive deduplication using active learning
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Information extraction for enhanced access to disease outbreak reports
Journal of Biomedical Informatics - Special issue: Sublanguage
TheaterLoc: Using Information Integration Technology to Rapidly Build Virtual Applications
ICDE '00 Proceedings of the 16th International Conference on Data Engineering
XWRAP: An XML-Enabled Wrapper Construction System for Web Information Sources
ICDE '00 Proceedings of the 16th International Conference on Data Engineering
Table extraction using conditional random fields
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Optimal aggregation algorithms for middleware
Journal of Computer and System Sciences - Special issu on PODS 2001
Robust and efficient fuzzy match for online data cleaning
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Extracting structured data from Web pages
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Evaluating probabilistic queries over imprecise data
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
A Fast Regular Expression Indexing Engine
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Wrapper induction for information extraction
Wrapper induction for information extraction
Kernel methods for relation extraction
The Journal of Machine Learning Research
Nymble: a high-performance learning name-finder
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Web-scale information extraction in knowitall: (preliminary results)
Proceedings of the 13th international conference on World Wide Web
Automatic detection of fragments in dynamically generated web pages
Proceedings of the 13th international conference on World Wide Web
Automatic acquisition of hyponyms from large text corpora
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Message Understanding Conference-6: a brief history
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Mining reference tables for automatic text segmentation
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
A survey of table recognition: Models, observations, transformations, and inferences
International Journal on Document Analysis and Recognition
Natural Language Engineering
Finding parts in very large corpora
ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
Aggregate operators in probabilistic databases
Journal of the ACM (JACM)
Opinion observer: analyzing and comparing opinions on the Web
WWW '05 Proceedings of the 14th international conference on World Wide Web
A search engine for natural language applications
WWW '05 Proceedings of the 14th international conference on World Wide Web
MYSTIQ: a system for finding more answers by using probabilities
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Personal information management with SEMEX
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Automatic wrapper maintenance for semi-structured web sources using results from previous queries
Proceedings of the 2005 ACM symposium on Applied computing
Survey of semantic annotation platforms
Proceedings of the 2005 ACM symposium on Applied computing
Automatic Fragment Detection in Dynamic Web Pages and Its Impact on Caching
IEEE Transactions on Knowledge and Data Engineering
Overview of the third message understanding evaluation and conference
MUC3 '91 Proceedings of the 3rd conference on Message understanding
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
SPIN: searching personal information networks
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
n-gram/2L: a space and time efficient two-level n-gram inverted index structure
VLDB '05 Proceedings of the 31st international conference on Very large data bases
OLAP over uncertain and imprecise data
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Learning structured prediction models: a large margin approach
Learning structured prediction models: a large margin approach
Large Margin Methods for Structured and Interdependent Output Variables
The Journal of Machine Learning Research
Mining knowledge from text using information extraction
ACM SIGKDD Explorations Newsletter - Natural language processing and text mining
Using text mining and natural language processing for health care claims processing
ACM SIGKDD Explorations Newsletter - Natural language processing and text mining
Learning Non-Generative Grammatical Models for Document Analysis
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Evaluating machine learning for information extraction
ICML '05 Proceedings of the 22nd international conference on Machine learning
Predicting good probabilities with supervised learning
ICML '05 Proceedings of the 22nd international conference on Machine learning
Extracting relations from large text collections
Extracting relations from large text collections
Information Extraction: Distilling Structured Data from Unstructured Text
Queue - Social Computing
Clustering with qualitative information
Journal of Computer and System Sciences - Special issue: Learning theory 2003
TEG—a hybrid approach to information extraction
Knowledge and Information Systems
Conditional structure versus conditional estimation in NLP models
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Information extraction from voicemail transcripts
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
A comparison of algorithms for maximum entropy parameter estimation
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Markov models for language-independent named entity recognition
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Use of support vector machines in extended named entity recognition
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
UMass/Hughes: description of the CIRCUS system used for Tipster text
TIPSTER '93 Proceedings of a workshop on held at Fredericksburg, Virginia: September 19-23, 1993
Introduction to the CoNLL-2003 shared task: language-independent named entity recognition
CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Exploring personal information
Communications of the ACM - Supporting exploratory search
Efficient Batch Top-k Search for Dictionary-based Entity Recognition
ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Integrating Unstructured Data into Relational Databases
ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Adaptive information extraction
ACM Computing Surveys (CSUR)
Optimizing scoring functions and indexes for proximity search in type-annotated corpora
Proceedings of the 15th international conference on World Wide Web
Adaptive Name Matching in Information Integration
IEEE Intelligent Systems
Documentum ECI self-repairing wrappers: performance analysis
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
ICML '06 Proceedings of the 23rd international conference on Machine learning
Efficient inference on sequence segmentation models
ICML '06 Proceedings of the 23rd international conference on Machine learning
Accelerated training of conditional random fields with stochastic gradient methods
ICML '06 Proceedings of the 23rd international conference on Machine learning
Text mining for product attribute extraction
ACM SIGKDD Explorations Newsletter
Combining linguistic and statistical analysis to extract relations from web documents
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Efficiently linking text documents with relevant structured information
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Creating probabilistic databases from information extraction models
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Bioinformatics
A divide-and-merge methodology for clustering
ACM Transactions on Database Systems (TODS)
Entity Resolution with Markov Logic
ICDM '06 Proceedings of the Sixth International Conference on Data Mining
A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data
The Journal of Machine Learning Research
Dependency tree kernels for relation extraction
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Collective information extraction with relational Markov networks
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Incorporating non-local information into information extraction systems by Gibbs sampling
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Extracting relations with integrated information using kernel methods
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Expressing implicit semantic relations without supervision
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
A composite kernel to extract relations between entities with both flat and structured features
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
An effective two-stage model for exploiting non-local dependencies in named entity recognition
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Extracting product features and opinions from reviews
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Identifying sources of opinions with conditional random fields and extraction patterns
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Extracting personal names from email: applying named entity recognition to informal text
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
KnowItNow: fast, scalable information extraction from the web
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
A shortest path dependency kernel for relation extraction
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Flexible text segmentation with structured multilabel classification
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Preemptive information extraction using unrestricted relation discovery
HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Transforming arbitrary tables into logical form with TARTAR
Data & Knowledge Engineering
The linguist's search engine: an overview
ACLdemo '05 Proceedings of the ACL 2005 on Interactive poster and demonstration sessions
Corrective feedback and persistent learning for information extraction
Artificial Intelligence
Communications of the ACM - ACM at sixty: a look back in time
Towards domain-independent information extraction from web tables
Proceedings of the 16th international conference on World Wide Web
Yago: a core of semantic knowledge
Proceedings of the 16th international conference on World Wide Web
Hierarchical, perceptron-like learning for ontology-based information extraction
Proceedings of the 16th international conference on World Wide Web
LIPTUS: associating structured and unstructured information in a banking environment
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Structured Prediction, Dual Extragradient and Bregman Projections
The Journal of Machine Learning Research
TableSeer: automatic table metadata extraction and searching in digital libraries
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Solving multiclass support vector machines with LaRank
Proceedings of the 24th international conference on Machine learning
Efficient inference with cardinality-based clique potentials
Proceedings of the 24th international conference on Machine learning
A semantic approach to contextual advertising
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Extracting relevant named entities for automated expense reimbursement
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Eliminating fuzzy duplicates in data warehouses
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Towards a query optimizer for text-centric tasks
ACM Transactions on Database Systems (TODS)
Top-k query evaluation with probabilistic guarantees
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Efficient query evaluation on probabilistic databases
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
An automatic data grabber for large web sites
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Autonomously semantifying wikipedia
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
OLAP over imprecise data with domain constraints
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Declarative information extraction using datalog with embedded extraction predicates
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Opinion Mining and Sentiment Analysis
Foundations and Trends in Information Retrieval
An Algebraic Approach to Rule-Based Information Extraction
ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Efficient Information Extraction over Evolving Text Data
ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
RAD: A Scalable Framework for Annotator Development
ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Domain adaptation with structural correspondence learning
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Entity annotation based on inverse index operations
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Learning field compatibilities to extract database records from unstructured text
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Wrapper maintenance: a machine learning approach
Journal of Artificial Intelligence Research
Creating relational data from unstructured and ungrammatical data sources
Journal of Artificial Intelligence Research
Journal of Artificial Intelligence Research
Open information extraction from the web
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Locating complex named entities in web text
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Adaptive information extraction from text by rule induction and generalisation
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Learning to understand web site update requests
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
A probabilistic model of redundancy in information extraction
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Semantic annotation of unstructured and ungrammatical text
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Comparative experiments on learning information extractors for proteins and their interactions
Artificial Intelligence in Medicine
Semantic annotation for knowledge management: Requirements and a survey of the state of the art
Web Semantics: Science, Services and Agents on the World Wide Web
Per-node optimization of finite-state mechanisms for natural language processing
CICLing'03 Proceedings of the 4th international conference on Computational linguistics and intelligent text processing
Using ILP to construct features for information extraction from semi-structured text
ILP'07 Proceedings of the 17th international conference on Inductive logic programming
Harvesting, searching, and ranking knowledge on the web: invited talk
Proceedings of the Second ACM International Conference on Web Search and Data Mining
Database and information-retrieval methods for knowledge discovery
Communications of the ACM - A Direct Path to Dependable Software
Domain adaptation of information extraction models
ACM SIGMOD Record
The YAGO-NAGA approach to knowledge discovery
ACM SIGMOD Record
SOFIE: a self-organizing framework for information extraction
Proceedings of the 18th international conference on World wide web
Collective annotation of Wikipedia entities in web text
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Do we mean the same?: disambiguation of extracted keyword queries for database search
Proceedings of the First International Workshop on Keyword Search on Structured Data
Efficiently incorporating user feedback into information extraction and integration programs
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Optimizing complex extraction programs over evolving text data
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
A translation model for matching reviews to objects
Proceedings of the 18th ACM conference on Information and knowledge management
Language-model-based ranking for queries on RDF-graphs
Proceedings of the 18th ACM conference on Information and knowledge management
Answering table augmentation queries from unstructured lists on the web
Proceedings of the VLDB Endowment
FOCIH: Form-Based Ontology Creation and Information Harvesting
ER '09 Proceedings of the 28th International Conference on Conceptual Modeling
Granular Computing for Text Mining: New Research Challenges and Opportunities
RSFDGrC '09 Proceedings of the 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
Matching reviews to objects using a language model
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Leveraging web streams for contractual situational awareness in operational BI
Proceedings of the 2010 EDBT/ICDT Workshops
Graph-based concept identification and disambiguation for enterprise search
Proceedings of the 19th international conference on World wide web
From information to knowledge: harvesting entities and relationships from web sources
Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
ONDUX: on-demand unsupervised learning for information extraction
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
SIE-OBI: a streaming information extraction platform for operational business intelligence
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Unsupervised strategies for information extraction by text segmentation
Proceedings of the Fourth SIGMOD PhD Workshop on Innovative Database Research
Method combination for information extraction
Proceedings of the 11th International Conference on Computer Systems and Technologies and Workshop for PhD Students in Computing on International Conference on Computer Systems and Technologies
Find your advisor: robust knowledge gathering from the web
Procceedings of the 13th International Workshop on the Web and Databases
PROSPECT: a system for screening candidates for recruitment
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Extracting structured information from Wikipedia articles to populate infoboxes
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Digging for knowledge with information extraction: a case study on human gene-disease associations
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Entity-focused sentence simplification for relation extraction
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Towards approximate SQL: infobright's approach
RSCTC'10 Proceedings of the 7th international conference on Rough sets and current trends in computing
Portable extraction of partially structured facts from the web
IceTAL'10 Proceedings of the 7th international conference on Advances in natural language processing
Evaluating information extraction
CLEF'10 Proceedings of the 2010 international conference on Multilingual and multimodal information access evaluation: cross-language evaluation forum
Annotating and searching web tables using entities, types and relationships
Proceedings of the VLDB Endowment
KBB: a knowledge-bundle builder for research studies
ER'10 Proceedings of the 2010 international conference on Advances in conceptual modeling: applications and challenges
Joint training for open-domain extraction on the web: exploiting overlap when supervision is limited
Proceedings of the fourth ACM international conference on Web search and data mining
Effective decision support systems for workforce deployment
IBM Journal of Research and Development
Shallow information extraction from medical forum data
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
SCAD: collective discovery of attribute values
Proceedings of the 20th international conference on World wide web
Using web-based knowledge extraction techniques to support cultural modeling
SBP'11 Proceedings of the 4th international conference on Social computing, behavioral-cultural modeling and prediction
Shopping for top forums: discovering online discussion for product research
Proceedings of the First Workshop on Social Media Analytics
Service-oriented information extraction
Proceedings of the 2011 Joint EDBT/ICDT Ph.D. Workshop
An overview of business intelligence technology
Communications of the ACM
Live business intelligence for the real-time enterprise
From active data management to event-based systems and more
Joint unsupervised structure discovery and information extraction
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Entity set expansion in opinion documents
Proceedings of the 22nd ACM conference on Hypertext and hypermedia
A metadata geoparsing system for place name recognition and resolution in metadata records
Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Query relaxation for entity-relationship search
ESWC'11 Proceedings of the 8th extended semantic web conference on The semanic web: research and applications - Volume Part II
Collective graph identification
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Web information extraction using markov logic networks
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Multidimensional database design from document-centric XML documents
DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
Enabling search for facts and implied facts in historical documents
Proceedings of the 2011 Workshop on Historical Document Imaging and Processing
Intelligent self-repairable web wrappers
AI*IA'11 Proceedings of the 12th international conference on Artificial intelligence around man and beyond
Keyword search over RDF graphs
Proceedings of the 20th ACM international conference on Information and knowledge management
Semi-supervised multi-task learning of structured prediction models for web information extraction
Proceedings of the 20th ACM international conference on Information and knowledge management
Enabling information extraction by inference of regular expressions from sample entities
Proceedings of the 20th ACM international conference on Information and knowledge management
Exploring the corporate ecosystem with a semi-supervised entity graph
Proceedings of the 20th ACM international conference on Information and knowledge management
OpinioNetIt: understanding the opinions-people network for politically controversial topics
Proceedings of the 20th ACM international conference on Information and knowledge management
Accurate information extraction for quantitative financial events
Proceedings of the 20th ACM international conference on Information and knowledge management
Multilingual ontologies for cross-language information extraction and semantic search
ER'11 Proceedings of the 30th international conference on Conceptual modeling
Identifying destinations automatically from human generated route directions
Proceedings of the 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Harmony and dissonance: organizing the people's voices on political controversies
Proceedings of the fifth ACM international conference on Web search and data mining
A platform for situational awareness in operational BI
Decision Support Systems
Information extraction, real-time processing and DW2.0 in operational business intelligence
DNIS'10 Proceedings of the 6th international conference on Databases in Networked Information Systems
IDA'10 Proceedings of the 9th international conference on Advances in Intelligent Data Analysis
Chapter 3: search for knowledge
Search Computing
Compressed data structures for annotated web search
Proceedings of the 21st international conference on World Wide Web
Extraction of procedural knowledge from the web: a comparison of two workflow extraction approaches
Proceedings of the 21st international conference companion on World Wide Web
Data mining for improving textbooks
ACM SIGKDD Explorations Newsletter
An analysis of the named entity recognition problem in digital library metadata
Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
A schema-driven approach for knowledge-oriented retrieval and query formulation
KEYS '12 Proceedings of the Third International Workshop on Keyword Search on Structured Data
Self-supervised learning approach for extracting citation information on the web
APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
Combining information extraction, deductive reasoning and machine learning for relation prediction
ESWC'12 Proceedings of the 9th international conference on The Semantic Web: research and applications
An approach for named entity recognition in poorly structured data
ESWC'12 Proceedings of the 9th international conference on The Semantic Web: research and applications
Making sense of location context
Proceedings of the 1st International Workshop on Context Discovery and Data Mining
Exploiting evidence from unstructured data to enhance master data management
Proceedings of the VLDB Endowment
A case for semantic full-text search
Proceedings of the 1st Joint International Workshop on Entity-Oriented and Semantic Search
Learning to "read between the lines" using Bayesian logic programs
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
A simple approach to the design of site-level extractors using domain-centric principles
Proceedings of the 21st ACM international conference on Information and knowledge management
Entity centric query expansion for enterprise search
Proceedings of the 21st ACM international conference on Information and knowledge management
Metadata enrichment services for the europeana digital library
TPDL'12 Proceedings of the Second international conference on Theory and Practice of Digital Libraries
Learning to predict from textual data
Journal of Artificial Intelligence Research
A performance comparison of parallel DBMSs and MapReduce on large-scale text analytics
Proceedings of the 16th International Conference on Extending Database Technology
Towards query model integration: topology-aware, IR-inspired metrics for declarative graph querying
Proceedings of the Joint EDBT/ICDT 2013 Workshops
Knowledge harvesting in the big-data era
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Automated crime report analysis and classification for e-government and decision support
Proceedings of the 14th Annual International Conference on Digital Government Research
Proceedings of the 10th Working Conference on Mining Software Repositories
SEED: a framework for extracting social events from press news
Proceedings of the 22nd international conference on World Wide Web companion
Learning joint query interpretation and response ranking
Proceedings of the 22nd international conference on World Wide Web
Information extraction as a filtering task
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Efficient parsing-based search over structured data
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Reporting bias and knowledge acquisition
Proceedings of the 2013 workshop on Automated knowledge base construction
Proceedings of the 2013 workshop on Automated knowledge base construction
Locating Discharge Medications in Natural Language Summaries
Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
Extraction of financial information from online business reports
ACM SIGMIS Database
ADC '13 Proceedings of the Twenty-Fourth Australasian Database Conference - Volume 137
Entity extraction, linking, classification, and tagging for social media: a wikipedia-based approach
Proceedings of the VLDB Endowment
Identifying the Truth: Aggregation of Named Entity Extraction Results
Proceedings of International Conference on Information Integration and Web-based Applications & Services
Entity resolution for distributed probabilistic data
Distributed and Parallel Databases
CALA: An unsupervised URL-based web page classification system
Knowledge-Based Systems
Hi-index | 0.03 |
The automatic extraction of information from unstructured sources has opened up new avenues for querying, organizing, and analyzing data by drawing upon the clean semantics of structured databases and the abundance of unstructured data. The field of information extraction has its genesis in the natural language processing community where the primary impetus came from competitions centered around the recognition of named entities like people names and organization from news articles. As society became more data oriented with easy online access to both structured and unstructured data, new applications of structure extraction came around. Now, there is interest in converting our personal desktops to structured databases, the knowledge in scientific publications to structured records, and harnessing the Internet for structured fact finding queries. Consequently, there are many different communities of researchers bringing in techniques from machine learning, databases, information retrieval, and computational linguistics for various aspects of the information extraction problem. This review is a survey of information extraction research of over two decades from these diverse communities. We create a taxonomy of the field along various dimensions derived from the nature of the extraction task, the techniques used for extraction, the variety of input resources exploited, and the type of output produced. We elaborate on rule-based and statistical methods for entity and relationship extraction. In each case we highlight the different kinds of models for capturing the diversity of clues driving the recognition process and the algorithms for training and efficiently deploying the models. We survey techniques for optimizing the various steps in an information extraction pipeline, adapting to dynamic data, integrating with existing entities and handling uncertainty in the extraction process.