Learning Information Extraction Rules for Semi-Structured and Free Text

Authors:
Stephen Soderland
Affiliations:
Department Computer Science and Engineering, University of Washington, Seattle, WA 98195-2350. soderlan@cs.washington.edu
Venue:
Machine Learning - Special issue on natural language learning
Year:
1999

Citing 14
Cited 253

A theory of the learnable

Communications of the ACM
C4.5: programs for machine learning

C4.5: programs for machine learning
A sequential algorithm for training text classifiers

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Improving Generalization with Active Learning

Machine Learning - Special issue on structured connectionist systems
Wrapper generation for semi-structured Internet sources

ACM SIGMOD Record
Learning Logical Definitions from Relations

Machine Learning
Multistrategy Learning for Information Extraction

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Learning information extraction patterns from examples

Connectionist, Statistical, and Symbolic Approaches to Learning for Natural Language Processing
Synthesizing Objects

ECOOP '99 Proceedings of the 13th European Conference on Object-Oriented Programming
Learning Text Analysis Rules for Domain-specific Natural Language Processing

Learning Text Analysis Rules for Domain-specific Natural Language Processing
Description of the UMass system as used for MUC-6

MUC6 '95 Proceedings of the 6th conference on Message understanding
SRA: description of the SRA system as used for MUC-6

MUC6 '95 Proceedings of the 6th conference on Message understanding
CRYSTAL inducing a conceptual dictionary

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Learning trees and rules with set-valued features

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1

Learning to remove Internet advertisements

Proceedings of the third annual conference on Autonomous Agents
Relational learning of pattern-match rules for information extraction

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Two dimensional generalization in information extraction

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Learning dictionaries for information extraction by multi-level bootstrapping

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Active learning for hierarchical wrapper induction

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
NaturalJava: a natural language interface for programming in Java

Proceedings of the 5th international conference on Intelligent user interfaces
A framework for specifying explicit bias for revision of approximate information extraction rules

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Learning to extract hierarchical information from semi-structured documents

Proceedings of the ninth international conference on Information and knowledge management
Web mining research: a survey

ACM SIGKDD Explorations Newsletter
Learning for semantic interpretation: scaling up without dumbing down

Learning language in logic
Learning for text categorization and information extraction with ILP

Learning language in logic
Improving learning by choosing examples intelligently in two natural language tasks

Learning language in logic
Automatic segmentation of text into structured records

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
An approach to automatic classification of text for information retrieval

Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
A brief survey of web data extraction tools

ACM SIGMOD Record
DEByE - Date extraction by example

Data & Knowledge Engineering
Hierarchical Wrapper Induction for Semistructured Information Sources

Autonomous Agents and Multi-Agent Systems
Human Language Technologies for Knowledge Management

IEEE Intelligent Systems
Mining Information for Functional Genomics

IEEE Intelligent Systems
Gleaning the Web

IEEE Intelligent Systems
First steps in building a model for the retrieval of court decisions

International Journal of Human-Computer Studies
Automatic information extraction from semi-structured Web pages by pattern discovery

Decision Support Systems - Web retrieval and mining
Semi-automatic Content Extraction from Specifications

NLDB '02 Proceedings of the 6th International Conference on Applications of Natural Language to Information Systems-Revised Papers
Extracting Information from Semi-structured Web Documents

OOIS '02 Proceedings of the Workshops on Advances in Object-Oriented Information Systems
Sentence Filtering for Information Extraction in Genomics, a Classification Problem

PKDD '01 Proceedings of the 5th European Conference on Principles of Data Mining and Knowledge Discovery
Information Extraction in Structured Documents Using Tree Automata Induction

PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
RoadRunner: Towards Automatic Data Extraction from Large Web Sites

Proceedings of the 27th International Conference on Very Large Data Bases
A Practical Agent-Based Method to Extract Semantic Information from the Web

CAiSE '02 Proceedings of the 14th International Conference on Advanced Information Systems Engineering
DubLet: An Online CBR System for Rental Property Recommendation

ICCBR '01 Proceedings of the 4th International Conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
A Knowledge-Based Information Extraction System for Semi-structured Labeled Documents

IDEAL '02 Proceedings of the Third International Conference on Intelligent Data Engineering and Automated Learning
Selecting a Relevant Set of Examples to Learn IE-Rules

Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
Where to Position the Precision in Knowledge Extraction from Text

Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
Information Extraction from HTML: Combining XML and Standard Techniques for IE from the Web

Proceedings of the 14th International conference on Industrial and engineering applications of artificial intelligence and expert systems: engineering of intelligent systems
Wrapper Generation by Using XML-Based Domain Knowledge for Intelligent Information Extraction

PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Automatic Extraction of Semantically-Meaningful Information from the Web.

AH '02 Proceedings of the Second International Conference on Adaptive Hypermedia and Adaptive Web-Based Systems
Extending Elementary Formal Systems

ALT '01 Proceedings of the 12th International Conference on Algorithmic Learning Theory
Event Pattern Discovery from the Stock Market Bulletin

DS '02 Proceedings of the 5th International Conference on Discovery Science
A Unifying Approach to HTML Wrapper Representation and Learning

DS '00 Proceedings of the Third International Conference on Discovery Science
A Term-Based Methodology for Template Creation in Information Extraction

NLP '00 Proceedings of the Second International Conference on Natural Language Processing
Information Extraction - Tree Alignment Approach to Pattern Discovery in Web Documents

DEXA '02 Proceedings of the 13th International Conference on Database and Expert Systems Applications
Mediation in a dynamic context: arguing for a request-oriented approach and structuring it

Web-enabled systems integration
Advanced elementary formal systems

Theoretical Computer Science - Selected papers in honour of Setsuo Arikawa
Complex relationships and knowledge discovery support in the InfoQuilt system

The VLDB Journal — The International Journal on Very Large Data Bases
A maximum entropy approach to information extraction from semi-structured and free text

Eighteenth national conference on Artificial intelligence
A System for Building Intelligent Agents that Learn to Retrieve and Extract Information

User Modeling and User-Adapted Interaction
Unsupervised learning of mDTD extraction patterns for web text mining

Information Processing and Management: an International Journal
Intelligent Web agents that learn to retrieve and extract information

Intelligent exploration of the web
Accurately and reliably extracting data from the Web: a machine learning approach

Intelligent exploration of the web
Learning rules and their exceptions

The Journal of Machine Learning Research
Bottom-up relational learning of pattern matching rules for information extraction

The Journal of Machine Learning Research
Ontology extraction and conceptual modeling for web information

Information modeling for internet applications
On Precision and Recall of Multi-Attribute Data Extraction from Semistructured Sources

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Datarover: a taxonomy based crawler for automated data extraction from data-intensive websites

WIDM '03 Proceedings of the 5th ACM international workshop on Web information and data management
Learning rules for information extraction

Natural Language Engineering
Event detection from online news documents for supporting environmental scanning

Decision Support Systems - Special issue: Knowledge management technique
Unsupervised learning of soft patterns for generating definitions from online news

Proceedings of the 13th international conference on World Wide Web
Web-scale information extraction in knowitall: (preliminary results)

Proceedings of the 13th international conference on World Wide Web
LearningPinocchio: adaptive information extraction for real world applications

Natural Language Engineering
Information Extraction from the Web: System and Techniques

Applied Intelligence
Automatic information extraction from large websites

Journal of the ACM (JACM)
Efficient Phrase-Based Document Indexing for Web Document Clustering

IEEE Transactions on Knowledge and Data Engineering
Ontology-Based Scalable and Portable Information Extraction System to Extract Biological Knowledge from Huge Collection of Biomedical Web Documents

WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
Web-Based Knowledge Acquisition to Impute Missing Values for Classification

WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
Learning by googling

ACM SIGKDD Explorations Newsletter
Information extraction with automatic knowledge expansion

Information Processing and Management: an International Journal
Gimme' the context: context-driven automatic semantic annotation with C-PANKOW

WWW '05 Proceedings of the 14th international conference on World Wide Web
Semantic case role detection for information extraction

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 2
Closing the gap: learning-based information extraction rivaling knowledge-engineering methods

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Automating the extraction of data from HTML tables with unknown structure

Data & Knowledge Engineering - Special issue: ER 2002
Mining knowledge from text using information extraction

ACM SIGKDD Explorations Newsletter - Natural language processing and text mining
Unsupervised named-entity extraction from the web: an experimental study

Artificial Intelligence
Mining information extraction rules from datasheets without linguistic parsing

IEA/AIE'2005 Proceedings of the 18th international conference on Innovations in Applied Artificial Intelligence
Learning IE rules for a set of related concepts

ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Using HLT for acquiring, retrieving and publishing knowledge in AKT: position paper

HLTKM '01 Proceedings of the workshop on Human Language Technology and Knowledge Management - Volume 2001
Learning extraction patterns for subjective expressions

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Rule identification from web pages by the XRML approach

Decision Support Systems
Two-phase learning for biological event extraction and verification

ACM Transactions on Asian Language Information Processing (TALIP)
Adaptive information extraction

ACM Computing Surveys (CSUR)
Approaches to text mining for clinical medical records

Proceedings of the 2006 ACM symposium on Applied computing
Combining linguistic and statistical analysis to extract relations from web documents

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Information extraction from structured documents using k-testable tree automaton inference

Data & Knowledge Engineering
A two-phase rule generation and optimization approach for wrapper generation

ADC '06 Proceedings of the 17th Australasian Database Conference - Volume 49
A Survey of Web Information Extraction Systems

IEEE Transactions on Knowledge and Data Engineering
Adapting Web information extraction knowledge via mining site-invariant and site-dependent features

ACM Transactions on Internet Technology (TOIT)
Combining Information Extraction Systems Using Voting and Stacked Generalization

The Journal of Machine Learning Research
Hierarchical rule generalisation for speaker identification in fiction books

SAICSIT '06 Proceedings of the 2006 annual research conference of the South African institute of computer scientists and information technologists on IT research in developing countries
Web wrapper induction: a brief survey

AI Communications
A semantic approach to IE pattern induction

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Cascading use of soft and hard matching pattern rules for weakly supervised information extraction

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Information extraction from single and multiple sentences

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Composition of conditional random fields for transfer learning

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
How can information extraction ease formalizing treatment processes in clinical practice guidelines?

Artificial Intelligence in Medicine
MBOI: discovery of business opportunities on the internet

HLT-Demo '05 Proceedings of HLT/EMNLP on Interactive Demonstrations
POSBIOTM/W: a development workbench for machine learning oriented biomedical text mining system

HLT-Demo '05 Proceedings of HLT/EMNLP on Interactive Demonstrations
Automatising the learning of lexical patterns: An application to the enrichment of WordNet by extracting semantic relationships from Wikipedia

Data & Knowledge Engineering
SERGEANT: A framework for building more flexible web agents by exploiting a search engine

Web Intelligence and Agent Systems
Dynamic Conditional Random Fields: Factorized Probabilistic Models for Labeling and Segmenting Sequence Data

The Journal of Machine Learning Research
Data Mining and Predictive Modeling of Biomolecular Network from Biomedical Literature Databases

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
FLUX-CIM: flexible unsupervised extraction of citation metadata

Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
A rote extractor with edit distance-based generalisation and multi-corpora precision calculation

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
ARE: instance splitting strategies for dependency relation-based information extraction

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
URES: an unsupervised web relation extraction system

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Webpage understanding: an integrated approach

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Context-aware wrapping: synchronized data extraction

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
From dirt to shovels: fully automatic tool generation from ad hoc data

Proceedings of the 35th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Mining and analysing scale-free protein protein interaction network

International Journal of Bioinformatics Research and Applications
Automated data extraction from the web with conditional models

International Journal of Business Intelligence and Data Mining
A wrapper generation system for PDF documents

Proceedings of the 2008 ACM symposium on Applied computing
Heuristic learning of rules for information extraction from web documents

Proceedings of the 2nd international conference on Scalable information systems
A modular information extraction system

Intelligent Data Analysis
Learning (k,l)-contextual tree languages for information extraction from web pages

Machine Learning
Automatic extraction of morphological information from botanical collections

Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
Boosting text segmentation via progressive classification

Knowledge and Information Systems
Enhancing keyword-based botanical information retrieval with information extraction

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Negation recognition in medical narrative reports

Information Retrieval
Open information extraction from the web

Communications of the ACM - Surviving the data deluge
Architecture and performance of the rule based comparison shopping: delivery cost experience

Proceedings of the 10th international conference on Electronic commerce
Using the Web to Reduce Data Sparseness in Pattern-Based Information Extraction

PKDD 2007 Proceedings of the 11th European conference on Principles and Practice of Knowledge Discovery in Databases
Bootstrapping Information Extraction from Semi-structured Web Pages

ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
WRAPPER INFERENCE FOR AMBIGUOUS WEB PAGES

Applied Artificial Intelligence
Automated Semantic Analysis of Schematic Data

World Wide Web
Self-supervised relation extraction from the Web

Knowledge and Information Systems
Extracting Semantic Frames from Thai Medical-Symptom Phrases with Unknown Boundaries

ASWC '08 Proceedings of the 3rd Asian Semantic Web Conference on The Semantic Web
Information Extraction

Foundations and Trends in Databases
Ad Hoc Data and the Token Ambiguity Problem

PADL '09 Proceedings of the 11th International Symposium on Practical Aspects of Declarative Languages
Data integration flows for business intelligence

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Adapting svm for data sparseness and imbalance: A case study in information extraction

Natural Language Engineering
Information Extraction from Thai Text with Unknown Phrase Boundaries

PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Information Extraction System Based on Hidden Markov Model

ISNN '09 Proceedings of the 6th International Symposium on Neural Networks on Advances in Neural Networks
Sub Node Extraction with Tree Based Wrappers

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
Methods for domain-independent information extraction from the web: an experimental comparison

AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
A task-based comparison of information extraction pattern models

DeepLP '07 Proceedings of the Workshop on Deep Linguistic Processing
Boosting unsupervised relation extraction by using NER

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Regular expression learning for information extraction

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
A high accuracy method for semi-supervised information extraction

NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
Exploiting subjectivity classification to improve information extraction

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
A local tree alignment-based soft pattern matching approach for information extraction

NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Acquiring word-meaning mappings for natural language interfaces

Journal of Artificial Intelligence Research
Wrapper maintenance: a machine learning approach

Journal of Artificial Intelligence Research
Active learning with multiple views

Journal of Artificial Intelligence Research
Creating relational data from unstructured and ungrammatical data sources

Journal of Artificial Intelligence Research
Information extraction from web documents based on local unranked tree automaton inference

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Bayesian information extraction network

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Comparing information extraction pattern models

IEBeyondDoc '06 Proceedings of the Workshop on Information Extraction Beyond The Document
Improving semi-supervised acquisition of relation extraction patterns

IEBeyondDoc '06 Proceedings of the Workshop on Information Extraction Beyond The Document
Learning domain-specific information extraction patterns from the Web

IEBeyondDoc '06 Proceedings of the Workshop on Information Extraction Beyond The Document
Adaptive information extraction from text by rule induction and generalisation

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Relational learning via propositional algorithms: an information extraction case study

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Semantic annotation of unstructured and ungrammatical text

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Unsupervised named-entity extraction from the Web: An experimental study

Artificial Intelligence
ExSearch: a novel vertical search engine for online barter business

Proceedings of the 18th ACM conference on Information and knowledge management
Rule identification from Web pages by the XRML approach

Decision Support Systems
Evaluation and optimization of the catalog search process of e-procurement platforms

Electronic Commerce Research and Applications
Using uneven margins SVM and perceptron for information extraction

CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
An information extraction approach to reorganizing and summarizing specifications

Information and Software Technology
Visual extraction of information from web pages

Journal of Visual Languages and Computing
Semantic annotation of biosystematics literature without training examples

Journal of the American Society for Information Science and Technology
Neural based approach to keyword extraction from documents

ICCSA'03 Proceedings of the 2003 international conference on Computational science and its applications: PartI
A semi-supervised algorithm for pattern discovery in information extraction from textual data

PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
Learning rules to extract protein interactions from biomedical text

PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
An integrated system of mining HTML texts and filtering structured documents

PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
Combining relations for information extraction from free text

ACM Transactions on Information Systems (TOIS)
Fuzzy pattern rule induction for information extraction

ISICA'07 Proceedings of the 2nd international conference on Advances in computation and intelligence
Automatic feeding of an innovation knowledge base using a semantic representation of field knowledge

OTM'07 Proceedings of the 2007 OTM Confederated international conference on On the move to meaningful internet systems: CoopIS, DOA, ODBASE, GADA, and IS - Volume Part I
A method for web information extraction

APWeb'08 Proceedings of the 10th Asia-Pacific web conference on Progress in WWW research and development
Pattern-based extraction of addresses from web page content

APWeb'08 Proceedings of the 10th Asia-Pacific web conference on Progress in WWW research and development
Using support vector machines for terrorism information extraction

ISI'03 Proceedings of the 1st NSF/NIJ conference on Intelligence and security informatics
A context-free markup language for semi-structured text

PLDI '10 Proceedings of the 2010 ACM SIGPLAN conference on Programming language design and implementation
Analysis of a probabilistic model of redundancy in unsupervised information extraction

Artificial Intelligence
Adaptive information extraction: core technologies for information agents

Intelligent information agents
Scientific literature metadata extraction based on HMM

CDVE'09 Proceedings of the 6th international conference on Cooperative design, visualization, and engineering
Method combination for information extraction

Proceedings of the 11th International Conference on Computer Systems and Technologies and Workshop for PhD Students in Computing on International Conference on Computer Systems and Technologies
An agent-based system framework for multi-slot web information extraction

CAR'10 Proceedings of the 2nd international Asia conference on Informatics in control, automation and robotics - Volume 3
Name entity recognition using inductive logic programming

Proceedings of the 2010 Symposium on Information and Communication Technology
Survey of data management and analysis in disaster situations

Journal of Systems and Software
Comparable entity mining from comparative questions

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Semantic role labeling for open information extraction

FAM-LbR '10 Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading
Clustering based approach to learning regular expressions over large alphabet for noisy unstructured text

AND '10 Proceedings of the fourth workshop on Analytics for noisy unstructured text data
A survey of paraphrasing and textual entailment methods

Journal of Artificial Intelligence Research
Incorporating linguistic expertise using ILP for named entity recognition in data hungry Indian languages

ILP'09 Proceedings of the 19th international conference on Inductive logic programming
Extracting chemical reactions from Thai text for semantics-based information retrieval

ACIIDS'10 Proceedings of the Second international conference on Intelligent information and database systems: Part I
An ontology-driven rote extractor for pattern disambiguation

Proceedings of the 48th Annual Southeast Regional Conference
Wrangler: interactive visual specification of data transformation scripts

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Joint unsupervised structure discovery and information extraction

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Classification based automatic information extraction system from free text

ACELAE'11 Proceedings of the 10th WSEAS international conference on communications, electrical & computer engineering, and 9th WSEAS international conference on Applied electromagnetics, wireless and optical communications
A local tree alignment approach to relation extraction of multiple arguments

Information Processing and Management: an International Journal
An analysis of open information extraction based on semantic role labeling

Proceedings of the sixth international conference on Knowledge capture
Insights from network structure for text mining

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
A research of the internet based on web information extraction and data fusion

ICWL'10 Proceedings of the 2010 international conference on New horizons in web-based learning
From one tree to a forest: a unified solution for structured web data extraction

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
An improved KNN algorithm for vertical search engines

WISM'11 Proceedings of the 2011 international conference on Web information systems and mining - Volume Part II
Mining popular menu items of a restaurant from web reviews

WISM'11 Proceedings of the 2011 international conference on Web information systems and mining - Volume Part II
Enabling information extraction by inference of regular expressions from sample entities

Proceedings of the 20th ACM international conference on Information and knowledge management
A text-based decision support system for financial sequence prediction

Decision Support Systems
A robust web personal name information extraction system

Expert Systems with Applications: An International Journal
A simhash-based scheme for locating product information from the web

Proceedings of the Second Symposium on Information and Communication Technology
Rule-based personalized comparison shopping including delivery cost

Electronic Commerce Research and Applications
Metadata extraction from bibliographies using bigram HMM

ICADL'04 Proceedings of the 7th international Conference on Digital Libraries: international collaboration and cross-fertilization
Document mining based on semantic understanding of text

CIARP'06 Proceedings of the 11th Iberoamerican conference on Progress in Pattern Recognition, Image Analysis and Applications
Self-supervised relation extraction from the web

ISMIS'06 Proceedings of the 16th international conference on Foundations of Intelligent Systems
Relation-Based document retrieval for biomedical literature databases

DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
ESpotter: adaptive named entity recognition for web browsing

WM'05 Proceedings of the Third Biennial conference on Professional Knowledge Management
Using a more powerful teacher to reduce the number of queries of the l* algorithm in practical applications

EPIA'05 Proceedings of the 12th Portuguese conference on Progress in Artificial Intelligence
The adaptability of english based web search algorithms to chinese search engines

APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
Mechanisms of knowledge evolution for web information extraction

Proceedings of the 2005 international conference on Federation over the Web
Automatic keyphrases extraction from document using neural network

ICMLC'05 Proceedings of the 4th international conference on Advances in Machine Learning and Cybernetics
Information extraction for user's utterance processing on ubiquitous robot companion

NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
Information extraction from email announcements

NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
Learning (k,l)-contextual tree languages for information extraction

ECML'05 Proceedings of the 16th European conference on Machine Learning
A machine learning approach to information extraction

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Wrapping PDF documents exploiting uncertain knowledge

CAiSE'06 Proceedings of the 18th international conference on Advanced Information Systems Engineering
iASA: learning to annotate the semantic web

Journal on Data Semantics IV
An overview and classification of adaptive approaches to information extraction

Journal on Data Semantics IV
SVM based learning system for information extraction

Proceedings of the First international conference on Deterministic and Statistical Methods in Machine Learning
Identifying relations for open information extraction

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Automatic relation extraction with model order selection and discriminative label identification

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Gaining process information from clinical practice guidelines using information extraction

AIME'05 Proceedings of the 10th conference on Artificial Intelligence in Medicine
Automatic generation of data types for classification of deep web sources

DILS'05 Proceedings of the Second international conference on Data Integration in the Life Sciences
Ontology creation: extraction of domain knowledge from web documents

ER'05 Proceedings of the 24th international conference on Conceptual Modeling
An interface agent for wrapper-based information extraction

PRIMA'04 Proceedings of the 7th Pacific Rim international conference on Intelligent Agents and Multi-Agent Systems
Information extraction, real-time processing and DW2.0 in operational business intelligence

DNIS'10 Proceedings of the 6th international conference on Databases in Networked Information Systems
Document interrogation: architecture, information extraction and approximate answers

EDBT'06 Proceedings of the 2006 international conference on Current Trends in Database Technology
Relation-Based document retrieval for biomedical IR

Transactions on Computational Systems Biology V
Chapter 6: web data extraction for service creation

Search Computing
Extracting structured subject information from digital document archives

ICADL'06 Proceedings of the 9th international conference on Asian Digital Libraries: achievements, Challenges and Opportunities
Towards ontology enrichment with treatment relations extracted from medical abstracts

ICADL'06 Proceedings of the 9th international conference on Asian Digital Libraries: achievements, Challenges and Opportunities
Using concept-based indexing to improve language modeling approach to genomic IR

ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
The HiLeX system for semantic information extraction

Transactions on Large-Scale Data- and Knowledge-Centered Systems V
REV: extracting entity relations from world wide web

Proceedings of the 6th International Conference on Ubiquitous Information Management and Communication
CharaParser for fine-grained semantic annotation of organism morphological descriptions

Journal of the American Society for Information Science and Technology
Learning twig and path queries

Proceedings of the 15th International Conference on Database Theory
Open information extraction: the second generation

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume One
Learning to adapt cross language information extraction wrapper

Applied Intelligence
Research directions in data wrangling: visuatizations and transformations for usable and credible data

Information Visualization - Special issue on State of the Field and New Research Directions
Using information extraction to generate trigger questions for academic writing support

ITS'12 Proceedings of the 11th international conference on Intelligent Tutoring Systems
User-driven relational models for entity-relation search and extraction

Proceedings of the 1st Joint International Workshop on Entity-Oriented and Semantic Search
A lexico-semantic pattern language for learning ontology instances from text

Web Semantics: Science, Services and Agents on the World Wide Web
WizIE: a best practices guided development environment for information extraction

ACL '12 Proceedings of the ACL 2012 System Demonstrations
Towards efficient named-entity rule induction for customizability

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
TEX: An efficient and effective unsupervised Web information extractor

Knowledge-Based Systems
The Medical Semantic Web: Opportunities and Issues

International Journal of Information Technology and Web Engineering
Learning to predict from textual data

Journal of Artificial Intelligence Research
A general theory of spatial relations to support a graphical tool for visual information extraction

Journal of Visual Languages and Computing
Enhancing search: events and their discourse context

CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 2
I can do text analytics!: designing development tools for novice developers

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Unsupervised wrapper induction using linked data

Proceedings of the seventh international conference on Knowledge capture
SEED: a framework for extracting social events from press news

Proceedings of the 22nd international conference on World Wide Web companion
Web news extraction via path ratios

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Unsupervised discovery and extraction of semi-structured regions in text via self-information

Proceedings of the 2013 workshop on Automated knowledge base construction
"Mining events from the literature for bioinformatics applications" by S. Ananiadou, P. Thompson, and R. Nawaz; with Martin Vesely as coordinator

ACM SIGWEB Newsletter
Entity extraction, linking, classification, and tagging for social media: a wikipedia-based approach

Proceedings of the VLDB Endowment
Learning regular expressions to template-based FAQ retrieval systems

Knowledge-Based Systems
Autonomous knowledge acquisition based on artificial curiosity: Application to mobile robots in an indoor environment

Robotics and Autonomous Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

A wealth of on-line text information can be made available toautomatic processing by information extraction (IE) systems. Each IEapplication needs a separate set of rules tuned to the domain andwriting style. WHISK helps to overcome this knowledge-engineeringbottleneck by learning text extraction rules automatically.WHISK is designed to handle text styles ranging from highly structuredto free text, including text that is neither rigidly formatted norcomposed of grammatical sentences. Such semi-structured text haslargely been beyond the scope of previous systems. When used inconjunction with a syntactic analyzer and semantic tagging, WHISK canalso handle extraction from free text such as news stories.