Hierarchical hidden Markov models for information extraction

Authors:
Marios Skounakis;Mark Craven;Soumya Ray
Affiliations:
Department of Computer Sciences, University of Wisconsin, Madison, Wisconsin;Department of Biostatistics & Medical Informatics, University of Wisconsin, Madison, Wisconsin;Department of Computer Sciences, University of Wisconsin, Madison, Wisconsin
Venue:
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Year:
2003

Citing 6
Cited 37

An Algorithm that Learns What‘s in a Name

Machine Learning - Special issue on natural language learning
The Hierarchical Hidden Markov Model: Analysis and Applications

Machine Learning
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Maximum Entropy Markov Models for Information Extraction and Segmentation

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
A novel use of statistical parsing to extract information from text

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Representing sentence structure in hidden Markov models for information extraction

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2

Biological applications of multi-relational data mining

ACM SIGKDD Explorations Newsletter
Web-scale information extraction in knowitall: (preliminary results)

Proceedings of the 13th international conference on World Wide Web
Two supervised learning approaches for name disambiguation in author citations

Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries
Dynamic conditional random fields: factorized probabilistic models for labeling and segmenting sequence data

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Name disambiguation in author citations using a K-way spectral clustering method

Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
Generic soft pattern models for definitional question answering

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Adaptive information extraction

ACM Computing Surveys (CSUR)
Simultaneous record detection and attribute labeling in web data extraction

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Soft pattern matching models for definitional question answering

ACM Transactions on Information Systems (TOIS)
Dynamic Conditional Random Fields: Factorized Probabilistic Models for Labeling and Segmenting Sequence Data

The Journal of Machine Learning Research
Techniques to incorporate the benefits of a hierarchy in a modified hidden Markov model

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Methodological Review: Extracting interactions between proteins from the literature

Journal of Biomedical Informatics
Perception-oriented online news extraction

Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
Relational Transformation-based Tagging for Activity Recognition

Fundamenta Informaticae - Progress on Multi-Relational Data Mining
Uncertainty management in rule-based information extraction systems

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Mining of Protein Subcellular Localizations based on a Syntactic Dependency Tree and WordNet

Proceedings of the 2008 conference on Knowledge-Based Software Engineering: Proceedings of the Eighth Joint Conference on Knowledge-Based Software Engineering
Using automated feature optimisation to create an adaptable relation extraction system

BioNLP '08 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
Hierarchical hidden Markov models with general state hierarchy

AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
A probabilistic learning method for XML annotation of documents

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
An adaptive bottom up clustering approach for web news extraction

WOCC'09 Proceedings of the 18th international conference on Wireless and Optical Communications Conference
Graph mutual reinforcement based bootstrapping

AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
Pattern-based extraction of addresses from web page content

APWeb'08 Proceedings of the 10th Asia-Pacific web conference on Progress in WWW research and development
Querying probabilistic information extraction

Proceedings of the VLDB Endowment
A text-based decision support system for financial sequence prediction

Decision Support Systems
Tagging complex NEs with maxent models: layered structures versus extended tagset

IJCNLP'04 Proceedings of the First international joint conference on Natural Language Processing
An overview and classification of adaptive approaches to information extraction

Journal on Data Semantics IV
Learning regular expressions from noisy sequences

SARA'05 Proceedings of the 6th international conference on Abstraction, Reformulation and Approximation
Integrating text chunking with mixture hidden markov models for effective biomedical information extraction

ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part II
KXtractor: an effective biomedical information extraction technique based on mixture hidden markov models

Transactions on Computational Systems Biology II
Hierarchical hidden conditional random fields for information extraction

LION'05 Proceedings of the 5th international conference on Learning and Intelligent Optimization
REV: extracting entity relations from world wide web

Proceedings of the 6th International Conference on Ubiquitous Information Management and Communication
P-top-k queries in a probabilistic framework from information extraction models

Computers & Mathematics with Applications
Relational Transformation-based Tagging for Activity Recognition

Fundamenta Informaticae - Progress on Multi-Relational Data Mining
Decision making aid in mobile environment by behavioral characteristic

Proceedings of the 13th International Conference on Electronic Commerce
A structural approach for modelling the hierarchical dynamic process of Web workload in a large-scale campus network

Journal of Network and Computer Applications
Marginalized Viterbi algorithm for hierarchical hidden Markov models

Pattern Recognition
Finding the most likely upper level state sequence for hierarchical HMMs

SLSP'13 Proceedings of the First international conference on Statistical Language and Speech Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Information extraction can be defined as the task of automatically extracting instances of specified classes or relations from text. We consider the case of using machine learning methods to induce models for extracting relation instances from biomedical articles. We propose and evaluate an approach that is based on using hierarchical hidden Markov models to represent the grammatical structure of the sentences being processed. Our approach first uses a shallow parser to construct a multi-level representation of each sentence being processed. Then we train hierarchical HMMs to capture the regularities of the parses for both positive and negative sentences. We evaluate our method by inducing models to extract binary relations in three biomedical domains. Our experiments indicate that our approach results in more accurate models than several baseline HMM approaches.