Mining association language patterns using a distributional semantic model for negative life event classification

Authors:
Liang-Chih Yu;Chien-Lung Chan;Chao-Cheng Lin;I-Chun Lin
Affiliations:
Department of Information Management, Yuan Ze University, Chung-Li, Taiwan, ROC;Department of Information Management, Yuan Ze University, Chung-Li, Taiwan, ROC;Department of Psychiatry, National Taiwan University Hospital and National Taiwan University College of Medicine, Taipei, Taiwan, ROC;Department of Computer Science and Information Management, HungKuang University, Taichung, Taiwan, ROC and Department of Industrial Management, National Yunlin University of Science and Technology ...
Venue:
Journal of Biomedical Informatics
Year:
2011

Citing 35
Cited 1

Term-weighting approaches in automatic text retrieval

Information Processing and Management: an International Journal
Advances in knowledge discovery and data mining

Advances in knowledge discovery and data mining
Bayesian Network Classifiers

Machine Learning - Special issue on learning with probabilistic representations
Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Text Classification from Labeled and Unlabeled Documents using EM

Machine Learning - Special issue on information retrieval
Analyzing the effectiveness and applicability of co-training

Proceedings of the ninth international conference on Information and knowledge management
Machine learning in automated text categorization

ACM Computing Surveys (CSUR)
The use of bigrams to enhance text categorization

Information Processing and Management: an International Journal
Semi-Naive Bayesian Classifier

EWSL '91 Proceedings of the European Working Session on Machine Learning
Transductive Inference for Text Classification using Support Vector Machines

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Automatic retrieval and clustering of similar words

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Co-EM support vector learning

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Scoring and Selecting Terms for Text Categorization

IEEE Intelligent Systems
University of Massachusetts: description of the CIRCUS system as used for MUC-4

MUC4 '92 Proceedings of the 4th conference on Message understanding
An introduction to ROC analysis

Pattern Recognition Letters - Special issue: ROC analysis in pattern recognition
Emotion recognition from text using semantic labels and separable mixture models

ACM Transactions on Asian Language Information Processing (TALIP)
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Characterising measures of lexical distributional similarity

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Contextual feature selection for text classification

Information Processing and Management: an International Journal - Special issue: AIRS2005: Information retrieval research in Asia
Mining sequential patterns for protein fold recognition

Journal of Biomedical Informatics
Extended probabilistic HAL with close temporal association for psychiatric query document retrieval

ACM Transactions on Information Systems (TOIS)
Psychiatric document retrieval using a discourse-aware model

Artificial Intelligence
Associative Naïve Bayes classifier: Automated linking of gene ontology to medline documents

Pattern Recognition
Self-training for biomedical parsing

HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
Investigating statistical techniques for sentence-level event classification

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Rule-based information extraction from patients' clinical data

Journal of Biomedical Informatics
Feature generation and representations for protein-protein interaction classification

Journal of Biomedical Informatics
A discriminative model for semi-supervised learning

Journal of the ACM (JACM)
Multi-view semi-supervised learning for dialog act segmentation of speech

IEEE Transactions on Audio, Speech, and Language Processing
Annotation and verification of sense pools in OntoNotes

Information Processing and Management: an International Journal
Experiments in graph-based semi-supervised learning methods for class-instance acquisition

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Multidimensional text classification for drug information

IEEE Transactions on Information Technology in Biomedicine
Discovering Novel Causal Patterns From Biomedical Natural-Language Texts Using Bayesian Nets

IEEE Transactions on Information Technology in Biomedicine
Probability of error of some adaptive pattern-recognition machines

IEEE Transactions on Information Theory

Using a contextual entropy model to expand emotion words and their intensity for the sentiment classification of stock market news

Knowledge-Based Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Purpose: Negative life events, such as the death of a family member, an argument with a spouse or the loss of a job, play an important role in triggering depressive episodes. Therefore, it is worthwhile to develop psychiatric services that can automatically identify such events. This study describes the use of association language patterns, i.e., meaningful combinations of words (e.g., ), as features to classify sentences with negative life events into predefined categories (e.g., Family, Love, Work). Methods: This study proposes a framework that combines a supervised data mining algorithm and an unsupervised distributional semantic model to discover association language patterns. The data mining algorithm, called association rule mining, was used to generate a set of seed patterns by incrementally associating frequently co-occurring words from a small corpus of sentences labeled with negative life events. The distributional semantic model was then used to discover more patterns similar to the seed patterns from a large, unlabeled web corpus. Results: The experimental results showed that association language patterns were significant features for negative life event classification. Additionally, the unsupervised distributional semantic model was not only able to improve the level of performance but also to reduce the reliance of the classification process on the availability of a large, labeled corpus.