In this paper, we approach text segmentation from a topic-modeling perspective. We investigate the use of the latent Dirichlet allocation (LDA) topic model to divide a text into semantically coherent segments. A major benefit of the proposed approach is that, along with the segment boundaries, it outputs the topic distribution associated with each segment. This information is potentially useful in applications such as segment retrieval and discourse analysis. The new approach outperforms a standard baseline method and yields significantly better performance than most of the available unsupervised methods on a benchmark dataset.
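The abstract does not spell out the algorithm, so the following is only a minimal sketch of how a topic model could yield both segment boundaries and per-segment topic distributions. It is not the authors' method. Everything in it is an assumption made for illustration: the hand-written topic-word table `phi` stands in for a trained LDA model, each segment's topic mixture is fitted by a few EM steps with `phi` held fixed, boundaries are chosen by dynamic programming over sentence positions, and the per-segment `penalty` is a hypothetical knob that discourages over-segmentation.

```python
import math

def fit_theta(tokens, phi, iters=50, floor=1e-6):
    """Estimate a topic mixture for one segment by EM, keeping phi fixed."""
    k = len(phi)
    theta = [1.0 / k] * k
    for _ in range(iters):
        counts = [0.0] * k
        for w in tokens:
            # Posterior over topics for this token (floor is an assumed
            # probability for words missing from a topic's table).
            post = [theta[t] * phi[t].get(w, floor) for t in range(k)]
            z = sum(post)
            for t in range(k):
                counts[t] += post[t] / z
        theta = [c / len(tokens) for c in counts]
    return theta

def log_likelihood(tokens, theta, phi, floor=1e-6):
    """Log-probability of the tokens under the mixture theta over topics phi."""
    k = len(phi)
    return sum(
        math.log(sum(theta[t] * phi[t].get(w, floor) for t in range(k)))
        for w in tokens
    )

def segment(sentences, phi, penalty=2.0):
    """Return a list of (start, end, theta) over sentence indices, end exclusive."""
    n = len(sentences)
    best = [0.0] + [-math.inf] * n   # best[j]: best score for sentences[:j]
    back = [0] * (n + 1)             # back[j]: start of the last segment
    for j in range(1, n + 1):
        for i in range(j):
            tokens = [w for s in sentences[i:j] for w in s]
            theta = fit_theta(tokens, phi)
            score = best[i] + log_likelihood(tokens, theta, phi) - penalty
            if score > best[j]:
                best[j], back[j] = score, i
    # Walk the back-pointers to recover the chosen segments.
    spans, j = [], n
    while j > 0:
        spans.append((back[j], j))
        j = back[j]
    spans.reverse()
    return [
        (i, j, fit_theta([w for s in sentences[i:j] for w in s], phi))
        for i, j in spans
    ]

# Toy topic-word probabilities (hypothetical; a real system would learn these).
phi = [
    {"cat": 0.45, "dog": 0.45, "stock": 0.05, "bond": 0.05},
    {"cat": 0.05, "dog": 0.05, "stock": 0.45, "bond": 0.45},
]
sentences = [["cat", "dog"], ["dog", "cat"], ["stock", "bond"], ["bond", "stock"]]
result = segment(sentences, phi)
for start, end, theta in result:
    print(start, end, [round(p, 3) for p in theta])
```

On this toy input the sketch places a single boundary between the second and third sentences and assigns each segment a mixture dominated by one topic. The quadratic loop over candidate segments refits the mixture each time, which is acceptable for a small illustration but would need caching or a cheaper score on real documents.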