Statistical Models for Text Segmentation
Machine Learning - Special issue on natural language learning
Topic-based document segmentation with probabilistic latent semantic analysis
Proceedings of the eleventh international conference on Information and knowledge management
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Domain-independent text segmentation using anisotropic diffusion and dynamic programming
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Topic segmentation: algorithms and applications
Topic segmentation: algorithms and applications
The Journal of Machine Learning Research
Advances in domain independent linear text segmentation
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Text segmentation with multiple surface linguistic cues
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Multi-paragraph segmentation of expository text
ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
A Dynamic Programming Algorithm for Linear Text Segmentation
Journal of Intelligent Information Systems
Text segmentation via topic modeling: an analytical study
Proceedings of the 18th ACM conference on Information and knowledge management
Automatic evaluation of topic coherence
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Text segmentation: A topic modeling perspective
Information Processing and Management: an International Journal
A statistical model for topically segmented documents
DS'11 Proceedings of the 14th international conference on Discovery science
Large-scale learning of word relatedness with constraints
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
How text segmentation algorithms gain from topic models
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Sweeping through the topic space: bad luck? Roll again!
ROBUS-UNSUP '12 Proceedings of the Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP
TopicTiling: a text segmentation algorithm based on LDA
ACL '12 Proceedings of ACL 2012 Student Research Workshop
Density-based logistic regression
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Unsupervised text segmentation using LDA and MCMC
AusDM '12 Proceedings of the Tenth Australasian Data Mining Conference - Volume 134
Hi-index | 0.00 |
In this paper we propose a domain-independent text segmentation method, which consists of three components. Latent Dirichlet allocation (LDA) is employed to compute words semantic distribution, and we measure semantic similarity by the Fisher kernel. Finally global best segmentation is achieved by dynamic programming. Experiments on Chinese data sets with the technique show it can be effective. Introducing latent semantic information, our algorithm is robust on irregular-sized segments.