The JPEG still picture compression standard
Communications of the ACM - Special issue on digital multimedia systems
A Model Selection Criterion for Classification: Application to HMM Topology Optimization
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
The Journal of Machine Learning Research
Probabilistic model-based clustering of complex data
Probabilistic model-based clustering of complex data
Generative model-based document clustering: a comparative study
Knowledge and Information Systems
Incorporating with Recursive Model Training in Time Series Clustering
CIT '05 Proceedings of the The Fifth International Conference on Computer and Information Technology
Pachinko allocation: DAG-structured mixture models of topic correlations
ICML '06 Proceedings of the 23rd international conference on Machine learning
A General Framework for Agglomerative Hierarchical Clustering Algorithms
ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 02
Exploiting asymmetry in hierarchical topic extraction
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Topic sentiment mixture: modeling facets and opinions in weblogs
Proceedings of the 16th international conference on World Wide Web
Short communication: Variable space hidden Markov model for topic detection and analysis
Knowledge-Based Systems
WC-Clustering: Hierarchical Clustering Using the Weighted Confidence Affinity Measure
ICDMW '07 Proceedings of the Seventh IEEE International Conference on Data Mining Workshops
Modeling online reviews with multi-grain topic models
Proceedings of the 17th international conference on World Wide Web
Hierarchical Clustering of Time-Series Data Streams
IEEE Transactions on Knowledge and Data Engineering
Incorporating topic transition in topic detection and tracking algorithms
Expert Systems with Applications: An International Journal
Fast algorithm for computing discrete cosine transform
IEEE Transactions on Signal Processing
Integer DCTs and fast algorithms
IEEE Transactions on Signal Processing
Computing Semantic Relatedness Based on Search Result Analysis
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 03
Web objectionable text content detection using topic modeling technique
Expert Systems with Applications: An International Journal
Hi-index | 12.05 |
Granular topic extraction and modeling are fundament tasks in text analysis. Hierarchical topic clustering algorithms and hierarchical topic models are usually employed for these purposes. However, it is difficult to make a clear distinguish between each pair of hierarchical topics from the semantic granularity point of view. STG (semantic topic granularity) is proposed to indicate the details degree of topic description, and aim at providing discrimination for topics from semantic aspect. A new model, mgMTM (multi-grain mixture topic model) based on STG is then proposed to model grain topics. DCT (discrete cosine transform) is employed to provide a mechanism for computing STG, extracting grain topics and learning mgMTM. Experiments on real world datasets show that the proposed model has lower perplexity score than that of LDA model and thus has better generalization performance in describing text. Experiments also show that the description of the extracted grain topics can be well explained with respect to a dataset including topics about recent global financial crisis.