Traditional text-processing methods degrade significantly when applied to web short texts, which are characterized by feature sparseness, a shortage of hand-labelled training examples, domain dependence, and asyntactic expression. In this paper we propose a modified information inference model that mimics human cognitive behaviour to categorize web short texts in an unsupervised manner. The model is based on conceptual space theory and the hyperspace analogue to language (HAL) model. Its novelty is that it combines domain-specific knowledge with universal knowledge through a fusion mechanism over multiple HAL spaces. In realizing the conceptual space, each concept is represented geometrically by a two-tuple of property sets, which can improve the accuracy with which the information in combined concepts is represented. Two measures of the relationship between concepts are used to carry out information inference for web short texts. We evaluate the model on three web short-text categorization tasks, and the results indicate that the proposed method is applicable and useful.
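As a rough illustration of the HAL idea mentioned above, the following minimal sketch builds a HAL space with a sliding window, in which a word that precedes the target at distance d within a window of size K contributes a weight of K - d + 1. It is not the authors' implementation. The function names (`build_hal`, `concept_vector`, `fuse`), the window size, and in particular the linear fusion rule with weight `alpha` are assumptions made for this example; the abstract does not specify how the paper's fusion mechanism works.

```python
from collections import defaultdict
from math import sqrt


def build_hal(tokens, window=5):
    """Build a HAL space: hal[word][preceding_word] -> accumulated weight.

    A preceding word at distance d (1 <= d <= window) adds window - d + 1.
    """
    hal = defaultdict(lambda: defaultdict(float))
    for i, word in enumerate(tokens):
        for d in range(1, window + 1):
            j = i - d
            if j < 0:
                break  # no more preceding tokens
            hal[word][tokens[j]] += window - d + 1
    return hal


def concept_vector(hal, word):
    """Sum the row (preceding context) and column (following context) of a word."""
    vec = defaultdict(float)
    for other, weight in hal.get(word, {}).items():
        vec[other] += weight
    for other, row in hal.items():
        if word in row:
            vec[other] += row[word]
    return dict(vec)


def _unit(vec):
    """Scale a sparse vector to unit Euclidean length (empty stays empty)."""
    norm = sqrt(sum(w * w for w in vec.values()))
    return {k: w / norm for k, w in vec.items()} if norm else {}


def fuse(domain_vec, universal_vec, alpha=0.5):
    """Hypothetical fusion: weighted sum of two normalised concept vectors.

    This linear rule is an assumption for illustration, not the paper's method.
    """
    a, b = _unit(domain_vec), _unit(universal_vec)
    return {k: alpha * a.get(k, 0.0) + (1 - alpha) * b.get(k, 0.0)
            for k in set(a) | set(b)}
```

In use, one HAL space could be built from a domain-specific corpus and another from a general corpus, with `concept_vector` called on each for the same word and the two results passed to `fuse`. Normalising before mixing keeps the larger corpus from dominating purely because of its size.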