The Journal of Machine Learning Research
GROUP '03 Proceedings of the 2003 international ACM SIGGROUP conference on Supporting group work
Combining Topic Models and Social Networks for Chat Data Mining
WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
The author-topic model for authors and documents
UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
Journal of the American Society for Information Science and Technology
Multi-document summarization using off the shelf compression software
HLT-NAACL-DUC '03 Proceedings of the HLT-NAACL 03 on Text summarization workshop - Volume 5
Thread detection in dynamic text message streams
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
LDA-based document models for ad-hoc retrieval
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Why we twitter: understanding microblogging usage and communities
Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis
Foundations and Trends in Information Retrieval
Chat mining: Predicting user and message attributes in computer-mediated communication
Information Processing and Management: an International Journal
Proceedings of the first workshop on Online social networks
Military Textual Analysis and Chat Research
ICSC '08 Proceedings of the 2008 IEEE International Conference on Semantic Computing
Evaluating topic models for information retrieval
Proceedings of the 17th ACM conference on Information and knowledge management
Topic models and a revisit of text-related applications
Proceedings of the 2nd PhD workshop on Information and knowledge management
Computational methods in authorship attribution
Journal of the American Society for Information Science and Technology
A survey of modern authorship attribution methods
Journal of the American Society for Information Science and Technology
A Comparative Study of Utilizing Topic Models for Information Retrieval
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
PLDA: Parallel Latent Dirichlet Allocation for Large-Scale Applications
AAIM '09 Proceedings of the 5th International Conference on Algorithmic Aspects in Information and Management
Context-based message expansion for disentanglement of interleaved text conversations
NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Mining police digital archives to link criminal styles with offender characteristics
ICADL'07 Proceedings of the 10th international conference on Asian digital libraries: looking back 10 years and forging new frontiers
On smoothing and inference for topic models
UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Unsupervised modeling of Twitter conversations
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Part-of-speech tagging for Twitter: annotation, features, and experiments
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Message Receiver Determination in Multiple Simultaneous IM Conversations
IEEE Intelligent Systems
Statistics of online user-generated short documents
ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
PIKM 2011: the 4th ACM workshop for Ph.D. students in information and knowledge management
Proceedings of the 20th ACM international conference on Information and knowledge management
Proceedings of the Second International Conference on Computational Science, Engineering and Information Technology
Hi-index | 0.00 |
The increasing popularity of online-based services (Twitter, Facebook, IRC, Myspace, blogs, just to mention few of them) results in a production of a huge amount of novel documents. These documents present properties that can not be found in standard edited documents. In particular the messages generated during the use of Instant Messages Services (IM) like chat-rooms or Twitter are short, user-generated and "noisy". We are investigating two different but related aspect of the content of these colloquial messages: the topic identification and the author identification tasks. In the first case we would like to answer the question what is the conversation about? while in the second case the question is who are the people involved in the conversation?. The combination of these two tasks into a unique model is a novel and interesting research problem that is the main topic of our research.