Genetic programming: on the programming of computers by means of natural selection
Genetic programming: on the programming of computers by means of natural selection
Threading electronic mail: a preliminary study
Information Processing and Management: an International Journal - Special issue: methods and tools for the automatic construction of hypertext
Indexing emails and email threads for retrieval
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Thread detection in dynamic text message streams
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Email Conversations Reconstruction Based on Messages Threading for Multi-person
ETTANDGRS '08 Proceedings of the 2008 International Workshop on Education Technology and Training & 2008 International Workshop on Geoscience and Remote Sensing - Volume 01
Conversation detection in email systems
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Exploiting thread structures to improve smoothing of language models for forum post retrieval
ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
A learning approach for email conversation thread reconstruction
Journal of Information Science
Hi-index | 0.00 |
Email is a type of Web data which is produced in enormous quantities. It is beneficial to detect conversation threads contained in the email corpora for various applications, including discussion search, expert finding and even email clustering and classification. Conversation thread in email corpora can be defined as a cluster of exchanged emails among the same group of people by reply or forwarding on the same topic. According to this definition, we can define parent-child relation between emails, so email conversation threads seem to demonstrate tree structure. This paper presents a new approach based on genetic programming for reconstruction of conversation threads in emails data. This approach considers finding email conversation threads as an optimization problem, and exploits genetic programming to search intelligently in the space of possible solutions. Rather than several studies that have been conducted on this problem, this work concentrates on detecting accurate structure of conversation threads in high recall. This paper provides a comprehensive evaluation on the BC3 data set. Preliminary results suggest that our method provides acceptable precision and higher recall than existing methods.