The popularity of email has prompted researchers to look for ways to help users better organize the enormous amount of information stored in their email folders. One challenge that has not been studied extensively in text mining is the identification and reconstruction of hidden emails. A hidden email is an original email that has been quoted in at least one email in a folder but is not itself present in that folder. It may have been deleted, intentionally or unintentionally, or may never have been received. The discovery and reconstruction of hidden emails is critical for many applications, including email classification, summarization, and forensics. This paper proposes a framework for reconstructing hidden emails using the embedded quotations found in messages further down the thread hierarchy. We evaluate the robustness and scalability of our framework using the Enron public email corpus. Our experiments show that hidden emails are widespread in that corpus and that our optimization techniques are effective in processing large email folders.
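To make the definition of a hidden email concrete, the following is a minimal sketch of how candidate hidden fragments might be detected in a folder. It is not the framework proposed in the paper. It assumes that quotations are marked with a leading ">" on each line and that a quoted fragment matches its source by exact substring comparison, both of which are simplifications. The function names (quoted_fragments, own_text, hidden_fragments) are hypothetical and chosen only for illustration.

```python
def quoted_fragments(body):
    """Group consecutive '>'-prefixed lines of one message into fragments.

    Assumption: quotations use the common '>' line prefix.
    """
    fragments, current = [], []
    for line in body.splitlines():
        stripped = line.lstrip()
        if stripped.startswith(">"):
            # Drop the quote markers (including nested ones) and padding.
            current.append(stripped.lstrip("> ").rstrip())
        elif current:
            fragments.append("\n".join(current))
            current = []
    if current:
        fragments.append("\n".join(current))
    return fragments


def own_text(body):
    """Return the unquoted (newly written) lines of a message."""
    return "\n".join(
        line.rstrip()
        for line in body.splitlines()
        if not line.lstrip().startswith(">")
    )


def hidden_fragments(folder):
    """Return quoted fragments whose text appears in no message's own text.

    Assumption: exact substring matching stands in for real fragment matching.
    """
    originals = [own_text(body) for body in folder]
    hidden = []
    for body in folder:
        for fragment in quoted_fragments(body):
            found = any(fragment in text for text in originals)
            if not found and fragment not in hidden:
                hidden.append(fragment)
    return hidden


folder = [
    "Let's meet at noon.\nBring the report.",
    "> Let's meet at noon.\n> Bring the report.\nSounds good.",
    "> Can you send the budget?\nAttached.",
]
print(hidden_fragments(folder))
```

In this toy folder, the first quotation is matched by the first message, while the quoted budget request has no source message in the folder, so it is reported as a candidate fragment of a hidden email. A real system would also need to tolerate edited quotations and merge overlapping fragments quoted by different replies, which this sketch does not attempt.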