Mining data records in Web pages
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Mining Web Pages for Data Records
IEEE Intelligent Systems
Automating Content Extraction of HTML Documents
World Wide Web
Resume information extraction with cascaded hybrid model
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Web Information Extraction by HTML Tree Edit Distance Matching
ICCIT '07 Proceedings of the 2007 International Conference on Convergence Information Technology
ArnetMiner: extraction and mining of academic social networks
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Coreex: content extraction from online news articles
Proceedings of the 17th ACM conference on Information and knowledge management
Named entity recognition for web content filtering
NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
Hi-index | 0.00 |
We outline a web personal information mining system that enables robots or devices like mobile phones which possess a visual perception system to discover a person's identity and his personal information (such as phone number, email, address, etc.) by using NLP methods based on the result of the visual perception. At the core of the system lies a rule based personal information extraction algorithm that does not require any supervision or manual annotation, and can easily be applied to other domains such as travel or books. This first implementation was used as a proof of concept and experimental results showed that our annotation-free method is promising and compares favorably to supervised approaches.