Resume information extraction with cascaded hybrid model

Authors:
Kun Yu;Gang Guan;Ming Zhou
Affiliations:
University of Science and Technology of China, Hefei, Anhui, China;Tsinghua University, Bejing, China;Microsoft Research Asia, Haidian, Bejing, China
Venue:
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Year:
2005

Citing 9
Cited 11

Making large-scale support vector machine learning practical

Advances in kernel methods
An Algorithm that Learns What‘s in a Name

Machine Learning - Special issue on natural language learning
Automatic segmentation of text into structured records

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Text categorization for multi-page documents: a hybrid naive Bayes HMM approach

Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
Machine learning in automated text categorization

ACM Computing Surveys (CSUR)
LearningPinocchio: adaptive information extraction for real world applications

Natural Language Engineering
Improved source-channel models for Chinese word segmentation

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Using N-best lists for named entity recognition from Chinese speech

HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
Bayesian information extraction network

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence

ArnetMiner: extraction and mining of academic social networks

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
RENS --- Enabling a Robot to Identify a Person

ICIRA '09 Proceedings of the 2nd International Conference on Intelligent Robotics and Applications
A Combination Approach to Web User Profiling

ACM Transactions on Knowledge Discovery from Data (TKDD)
PROSPECT: a system for screening candidates for recruitment

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Peeling back the layers: detecting event role fillers in secondary contexts

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
An approach to extract special skills to improve the performance of resume selection

DNIS'10 Proceedings of the 6th international conference on Databases in Networked Information Systems
Mining special features to improve the performance of e-commerce product selection and resume processing

International Journal of Computational Science and Engineering
User real-time interest prediction based on collaborative filtering and interactive computing for academic recommendation

ICIC'12 Proceedings of the 8th international conference on Intelligent Computing Theories and Applications
Bootstrapped training of event extraction classifiers

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
An ontology-based information extraction approach for résumés

ICPCA/SWS'12 Proceedings of the 2012 international conference on Pervasive Computing and the Networked World
iHR: an online recruiting system for Xiamen Talent Service Center

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents an effective approach for resume information extraction to support automatic resume management and routing. A cascaded information extraction (IE) framework is designed. In the first pass, a resume is segmented into a consecutive blocks attached with labels indicating the information types. Then in the second pass, the detailed information, such as Name and Address, are identified in certain blocks (e.g. blocks labelled with Personal Information), instead of searching globally in the entire resume. The most appropriate model is selected through experiments for each IE task in different passes. The experimental results show that this cascaded hybrid model achieves better F-score than flat models that do not apply the hierarchical structure of resumes. It also shows that applying different IE models in different passes according to the contextual structure is effective.