Editorial: Occupation inference through detection and classification of biographical activities
Data & Knowledge Engineering
Biography creation requires the identification of important events in the life of the individual in question. While there are events such as birth and death that apply to everyone, most of the other activities tend to be occupation-specific. Hence, occupation gives important clues as to which activities should be included in the biography. We present techniques for automatically identifying which important events apply to the general population, which ones are occupation-specific, and which ones are person-specific. We use the extracted information as features for a multi-class SVM classifier, which is then used to automatically identify the occupation of a previously unseen individual. We present experiments involving 189 individuals from ten occupations, and we show that our approach accurately identifies general and occupation-specific activities and assigns unseen individuals to the correct occupations. Finally, we present evidence that our technique can lead to efficient and effective biography generation relying only on statistical techniques.
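The three-way split of activities described above can be illustrated with a small sketch. This is not the authors' method, which the abstract does not spell out; it is a simple frequency heuristic under stated assumptions: each person is represented as a set of activity labels, an activity counts as "frequent" in an occupation when at least a hypothetical `threshold` fraction of that occupation's people exhibit it, and the function name `categorize_activities` and the toy data are invented for illustration.

```python
from collections import defaultdict


def categorize_activities(people, threshold=0.6):
    """Label each activity as general, occupation-specific, or person-specific.

    people: list of (occupation, set_of_activities) pairs.
    threshold: hypothetical cutoff on the fraction of people in an
        occupation who must exhibit an activity for it to count as frequent.
    Returns a dict mapping activity -> (category, occupations_where_frequent).
    """
    occ_sizes = defaultdict(int)                    # people per occupation
    counts = defaultdict(lambda: defaultdict(int))  # activity -> occupation -> count
    for occupation, activities in people:
        occ_sizes[occupation] += 1
        for activity in activities:
            counts[activity][occupation] += 1

    labels = {}
    for activity, per_occ in counts.items():
        # Occupations in which the activity is frequent enough.
        frequent_in = tuple(sorted(
            occ for occ, size in occ_sizes.items()
            if per_occ.get(occ, 0) / size >= threshold
        ))
        if len(frequent_in) == len(occ_sizes):
            category = "general"              # frequent in every occupation
        elif frequent_in:
            category = "occupation-specific"  # frequent in some occupations only
        else:
            category = "person-specific"      # frequent in no occupation
        labels[activity] = (category, frequent_in)
    return labels


# Toy data (invented for illustration, not from the paper's 189 individuals).
people = [
    ("composer", {"born", "died", "composed", "toured"}),
    ("composer", {"born", "died", "composed"}),
    ("politician", {"born", "died", "elected"}),
    ("politician", {"born", "died", "elected", "painted"}),
]
labels = categorize_activities(people)
for activity in sorted(labels):
    print(activity, labels[activity])
```

In a pipeline like the one the abstract outlines, the general and occupation-specific activities found this way could serve as features for a multi-class classifier such as an SVM; the feature encoding and classifier settings are not given in the abstract and are left out here.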