A practical approach to feature selection
ML92 Proceedings of the ninth international workshop on Machine learning
Estimating attributes: analysis and extensions of RELIEF
ECML-94 Proceedings of the European conference on machine learning on Machine Learning
Machine Learning - Special issue on inductive transfer
Kernel principal component analysis
Advances in kernel methods
Machine Learning
A Tutorial on Support Vector Machines for Pattern Recognition
Data Mining and Knowledge Discovery
An adaptation of Relief for attribute estimation in regression
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
An introduction to variable and feature selection
The Journal of Machine Learning Research
A tutorial on support vector regression
Statistics and Computing
Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning)
Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning)
Demographic prediction based on user's browsing behavior
Proceedings of the 16th international conference on World Wide Web
A convex formulation for learning shared structures from multiple tasks
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Text categorization with knowledge transfer from heterogeneous data sources
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
You are who you know: inferring user profiles in online social networks
Proceedings of the third ACM international conference on Web search and data mining
PageSense: style-wise web page advertising
Proceedings of the 19th international conference on World wide web
The demographics of web search
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Multi-task learning for boosting with application to web search ranking
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Inferring gender of movie reviewers: exploiting writing style, content and metadata
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
Who's Who with Big-Five: Analyzing and Classifying Personality Traits with Smartphones
ISWC '11 Proceedings of the 2011 15th Annual International Symposium on Wearable Computers
Trace Norm Regularization: Reformulations, Algorithms, and Multi-Task Learning
SIAM Journal on Optimization
Linking visual concept detection with viewer demographics
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Friends don't lie: inferring personality traits from social network structure
Proceedings of the 2012 ACM Conference on Ubiquitous Computing
Predicting personality using novel mobile phone-based metrics
SBP'13 Proceedings of the 6th international conference on Social Computing, Behavioral-Cultural Modeling and Prediction
Hi-index | 0.00 |
Demographics prediction is an important component of user profile modeling. The accurate prediction of users' demographics can help promote many applications, ranging from web search, personalization to behavior targeting. In this paper, we focus on how to predict users' demographics, including ''gender'', ''job type'', ''marital status'', ''age'' and ''number of family members'', based on mobile data, such as users' usage logs, physical activities and environmental contexts. The core idea is to build a supervised learning framework, where each user is represented as a feature vector and users' demographics are considered as prediction targets. The most important component is to construct features from raw data and then supervised learning models can be applied. We propose a feature construction framework, CFC (contextual feature construction), where each feature is defined as the conditional probability of one user activity under the given contexts. Consequently, besides employing standard supervised learning models, we propose a regularized multi-task learning framework to model different kinds of demographics predictions collectively. We also propose a cost-sensitive classification framework for regression tasks, in order to benefit from the existing dimension reduction methods. Finally, due to the limited training instances, we employ ensemble to avoid overfitting. The experimental results show that the framework achieves classification accuracies on ''gender'', ''job'' and ''marital status'' as high as 96%, 83% and 86%, respectively, and achieves Root Mean Square Error (RMSE) on ''age'' and ''number of family members'' as low as 0.69 and 0.66 respectively, under the leave-one-out evaluation.