Feature Extraction, Construction and Selection: A Data Mining Perspective
Feature Extraction, Construction and Selection: A Data Mining Perspective
Machine Learning
Discretization and Grouping: Preprocessing Steps for Data Mining
PKDD '98 Proceedings of the Second European Symposium on Principles of Data Mining and Knowledge Discovery
An introduction to variable and feature selection
The Journal of Machine Learning Research
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
Constructive induction on decision trees
IJCAI'89 Proceedings of the 11th international joint conference on Artificial intelligence - Volume 1
A study of cross-validation and bootstrap for accuracy estimation and model selection
IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Comparison of Feature Construction Methods for Video Relevance Prediction
MMM '09 Proceedings of the 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling
The PDG-Mixture Model for Clustering
DaWaK '09 Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery
Simulated evaluation of faceted browsing based on feature selection
Multimedia Tools and Applications
Pattern Recognition Letters
Fast wrapper feature subset selection in high-dimensional datasets by means of filter re-ranking
Knowledge-Based Systems
Global feature subset selection on high-dimensional datasets using re-ranking-based EDAs
CAEPIA'11 Proceedings of the 14th international conference on Advances in artificial intelligence: spanish association for artificial intelligence
Hi-index | 0.00 |
Manchego sheep breeding represents an important factor in the economy in the region of Castilla-La Mancha, Spain. For this reason, the selection scheme for Manchego sheep (ESROM) was created to improve milk production in ewes belonging to the Manchego breed. This scheme relies on the use of several tools that depend on ewes' genetic merit, which is calculated by using a sophisticated linear regression model. This paper presents a study about how the use of data mining techniques can help to approximate the genetic qualities of a ewe, before the official 6 months assessment is carried out, and by using less input. This study focuses on two well-known data mining tasks: pre-processing and classification. In the pre-processing stage, state-of-the-art algorithms and new proposals are used to identify relevant subsets of features by means of selection and construction. By using these subsets of highly predictive variables, different classifiers are trained, which in turn, are used to assess the genetic quality merit of any given ewe. As a result, original and constructed relevant variables have been identified for the target problem, this being a valuable result in itself. Furthermore, from simulated tests, reliable classification rates have been obtained when using the identified classifiers to approach ESROM tasks.