The KDD process for extracting useful knowledge from volumes of data
Communications of the ACM
Statistical inference and data mining
Communications of the ACM
The data warehouse and data mining
Communications of the ACM
Communications of the ACM
Using Data Mining Techniques in Monitoring Diabetes Care. The Simpler the Better?
Journal of Medical Systems
Hi-index | 0.00 |
Primary data mining on alkanes for seeking accurate quantitative relationship between molecular structure and retention indices of gas chromatography is developed in this paper. Based on the results obtained from projection pursuit (PP), a new variable named class distance variable, which essentially describes the branching structure of the alkanes, is proposed. With the help of the new variable, both fitting and prediction accuracy of the regression model can be dramatically improved. The results obtained in this work show that the technique of PP developed in statistics is a quite promising tool for seeking accurate quantitative structure-activity relationship (QSAR) and/or quantitative structure-property relationship (QSPR) researches.