Fuzzy sets, uncertainty, and information
Fuzzy sets, uncertainty, and information
Conceptual database design: an Entity-relationship approach
Conceptual database design: an Entity-relationship approach
C4.5: programs for machine learning
C4.5: programs for machine learning
Improving data warehouse and business information quality: methods for reducing costs and increasing profits
BOAT—optimistic decision tree construction
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Data Mining Techniques: For Marketing, Sales, and Customer Support
Data Mining Techniques: For Marketing, Sales, and Customer Support
RainForest—A Framework for Fast Decision Tree Construction of Large Datasets
Data Mining and Knowledge Discovery
A Framework for Analysis of Data Quality Research
IEEE Transactions on Knowledge and Data Engineering
Knowledge Discovery from Telecommunication Network Alarm Databases
ICDE '96 Proceedings of the Twelfth International Conference on Data Engineering
Hi-index | 0.00 |
Since the data is collected from disparate sources in many actual data mining environments, it is common to have data values in different abstraction levels. This paper shows that such multiple abstraction levels of data can cause undesirable effects in decision tree classification. After explaining that equalizing abstraction levels by force cannot provide satisfactory solutions of this problem, it presents a method to utilize the data as it is. The proposed method accommodates the generalization/specialization relationship between data values in both of the construction and the class assignment phases of decision tree classification. The experimental results show that the proposed method reduces classification error rates significantly when multiple abstraction levels of data are involved.