Using MPI: portable parallel programming with the message-passing interface
Using MPI: portable parallel programming with the message-passing interface
IDEA: interactive data exploration and analysis
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Data preparation for data mining
Data preparation for data mining
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Beyond calculation
Python; Essential Reference
Data Mining Techniques: For Marketing, Sales, and Customer Support
Data Mining Techniques: For Marketing, Sales, and Customer Support
Data Mining: An Overview from a Database Perspective
IEEE Transactions on Knowledge and Data Engineering
Towards a Parallel Data Mining Toolbox
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Hi-index | 0.00 |
This paper describes a flexible and efficient toolbox based on the scripting language Python, capable of handling common tasks in data mining. Using either a relational database or flat files the toolbox gives the user a uniform view of a data collection. Two core features of the toolbox are caching of database queries and parallelism within a collection of independent queries. Our toolbox provides a number of routines for basic data mining tasks on top of which the user can add more functions - mainly domain and data collection dependent - for complex and time consuming data mining tasks.