KDD is a complex and demanding task. While a large number of methods have been established for numerous problems, many challenges remain to be solved. New tasks emerge, requiring the development of new methods or processing schemes. As in software development, the development of such solutions demands careful analysis, specification, implementation, and testing. Rapid prototyping is an approach that allows crucial design decisions to be made as early as possible. A rapid prototyping system should support maximal re-use and innovative combinations of existing methods, as well as simple and quick integration of new ones.

This paper describes Yale, a free open-source environment for KDD and machine learning. Yale provides a rich variety of methods, which allows rapid prototyping for new applications and makes costly re-implementations unnecessary. Additionally, Yale offers extensive functionality for process evaluation and optimization, which is a crucial property for any KDD rapid prototyping tool. Following the paradigm of visual programming eases the design of processing schemes. While the graphical user interface supports interactive design, the underlying XML representation enables automated applications after the prototyping phase.

After a discussion of the key concepts of Yale, we illustrate the advantages of rapid prototyping for KDD on case studies ranging from data pre-processing to result visualization. These case studies cover tasks like feature engineering, text mining, data stream mining and tracking drifting concepts, ensemble methods, and distributed data mining. This variety of applications is also reflected in a broad user base: we counted more than 40,000 downloads during the last twelve months.