Investigating statistical machine learning as a tool for software development

Authors:
Kayur Patel;James Fogarty;James A. Landay;Beverly Harrison
Affiliations:
University of Washington, Seattle, WA, USA;University of Washington, Seattle, WA, USA;University of Washington, Seattle, WA, USA;Intel Research Seattle, Seattle, WA, USA
Venue:
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Year:
2008

Citing 23
Cited 12

The reuse of uses in Smalltalk programming

ACM Transactions on Computer-Human Interaction (TOCHI)
Past, present, and future of user interface software tools

ACM Transactions on Computer-Human Interaction (TOCHI) - Special issue on human-computer interaction in the new millennium, Part 1
Machine Learning

Machine Learning
Interactive machine learning

Proceedings of the 8th international conference on Intelligent user interfaces
A design tool for camera-based interaction

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Pinpoint: Problem Determination in Large, Dynamic Internet Services

DSN '02 Proceedings of the 2002 International Conference on Dependable Systems and Networks
Learning and reasoning about interruption

Proceedings of the 5th international conference on Multimodal interfaces
An Ethnographic Study of Copy and Paste Programming Practices in OOPL

ISESE '04 Proceedings of the 2004 International Symposium on Empirical Software Engineering
Six Learning Barriers in End-User Programming Systems

VLHCC '04 Proceedings of the 2004 IEEE Symposium on Visual Languages - Human Centric Computing
How Effective Developers Investigate Source Code: An Exploratory Study

IEEE Transactions on Software Engineering
Examining task engagement in sensor-based statistical models of human interruptibility

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Eliciting design requirements for maintenance-oriented IDEs: a detailed study of corrective and perfective maintenance tasks

Proceedings of the 27th international conference on Software engineering
Maintaining mental models: a study of developer work habits

Proceedings of the 28th international conference on Software engineering
Statistical debugging: simultaneous identification of multiple bugs

ICML '06 Proceedings of the 23rd international conference on Machine learning
YALE: rapid prototyping for complex data mining tasks

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Enabling web browsers to augment web sites' filtering and sorting functionalities

UIST '06 Proceedings of the 19th annual ACM symposium on User interface software and technology
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Toolkit support for developing and deploying sensor-based statistical models of human situations

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Authoring sensor-based interactions by demonstration with direct manipulation and pattern recognition

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Assieme: finding and leveraging implicit references in a web search interface for programmers

Proceedings of the 20th annual ACM symposium on User interface software and technology
Eyepatch: prototyping camera-based interaction through examples

Proceedings of the 20th annual ACM symposium on User interface software and technology
A hybrid discriminative/generative approach for modeling human activities

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Place lab: device positioning using radio beacons in the wild

PERVASIVE'05 Proceedings of the Third international conference on Pervasive Computing

Fixing the program my computer learned: barriers for end users, challenges for the machine

Proceedings of the 14th international conference on Intelligent user interfaces
API usability: CHI'2009 special interest group meeting

CHI '09 Extended Abstracts on Human Factors in Computing Systems
API usability: report on special interest group at CHI

ACM SIGSOFT Software Engineering Notes
Examining difficulties software developers encounter in the adoption of statistical machine learning

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Lowering the barrier to applying machine learning

CHI '10 Extended Abstracts on Human Factors in Computing Systems
Supporting agile modeling through experimentation in an integrated urban simulation framework

Proceedings of the 11th Annual International Digital Government Research Conference on Public Administration Online: Challenges and Opportunities
Toolkit to support intelligibility in context-aware applications

Proceedings of the 12th ACM international conference on Ubiquitous computing
Gestalt: integrated support for implementation and analysis in machine learning

UIST '10 Proceedings of the 23nd annual ACM symposium on User interface software and technology
Lowering the barrier to applying machine learning

UIST '10 Adjunct proceedings of the 23nd annual ACM symposium on User interface software and technology
Why-oriented end-user debugging of naive Bayes text classification

ACM Transactions on Interactive Intelligent Systems (TiiS)
Using multiple models to understand data

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
I can do text analytics!: designing development tools for novice developers

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems

Quantified Score

Hi-index	0.04

Visualization

Abstract

As statistical machine learning algorithms and techniques continue to mature, many researchers and developers see statistical machine learning not only as a topic of expert study, but also as a tool for software development. Extensive prior work has studied software development, but little prior work has studied software developers applying statistical machine learning. This paper presents interviews of eleven researchers experienced in applying statistical machine learning algorithms and techniques to human-computer interaction problems, as well as a study of ten participants working during a five-hour study to apply statistical machine learning algorithms and techniques to a realistic problem. We distill three related categories of difficulties that arise in applying statistical machine learning as a tool for software development: (1) difficulty pursuing statistical machine learning as an iterative and exploratory process, (2) difficulty understanding relationships between data and the behavior of statistical machine learning algorithms, and (3) difficulty evaluating the performance of statistical machine learning algorithms and techniques in the context of applications. This paper provides important new insight into these difficulties and the need for development tools that better support the application of statistical machine learning.