Machine Learning for the New York City Power Grid

Authors:
Cynthia Rudin;David Waltz;Roger Anderson;Albert Boulanger;Ansaf Salleb-Aouissi;Maggie Chow;Haimonti Dutta;Philip Gross;Bert Huang;Steve Ierome
Affiliations:
Massachusetts Institute of Technology, Cambridge, and Columbia University, New York;Columbia University, New York;Columbia University, New York;Columbia University, New York;Columbia University, New York;Consolidated Edison Company of New York, New York;Columbia University, New York;Columbia University, New York;Columbia University, New York;Consolidated Edison Company of New York, New York
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
2012

Abstract

Power companies can benefit from the use of knowledge discovery methods and statistical machine learning for preventive maintenance. We introduce a general process for transforming historical electrical grid data into models that aim to predict the risk of failures for components and systems. These models can be used directly by power companies to assist with prioritization of maintenance and repair work. Specialized versions of this process are used to produce 1) feeder failure rankings, 2) cable, joint, terminator, and transformer rankings, 3) feeder Mean Time Between Failure (MTBF) estimates, and 4) manhole events vulnerability rankings. The process in its most general form can handle diverse, noisy sources that are historical (static), semi-real-time, or real-time, incorporates state-of-the-art machine learning algorithms for prioritization (supervised ranking or MTBF), and includes an evaluation of results via cross-validation and blind test. Above and beyond the ranked lists and MTBF estimates are business management interfaces that allow the prediction capability to be integrated directly into corporate planning and decision support; such interfaces rely on several important properties of our general modeling approach: that machine learning features are meaningful to domain experts, that the processing of data is transparent, and that prediction results are accurate enough to support sound decision making. We discuss the challenges in working with historical electrical grid data that were not designed for predictive purposes. The “rawness” of these data contrasts with the accuracy of the statistical models that can be obtained from the process; these models are sufficiently accurate to assist in maintaining New York City's electrical grid.
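
As a rough illustration of two quantities the abstract mentions, the sketch below computes a naive MTBF estimate from failure timestamps and scores a ranked list against blind-test outcomes. It is not the authors' method: the function names `mtbf_days` and `ranking_auc`, the simple "observed time divided by failure count" estimate, and the pairwise AUC-style measure are all assumptions chosen for brevity, and the feeder data in the usage note are hypothetical.

```python
from datetime import datetime


def mtbf_days(failure_times, start, end):
    """Naive MTBF estimate (assumption): observed days divided by failure count.

    Returns None when no failure falls inside the observation window.
    """
    count = sum(1 for t in failure_times if start <= t <= end)
    if count == 0:
        return None
    observed_days = (end - start).total_seconds() / 86400.0
    return observed_days / count


def ranking_auc(scores, labels):
    """Pairwise ranking quality (assumption: AUC-style measure).

    Fraction of (failed, not-failed) pairs in which the failed component
    received the higher risk score; ties count as half.
    """
    failed = [s for s, y in zip(scores, labels) if y == 1]
    healthy = [s for s, y in zip(scores, labels) if y == 0]
    if not failed or not healthy:
        raise ValueError("need at least one failed and one non-failed component")
    wins = 0.0
    for f in failed:
        for h in healthy:
            if f > h:
                wins += 1.0
            elif f == h:
                wins += 0.5
    return wins / (len(failed) * len(healthy))
```

For example, with hypothetical risk scores `[0.9, 0.8, 0.3, 0.1]` for four feeders and blind-test outcomes `[1, 0, 1, 0]`, `ranking_auc` compares each failed feeder with each non-failed one and reports the share of pairs ordered correctly. A value near 1 means the failures were concentrated at the top of the ranked list, and 0.5 is what a random ordering would give on average.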