Semantic anomaly detection in online data sources

Authors:
Orna Raz;Philip Koopman;Mary Shaw
Affiliations:
Carnegie Mellon University, Pittsburgh PA;Carnegie Mellon University, Pittsburgh PA;Carnegie Mellon University, Pittsburgh PA
Venue:
Proceedings of the 24th International Conference on Software Engineering
Year:
2002

Citing 15
Cited 31

Neural networks for pattern recognition

Neural networks for pattern recognition
Handbook of software reliability engineering

Handbook of software reliability engineering
TrIAs: trainable information assistants for cooperative problem solving

Proceedings of the third annual conference on Autonomous Agents
Regression testing for wrapper maintenance

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Temporal sequence learning and data reduction for anomaly detection

ACM Transactions on Information and System Security (TISSEC)
Dynamically Discovering Likely Program Invariants to Support Program Evolution

IEEE Transactions on Software Engineering - Special issue on 1999 international conference on software engineering
Mining time-changing data streams

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Machine Learning

Machine Learning
Dependability: Basic Concepts and Terminology

Dependability: Basic Concepts and Terminology
Information Integration

IEEE Intelligent Systems
Querying Heterogeneous Information Sources Using Source Descriptions

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Learning the Common Structure of Data

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
An Approach to Preserving Sufficient Correctness in Open Resource Coalitions

IWSSD '00 Proceedings of the 10th International Workshop on Software Specification and Design
Software Fault Tolerance: A Tutorial

Software Fault Tolerance: A Tutorial
Dynamically discovering likely program invariants

Dynamically discovering likely program invariants

Research abstract for semantic anomaly detection in dynamic data feeds with incomplete specifications

Proceedings of the 24th International Conference on Software Engineering
Enabling automatic adaptation in systems with under-specified elements

WOSS '02 Proceedings of the first workshop on Self-healing systems
End-user software engineering with assertions in the spreadsheet paradigm

Proceedings of the 25th International Conference on Software Engineering
Visualizing concept drift

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
End-user software engineering

Communications of the ACM - End-user development: tools that empower users to create their own software solutions
Efficient incremental algorithms for dynamic detection of likely invariants

Proceedings of the 12th ACM SIGSOFT twelfth international symposium on Foundations of software engineering
Verifying mutual exclusion and liveness properties with split preconditions

Journal of Computer Science and Technology
An empirical study of fault localization for end-user programmers

Proceedings of the 27th international conference on Software engineering
The impact of software engineering research on modern progamming languages

ACM Transactions on Software Engineering and Methodology (TOSEM)
Invited research overview: end-user programming

CHI '06 Extended Abstracts on Human Factors in Computing Systems
Integrating automated test generation into the WYSIWYT spreadsheet testing methodology

ACM Transactions on Software Engineering and Methodology (TOSEM)
SMArTIC: towards building an accurate, robust and scalable specification miner

Proceedings of the 14th ACM SIGSOFT international symposium on Foundations of software engineering
Self-healing systems - survey and synthesis

Decision Support Systems
Trace anomalies as precursors of field failures: an empirical study

Empirical Software Engineering
The Daikon system for dynamic detection of likely invariants

Science of Computer Programming
Diagnosing misconfiguration with dynamic detection of configuration invariants

HotDep'07 Proceedings of the 3rd workshop on on Hot Topics in System Dependability
Topes: reusable abstractions for validating data

Proceedings of the 30th international conference on Software engineering
Automatic generation of software behavioral models

Proceedings of the 30th international conference on Software engineering
Accommodating data heterogeneity in ULS systems

Proceedings of the 2nd international workshop on Ultra-large-scale software-intensive systems
Fast, Accurate Creation of Data Validation Formats by End-User Developers

IS-EUD '09 Proceedings of the 2nd International Symposium on End-User Development
A toolset for automated failure analysis

ICSE '09 Proceedings of the 31st International Conference on Software Engineering
AVA: automated interpretation of dynamically detected anomalies

Proceedings of the eighteenth international symposium on Software testing and analysis
Automatic steering of behavioral model inference

Proceedings of the the 7th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
Detecting precise behavioral models

Proceedings of the doctoral symposium for ESEC/FSE on Doctoral symposium
A Technique for Verifying Component-Based Software

Electronic Notes in Theoretical Computer Science (ENTCS)
The state of the art in end-user software engineering

ACM Computing Surveys (CSUR)
Automatic requirement extraction from test cases

RV'10 Proceedings of the First international conference on Runtime verification
Where are my intelligent assistant's mistakes? a systematic testing approach

IS-EUD'11 Proceedings of the Third international conference on End-user development
Assessing fault occurrence likelihood for service-oriented systems

ICWE'11 Proceedings of the 11th international conference on Web engineering
Learning extended FSA from software: An empirical assessment

Journal of Systems and Software
Formal passive testing of timed systems: theory and tools

Software Testing, Verification & Reliability

Quantified Score

Hi-index	0.00

Visualization

Abstract

Much of the software we use for everyday purposes incorporates elements developed and maintained by someone other than the developer. These elements include not only code and databases but also dynamic data feeds from online data sources. Although everyday software is not mission critical, it must be dependable enough for practical use. This is limited by the dependability of the incorporated elements.It is particularly difficult to evaluate the dependability of dynamic data feeds, because they may be changed by their proprietors as they are used. Further, the specifications of these data feeds are often even sketchier than the specifications of software components.We demonstrate a method of inferring invariants about the normal behavior of dynamic data feeds. We use these invariants as proxies for specifications to perform on-going detection of anomalies in the data feed. We show the feasibility of our approach and demonstrate its usefulness for semantic anomaly detection: identifying occasions when a dynamic data feed is delivering unreasonable values, even though its behavior may be superficially acceptable (i.e., it is delivering parsable results in a timely fashion).