Using execution trace data to improve distributed systems

Authors:
Johan Moe; David A. Carr
Affiliations:
Ericsson Radio System AB, Box 1248, SE-581 12 Linköping, Sweden; Institutionen för Systemteknik, Luleå Tekniska Universitet, SE-971 87 Luleå, Sweden
Venue:
Software—Practice & Experience
Year:
2002

Visualization

Abstract

One of the most challenging problems facing today's software engineer is to understand and modify distributed systems. One reason is that, in actual use, systems frequently behave differently from what the developer intended. To cope with this challenge, we have developed a three-step method to study the run-time behavior of a distributed system. First, remote procedure calls are traced using CORBA interceptors. Next, the trace data is parsed to construct RPC call-return sequences, and summary statistics are generated. Finally, a visualization tool is used to study the statistics and look for anomalous behavior. We have been using this method on a large distributed system (more than 500 000 lines of code) with data collected during both system testing and operation at a customer's site. Although the distributed system had been in operation for over three years, the method has uncovered system configuration and efficiency problems. Using these discoveries, the system support group has been able to improve product performance and their own product maintenance procedures.
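
The second step of the method (pairing traced calls with their returns and summarizing them) might look roughly like the Python sketch below. It is an illustration only and not the authors' tooling: the record layout `(timestamp_ms, kind, call_id, operation)`, the sample data, and the function names `pair_calls` and `summarize` are all assumptions made for this example. The abstract does not describe the actual trace format produced by the CORBA interceptors.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical trace records: (timestamp_ms, kind, call_id, operation).
# This layout is an assumption for illustration, not the paper's format.
TRACE = [
    (0, "call", 1, "lookup"),
    (4, "return", 1, "lookup"),
    (5, "call", 2, "update"),
    (6, "call", 3, "lookup"),
    (9, "return", 3, "lookup"),
    (25, "return", 2, "update"),
    (30, "call", 4, "update"),  # no matching return in the trace
]


def pair_calls(trace):
    """Match each call event with its return event.

    Returns a list of (operation, latency_ms) for completed calls and a
    dict of calls that never returned, keyed by call_id.
    """
    pending = {}
    completed = []
    for timestamp, kind, call_id, operation in trace:
        if kind == "call":
            pending[call_id] = (timestamp, operation)
        elif kind == "return" and call_id in pending:
            start, op = pending.pop(call_id)
            completed.append((op, timestamp - start))
    return completed, pending


def summarize(completed):
    """Compute per-operation count, mean latency, and maximum latency."""
    latencies = defaultdict(list)
    for op, latency in completed:
        latencies[op].append(latency)
    return {
        op: {"count": len(values), "mean_ms": mean(values), "max_ms": max(values)}
        for op, values in latencies.items()
    }


if __name__ == "__main__":
    completed, pending = pair_calls(TRACE)
    for op, stats in summarize(completed).items():
        print(op, stats)
    print("calls without a return:", sorted(pending))
```

In a setup like this, an operation whose maximum latency is far above its mean, or a call that never returns, is the kind of anomaly the third step would surface for inspection in a visualization tool.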