Profile-directed restructuring of operating system code

Authors:
W. J. Schmidt;R. R. Roediger;C. S. Mestad;B. Mendelson;I. Shavit-Lottem;V. Bortnikov-Sitnitsky
Affiliations:
IBM AS/400 Division, 3605 Highway 52 North, Rochester, Minnesota;IBM AS/400 Division, 3605 Highway 52 North, Rochester, Minnesota;IBM AS/400 Division, 3605 Highway 52 North, Rochester, Minnesota;IBM Research Division, Haifa Research Laboratory, Matam--Advanced Technology Center, Haifa 31905, Israel;IBM Research Division, Haifa Research Laboratory, Matam--Advanced Technology Center, Haifa 31905, Israel;IBM Research Division, Haifa Research Laboratory, Matam--Advanced Technology Center, Haifa 31905, Israel
Venue:
IBM Systems Journal
Year:
1998

Citing 24
Cited 15

The program dependence graph and its use in optimization

ACM Transactions on Programming Languages and Systems (TOPLAS)
Compile-Time Program Restructuring in Multiprogrammed Virtual Memory Systems

IEEE Transactions on Software Engineering
Program optimization for instruction caches

ASPLOS III Proceedings of the third international conference on Architectural support for programming languages and operating systems
Achieving high instruction cache performance with an optimizing compiler

ISCA '89 Proceedings of the 16th annual international symposium on Computer architecture
Profile guided code positioning

PLDI '90 Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
Improving instruction cache behavior by reducing cache pollution

Proceedings of the 1990 ACM/IEEE conference on Supercomputing
Profile-driven compilation

Profile-driven compilation
Ordering functions for improving memory reference locality in a shared memory multiprocessor system

MICRO 25 Proceedings of the 25th annual international symposium on Microarchitecture
The impact of operating system structure on memory system performance

SOSP '93 Proceedings of the fourteenth ACM symposium on Operating systems principles
Compile time instruction cache optimizations

ACM SIGARCH Computer Architecture News - Special issue: panel sessions of the 1991 workshop on multithreaded computers
Optimally profiling and tracing programs

ACM Transactions on Programming Languages and Systems (TOPLAS)
Trace-directed program restructuring for AIX executables

IBM Journal of Research and Development
Lessons learned from the OS/400 OO project

Communications of the ACM - Special issue on object-oriented experiences and future trends
The art of computer programming, volume 1 (3rd ed.): fundamental algorithms

The art of computer programming, volume 1 (3rd ed.): fundamental algorithms
Virtual Memory

ACM Computing Surveys (CSUR)
Cache Memories

ACM Computing Surveys (CSUR)
Improving locality by critical working sets

Communications of the ACM
Automatic segmentation of cyclic program structures based on connectivity and processor timing

Communications of the ACM
The working set model for program behavior

Communications of the ACM
Inside the as/400

Inside the as/400
Computers and Intractability: A Guide to the Theory of NP-Completeness

Computers and Intractability: A Guide to the Theory of NP-Completeness
Predicting and Precluding Problems with Memory Latency

IEEE Micro
The analytic design of a dynamic look ahead and program segmenting system for multiprogrammed computers

ACM '66 Proceedings of the 1966 21st national conference
On the application of a priori knowledge of program structure to the performance of virtual memory computer systems.

On the application of a priori knowledge of program structure to the performance of virtual memory computer systems.

Overcoming the challenges to feedback-directed optimization (Keynote Talk)

DYNAMO '00 Proceedings of the ACM SIGPLAN workshop on Dynamic and adaptive compilation and optimization
Characterizing the memory behavior of Java workloads: a structured view and opportunities for optimizations

Proceedings of the 2001 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Code layout optimizations for transaction processing workloads

ISCA '01 Proceedings of the 28th annual international symposium on Computer architecture
The hardness of cache conscious data placement

POPL '02 Proceedings of the 29th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
System Optimization for OLTP Workloads

IEEE Micro
Reality-based optimization

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
Optimization opportunities created by global data reordering

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
System-wide compaction and specialization of the linux kernel

LCTES '05 Proceedings of the 2005 ACM SIGPLAN/SIGBED conference on Languages, compilers, and tools for embedded systems
The hardness of cache conscious data placement

Nordic Journal of Computing
TRICK: tracking and reusing compiler's knowledge

ACM SIGPLAN Notices
Automated reduction of the memory footprint of the Linux kernel

ACM Transactions on Embedded Computing Systems (TECS) - Special Section LCTES'05
A performance methodology for commercial servers

IBM Journal of Research and Development
The advantages of post-link code coverage

HVC'07 Proceedings of the 3rd international Haifa verification conference on Hardware and software: verification and testing
Experiments with auto-parallelizing SPEC2000FP benchmarks

LCPC'04 Proceedings of the 17th international conference on Languages and Compilers for High Performance Computing
Combining code reordering and cache configuration

ACM Transactions on Embedded Computing Systems (TECS)

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we describe how a profiling system can be successfully used to restructure the components of an operating system for improved overall performance. We discuss our choice of a profiling system and how it was applied to the AS/400® (Application System/400®) operating system for the purpose of reordering code. Previous work in the industry has been mainly useful only for application programs. Our work demonstrates how such techniques can be applied to operating system code, while preserving maintainability of the operating system in the customer's environment.