A machine learning approach for tracing regulatory codes to product specific requirements

Authors:
Jane Cleland-Huang;Adam Czauderna;Marek Gibiec;John Emenecker
Affiliations:
DePaul University, Chicago;DePaul University, Chicago;DePaul University, Chicago;DePaul University, Chicago
Venue:
Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - Volume 1
Year:
2010

Abstract

Regulatory standards, designed to protect the safety, security, and privacy of the public, govern numerous areas of software-intensive systems. Project personnel must therefore demonstrate that an as-built system meets all relevant regulatory codes. Current methods for demonstrating compliance either rely on after-the-fact audits, which can lead to significant refactoring when regulations are not met, or require analysts to construct and use traceability matrices. Manual tracing can be prohibitively time-consuming; however, automated trace retrieval methods are not very effective because of the vocabulary mismatches that often occur between regulatory codes and product-level requirements. This paper introduces and evaluates two machine-learning methods designed to improve the quality of traces generated between regulatory codes and product-level requirements. The first approach uses manually created traceability matrices to train a trace classifier, while the second approach uses web-mining techniques to reconstruct the original trace query. The techniques were evaluated against security regulations from the U.S. government's Health Insurance Portability and Accountability Act (HIPAA), traced against ten healthcare-related requirements specifications. Results demonstrated improvements for the subset of HIPAA regulations that exhibited high fan-out behavior across the requirements datasets.
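
The first approach, training a trace classifier from manually created traceability matrices, can be illustrated with a minimal sketch. The abstract does not give the paper's actual weighting formula, so the scheme below is an assumption chosen for simplicity: a term's weight is the share of linked requirements containing it minus the share of unlinked requirements containing it. The function names and the four sample requirements are hypothetical and are not taken from the paper's datasets.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase a text and return its set of distinct alphabetic terms."""
    return set(re.findall(r"[a-z]+", text.lower()))


def train_indicator_weights(examples):
    """Learn per-term weights for ONE regulation from a manual trace matrix.

    `examples` is a list of (requirement_text, is_linked) pairs.
    Assumed weighting (not the paper's formula):
        weight(t) = share of linked requirements containing t
                    - share of unlinked requirements containing t
    Only terms with a positive weight are kept.
    """
    linked = [tokenize(text) for text, is_linked in examples if is_linked]
    unlinked = [tokenize(text) for text, is_linked in examples if not is_linked]
    linked_df = Counter(term for doc in linked for term in doc)
    unlinked_df = Counter(term for doc in unlinked for term in doc)

    weights = {}
    for term, count in linked_df.items():
        weight = count / len(linked) - unlinked_df[term] / max(len(unlinked), 1)
        if weight > 0:
            weights[term] = weight
    return weights


def score(requirement, weights):
    """Return a score in [0, 1]: matched indicator weight over total weight."""
    total = sum(weights.values())
    if total == 0:
        return 0.0
    matched = sum(weights[t] for t in tokenize(requirement) if t in weights)
    return matched / total


# Hypothetical training rows for a single audit-control regulation.
examples = [
    ("The system shall log every login attempt in an audit trail.", True),
    ("All access to patient records shall be recorded in the audit log.", True),
    ("The system shall display the patient's appointment schedule.", False),
    ("Users shall be able to print a billing statement.", False),
]
weights = train_indicator_weights(examples)

candidate_a = "Audit records shall be retained for six years."
candidate_b = "The user shall print the appointment schedule."
print(score(candidate_a, weights) > score(candidate_b, weights))
```

In this sketch, terms shared equally by linked and unlinked requirements (such as "shall") receive no weight, so a candidate requirement is ranked by how much regulation-specific vocabulary it contains. A real classifier would also need stemming, stop-word handling, and a threshold tuned on held-out traces.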