Scientific workflow management and the Kepler system: Research Articles

Authors:
Bertram Ludäscher;Ilkay Altintas;Chad Berkley;Dan Higgins;Efrat Jaeger;Matthew Jones;Edward A. Lee;Jing Tao;Yang Zhao
Affiliations:
San Diego Supercomputer Center, UC San Diego, San Diego, CA 92093, U.S.A. and Department of Computer Science and Genome Center, UC Davis, Davis, CA 95616, U.S.A.;San Diego Supercomputer Center, UC San Diego, San Diego, CA 92093, U.S.A.;National Center for Ecological Analysis and Synthesis, UC Santa Barbara, Santa Barbara, CA 93101, U.S.A.;National Center for Ecological Analysis and Synthesis, UC Santa Barbara, Santa Barbara, CA 93101, U.S.A.;San Diego Supercomputer Center, UC San Diego, San Diego, CA 92093, U.S.A.;National Center for Ecological Analysis and Synthesis, UC Santa Barbara, Santa Barbara, CA 93101, U.S.A.;Department of Electrical Engineering and Computer Sciences, UC Berkeley, Berkeley, CA 94720, U.S.A.;San Diego Supercomputer Center, UC San Diego, San Diego, CA 92093, U.S.A.;Department of Electrical Engineering and Computer Sciences, UC Berkeley, Berkeley, CA 94720, U.S.A.
Venue:
Concurrency and Computation: Practice & Experience - Workflow in Grid Systems
Year:
2006

Citing 0
Cited 302

Integrating ecoinformatics resources on the semantic web

Proceedings of the 15th international conference on World Wide Web
Indexing and searching tera-scale Grid-Based Digital Libraries

InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
From Molecule to Man: Decision Support in Individualized E-Health

Computer
Semantic Grid Services for Video Analysis

WI-IATW '06 Proceedings of the 2006 IEEE/WIC/ACM international conference on Web Intelligence and Intelligent Agent Technology
Semantics-based automatic composition of geospatial Web service chains

Computers & Geosciences
On automated composition for web services

Proceedings of the 16th international conference on World Wide Web
A Workflow-Based Non-intrusive Approach for Enhancing the Survivability of Critical Infrastructures in Cyber Environment

SESS '07 Proceedings of the Third International Workshop on Software Engineering for Secure Systems
Cache for workflows

Proceedings of the 2nd workshop on Workflows in support of large-scale science
Integrating existing scientific workflow systems: the Kepler/Pegasus example

Proceedings of the 2nd workshop on Workflows in support of large-scale science
Workflow automation for processing plasma fusion simulation data

Proceedings of the 2nd workshop on Workflows in support of large-scale science
Supporting large-scale science with workflows

Proceedings of the 2nd workshop on Workflows in support of large-scale science
On the black art of designing computational workflows

Proceedings of the 2nd workshop on Workflows in support of large-scale science
Review Article: Workflow based framework for life science informatics

Computational Biology and Chemistry
Subjunctive interfaces: Extending applications to support parallel setup, viewing and control of alternative scenarios

ACM Transactions on Computer-Human Interaction (TOCHI)
Remote control: distributed application configuration, management, and visualization with plush

LISA'07 Proceedings of the 21st conference on Large Installation System Administration Conference
OrthoSearch: a scientific workflow approach to detect distant homologies on protozoans

Proceedings of the 2008 ACM symposium on Applied computing
Temporal dependency based checkpoint selection for dynamic verification of fixed-time constraints in grid workflow systems

Proceedings of the 30th international conference on Software engineering
Semantic links in integrated modelling frameworks

Mathematics and Computers in Simulation
Curated databases

Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
A three tier architecture applied to LiDAR processing and monitoring

Scientific Programming - Scientific Workflows
Supporting the construction of workflows for biodiversity problem-solving accessing secure, distributed resources

Scientific Programming - Scientific Workflows
Scheduling scientific workflow applications with deadline and budget constraints using genetic algorithms

Scientific Programming - Scientific Workflows
Solving the grid interoperability problem by P-GRADE portal at workflow level

Future Generation Computer Systems
Conditional workflow management: A survey and analysis

Scientific Programming - Dynamic Computational Workflows: Discovery, Optimization and Scheduling
Specification and runtime workflow support in the ASKALON Grid environment

Scientific Programming - Dynamic Computational Workflows: Discovery, Optimization and Scheduling
Distributing workflows over a ubiquitous P2P network

Scientific Programming - Dynamic Computational Workflows: Discovery, Optimization and Scheduling
Protecting privacy in recorded conversations

PAIS '08 Proceedings of the 2008 international workshop on Privacy and anonymity in information society
Eliminating the middleman: peer-to-peer dataflow

HPDC '08 Proceedings of the 17th international symposium on High performance distributed computing
Flexible IO and integration for scientific codes through the adaptable IO system (ADIOS)

CLADE '08 Proceedings of the 6th international workshop on Challenges of large applications in distributed environments
Modeling and optimization of scientific workflows

Ph.D. '08 Proceedings of the 2008 EDBT Ph.D. workshop
WS-RF Workflow in Triana

International Journal of High Performance Computing Applications
Flexible and Efficient Workflow Deployment of Data-Intensive Applications On Grids With MOTEUR

International Journal of High Performance Computing Applications
Composing Different Models of Computation in Kepler and Ptolemy II

ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
Heterogeneous Workflows in Scientific Workflow Systems

ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
Towards a Formal Foundation for Aggregating Scientific Workflows

ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
Framework for Workflow Parallel Execution in Grid Environment

ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
A Dataflow-Oriented Atomicity and Provenance System for Pipelined Scientific Workflows

ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
The NExT System: Towards True Dynamic Adaptations of Semantic Web Service Compositions

ESWC '07 Proceedings of the 4th European conference on The Semantic Web: Research and Applications
On Specifying and Visualising Long-Running Empirical Studies

ICMT '08 Proceedings of the 1st international conference on Theory and Practice of Model Transformations
A Data Management Framework for Urgent Geoscience Workflows

ICCS '08 Proceedings of the 8th international conference on Computational Science, Part II
DaltOn: An Infrastructure for Scientific Data Management

ICCS '08 Proceedings of the 8th international conference on Computational Science, Part III
Tool Integration and Interoperability Challenges of a System-Level Design Flow: A Case Study

SAMOS '08 Proceedings of the 8th international workshop on Embedded Computer Systems: Architectures, Modeling, and Simulation
Experience in using a process language to define scientific workflow and generate dataset provenance

Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of software engineering
Inference Web in Action: Lightweight Use of the Proof Markup Language

ISWC '08 Proceedings of the 7th International Conference on The Semantic Web
Towards a Calculus for Collection-Oriented Scientific Workflows with Side Effects

OTM '08 Proceedings of the OTM 2008 Confederated International Conferences, CoopIS, DOA, GADA, IS, and ODBASE 2008. Part I on On the Move to Meaningful Internet Systems:
Kepler/pPOD: Scientific Workflow and Provenance Support for Assembling the Tree of Life

Provenance and Annotation of Data and Processes
Using Provenance to Improve Workflow Design

Provenance and Annotation of Data and Processes
A Provenance-Based Fault Tolerance Mechanism for Scientific Workflows

Provenance and Annotation of Data and Processes
Using Explicit Control Processes in Distributed Workflows to Gather Provenance

Provenance and Annotation of Data and Processes
Provenance in Sensornet Republishing

Provenance and Annotation of Data and Processes
Review: Modelling with knowledge: A review of emerging semantic approaches to environmental modelling

Environmental Modelling & Software
A scientific workflow construction command line

Proceedings of the 14th international conference on Intelligent user interfaces
Scientific workflow design for mere mortals

Future Generation Computer Systems
Atomicity and provenance support for pipelined scientific workflows

Future Generation Computer Systems
Efficiently discovering critical workflows in scientific explorations

Future Generation Computer Systems
Earth system modelling with Windows Workflow Foundation

Future Generation Computer Systems
Workflow Global Computing with YML

GRID '06 Proceedings of the 7th IEEE/ACM International Conference on Grid Computing
Efficient provenance storage over nested data collections

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Ontology-supported scientific data frameworks: The Virtual Solar-Terrestrial Observatory experience

Computers & Geosciences
Semantic Web-based geospatial knowledge transformation

Computers & Geosciences
Workflow management for high volume supernova search

Proceedings of the 2009 ACM symposium on Applied Computing
Towards a Formal Framework for Workflow Interoperability

Web Services and Formal Methods
An integrated framework for performance-based optimization of scientific workflows

Proceedings of the 18th ACM international symposium on High performance distributed computing
Structural Considerations in Defining Executable Process Models

ICSP '09 Proceedings of the International Conference on Software Process: Trustworthy Software Development Processes
An Open Domain-Extensible Environment for Simulation-Based Scientific Investigation (ODESSI)

ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
Parameter Space Exploration Using Scientific Workflows

ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
On the Origin of Grid Species: The Living Application

ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
What Makes Scientific Workflows Scientific?

SSDBM 2009 Proceedings of the 21st International Conference on Scientific and Statistical Database Management
Exploring Scientific Workflow Provenance Using Hybrid Queries over Nested Data and Lineage Graphs

SSDBM 2009 Proceedings of the 21st International Conference on Scientific and Statistical Database Management
Data Integration with the DaltOn Framework --- A Case Study

SSDBM 2009 Proceedings of the 21st International Conference on Scientific and Statistical Database Management
DynamicFlow: A Client-Side Workflow Management System

IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part II: Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, and Ambient Assisted Living
Coping with Exceptions in Agent-Based Workflow Enactments

Engineering Societies in the Agents World IX
Services + Components = Data Intensive Scientific Workflow Applications with MeDICi

CBSE '09 Proceedings of the 12th International Symposium on Component-Based Software Engineering
Towards a Formal Semantics for the Process Model of the Taverna Workbench. Part II

Fundamenta Informaticae
A Novel Collaborative Grid Framework for Distributed Healthcare

CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
Using Templates to Predict Execution Time of Scientific Workflow Applications in the Grid

CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
Use of grid computing for modeling virtual geospatial products

International Journal of Geographical Information Science - Distributed Geographic Information Processing Research
The Long-Term Ecological Research community metadata standardisation project: a progress report

International Journal of Metadata, Semantics and Ontologies
Using the semantic web to integrate ecoinformatics resources

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 2
A performance study of grid workflow engines

GRID '08 Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing
A Visual Interface for on-the-fly Biological Database Integration and Workflow Design Using VizBuilder

DILS '09 Proceedings of the 6th International Workshop on Data Integration in the Life Sciences
Invocation of operations from script-based Grid applications

Future Generation Computer Systems
From data to knowledge to discoveries: Artificial intelligence and scientific workflows

Scientific Programming
The Circulate architecture: avoiding workflow bottlenecks caused by centralised orchestration

Cluster Computing
POGGI: Puzzle-Based Online Games on Grid Infrastructures

Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Customized and Optimized Service Selection with ProtocolDB

Globe '09 Proceedings of the 2nd International Conference on Data Management in Grid and Peer-to-Peer Systems
Scientific Workflows: Business as Usual?

BPM '09 Proceedings of the 7th International Conference on Business Process Management
Wings for Pegasus: creating large-scale scientific applications using semantic representations of computational workflows

IAAI'07 Proceedings of the 19th national conference on Innovative applications of artificial intelligence - Volume 2
Three fundamental dimensions of scientific workflow interoperability: Model of computation, language, and execution environment

Future Generation Computer Systems
A navigation model for exploring scientific workflow provenance graphs

Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science
Web enabling desktop workflow applications

Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science
A simulation toolkit to investigate the effects of grid characteristics on workflow completion time

Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science
A data-driven workflow language for grids based on array programming principles

Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science
Plasma fusion code coupling using scalable I/O services and scientific workflows

Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science
Workflow representation and runtime based on lazy functional streams

Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science
Kepler + Hadoop: a general architecture facilitating data-intensive applications in scientific workflow systems

Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science
Archive migration through workflow automation

PDCS '07 Proceedings of the 19th IASTED International Conference on Parallel and Distributed Computing and Systems
Semantic middleware for e-science knowledge spaces

Proceedings of the 7th International Workshop on Middleware for Grids, Clouds and e-Science
Workflow management in the grid era: A goal-driven approach based on process patterns

Multiagent and Grid Systems - New tendencies on agents and grid environments
Storing scientific workflows in a database

Proceedings of the VLDB Endowment
FlowVR-SciViz: a component-based framework for interactive scientific visualization

Proceedings of the 2009 Workshop on Component-Based High Performance Computing
CloudWF: A Computational Workflow System for Clouds Based on Hadoop

CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
BioExtract Server—An Integrated Workflow-Enabling System to Access and Analyze Heterogeneous, Distributed Biomolecular Data

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Techniques for efficiently querying scientific workflow provenance graphs

Proceedings of the 13th International Conference on Extending Database Technology
Fine-grained and efficient lineage querying of collection-based workflow provenance

Proceedings of the 13th International Conference on Extending Database Technology
A service composition construct to support iterative development

FASE'07 Proceedings of the 10th international conference on Fundamental approaches to software engineering
B-Fabric: a data and application integration framework for life sciences research

DILS'07 Proceedings of the 4th international conference on Data integration in the life sciences
A formal model of dataflow repositories

DILS'07 Proceedings of the 4th international conference on Data integration in the life sciences
Project histories: managing data provenance across collection-oriented scientific workflow runs

DILS'07 Proceedings of the 4th international conference on Data integration in the life sciences
The Living Application: a Self-Organizing System for Complex Grid Tasks

International Journal of High Performance Computing Applications
Wikipedia driven autonomous label assignment in wrapper induced tables with missing column names

Proceedings of the 2010 ACM Symposium on Applied Computing
Automating molecular docking with explicit receptor flexibility using scientific workflows

BSB'07 Proceedings of the 2nd Brazilian conference on Advances in bioinformatics and computational biology
Probe-it!: visualization support for provenance

ISVC'07 Proceedings of the 3rd international conference on Advances in visual computing - Volume Part II
Semantically resolving type mismatches in scientific workflows

OTM'07 Proceedings of the 2007 OTM confederated international conference on On the move to meaningful internet systems - Volume Part I
Scientific workflow: a survey and research directions

PPAM'07 Proceedings of the 7th international conference on Parallel processing and applied mathematics
A business-oriented grid workflow management system

Euro-Par'07 Proceedings of the 2007 conference on Parallel processing
A study on a lightweight scientific workflow system for astronomical e-science service

IITA'09 Proceedings of the 3rd international conference on Intelligent information technology application
Policy-based hybrid workflow management system for heart disease identification

APCC'09 Proceedings of the 15th Asia-Pacific conference on Communications
On Detecting Data Flow Errors in Workflows

Journal of Data and Information Quality (JDIQ)
Bioinformatics algorithm development for Grid environments

Journal of Systems and Software
Parallelizing XML data-streaming workflows via MapReduce

Journal of Computer and System Sciences
Localising temporal constraints in scientific workflows

Journal of Computer and System Sciences
Efficiently supporting secure and reliable collaboration in scientific workflows

Journal of Computer and System Sciences
Information flow analysis of scientific workflows

Journal of Computer and System Sciences
Detecting distant homologies on protozoans metabolic pathways using scientific workflows

International Journal of Data Mining and Bioinformatics
RDFProv: A relational RDF store for querying and managing scientific workflow provenance

Data & Knowledge Engineering
Marine Geospatial Ecology Tools: An integrated framework for ecological geoprocessing with ArcGIS, Python, R, MATLAB, and C++

Environmental Modelling & Software
Adaptive service scheduling for workflow applications in Service-Oriented Grid

The Journal of Supercomputing
Meta-workflows: pattern-based interoperability between Galaxy and Taverna

Proceedings of the 1st International Workshop on Workflow Approaches to New Data-centric Science
DFL designer: collection-oriented scientific workflows with Petri nets and nested relational calculus

Proceedings of the 1st International Workshop on Workflow Approaches to New Data-centric Science
Resource descriptions, ontology, and resource discovery

International Journal of Metadata, Semantics and Ontologies
Sharing geoscience algorithms in a Web service-oriented environment (GRASS GIS example)

Computers & Geosciences
A data placement strategy in scientific cloud workflows

Future Generation Computer Systems
Grids and Clouds: Making Workflow Applications Work in Heterogeneous Distributed Environments

International Journal of High Performance Computing Applications
Parameterized specification, configuration and execution of data-intensive scientific workflows

Cluster Computing
Experiments with Memory-to-Memory Coupling for End-to-End Fusion Simulation Workflows

CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Using VCL technology to implement distributed reconfigurable data centers and computational services for educational institutions

IBM Journal of Research and Development
Monitoring data quality in Kepler

Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Adapting workflow technology to design-based research: development of a method for organizing the "messiness" of research in technology-rich online learning environments

ICLS '10 Proceedings of the 9th International Conference of the Learning Sciences - Volume 1
Towards practical incremental recomputation for scientists: an implementation for the Python language

TAPP'10 Proceedings of the 2nd conference on Theory and practice of provenance
On the use of abstract workflows to capture scientific process provenance

TAPP'10 Proceedings of the 2nd conference on Theory and practice of provenance
A virtual laboratory for medical image analysis

IEEE Transactions on Information Technology in Biomedicine
A Min-Min average algorithm for scheduling transaction-intensive grid workflows

AusGrid '09 Proceedings of the Seventh Australasian Symposium on Grid Computing and e-Research - Volume 99
DockFlow: Achieving interoperability of protein docking tools across heterogeneous Grid middleware

International Journal of Ad Hoc and Ubiquitous Computing
Client + cloud: evaluating seamless architectures for visual data analytics in the ocean sciences

SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
Optimizing resource allocation for scientific workflows using advance reservations

SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
A provenance-based approach to resource discovery in distributed molecular dynamics workflows

RED'09 Proceedings of the 2nd international conference on Resource discovery
Helping biologists effectively build workflows, without programming

DILS'10 Proceedings of the 7th international conference on Data integration in the life sciences
Power-Aware Consolidation of Scientific Workflows in Virtualized Environments

Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
e-BioFlow: improving practical use of workflow systems in bioinformatics

ITBAM'10 Proceedings of the First international conference on Information technology in bio- and medical informatics
Building multi-agent systems for workflow enactment and exception handling

COIN'09 Proceedings of the 5th international conference on Coordination, organizations, institutions, and norms in agent systems
Automated component-level evaluation: present and future

CLEF'10 Proceedings of the 2010 international conference on Multilingual and multimodal information access evaluation: cross-language evaluation forum
Decentralized execution of linear workflows over web services

Future Generation Computer Systems
Reputation-based dependable scheduling of workflow applications in Peer-to-Peer Grids

Computer Networks: The International Journal of Computer and Telecommunications Networking
On-demand minimum cost benchmarking for intermediate dataset storage in scientific cloud workflow systems

Journal of Parallel and Distributed Computing
KBB: a knowledge-bundle builder for research studies

ER'10 Proceedings of the 2010 international conference on Advances in conceptual modeling: applications and challenges
Modeling water resource systems using a service-oriented computing paradigm

Environmental Modelling & Software
An effective methodology for defining consistent semantics of complex systems

CEFP'09 Proceedings of the Third summer school conference on Central European functional programming school
Workflows for metabolic flux analysis: data integration and human interaction

ISoLA'10 Proceedings of the 4th international conference on Leveraging applications of formal methods, verification, and validation - Volume Part I
P-GRADE Portal: A generic workflow system to support user communities

Future Generation Computer Systems
Workflows to open provenance graphs, round-trip

Future Generation Computer Systems
A hybrid fault tolerance technique in grid computing system

The Journal of Supercomputing
Formalisations and applications of BPMN

Science of Computer Programming
An experimentation workbench for replayable networking research

NSDI'07 Proceedings of the 4th USENIX conference on Networked systems design & implementation
Exploiting Latent I/O Asynchrony in Petascale Science Applications

International Journal of High Performance Computing Applications
Workflows for information integration in the life sciences

Search computing
CONFLuEnCE: CONtinuous workFLow ExeCution Engine

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
A run-time system for efficient execution of scientific workflows on distributed environments

International Journal of Parallel Programming
Integrated data placement and task assignment for scientific workflows in clouds

Proceedings of the fourth international workshop on Data-intensive distributed computing
Just in time: adding value to the IO pipelines of high performance applications with JITStaging

Proceedings of the 20th international symposium on High performance distributed computing
Experiences using smaash to manage data-intensive simulations

Proceedings of the 20th international symposium on High performance distributed computing
A research agenda for data curation cyberinfrastructure

Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Improving simulation management systems through ontology generation and utilization

Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
A scientific workflow environment for Earth system related studies

Computers & Geosciences
Workflow technology for geo-processing: the missing link

Proceedings of the 2nd International Conference on Computing for Geospatial Research & Applications
Temporal dependency-based checkpoint selection for dynamic verification of temporal constraints in scientific workflow systems

ACM Transactions on Software Engineering and Methodology (TOSEM)
CG3DR: Coordination of icosahedral virus reconstruction using Condensed Graphs

Parallel Computing
Molecular parameter optimization gateway (ParamChem): workflow management through TeraGrid ASTA

Proceedings of the 2011 TeraGrid Conference: Extreme Digital Discovery
Coherence and performance for interactive scientific visualization applications

SC'11 Proceedings of the 10th international conference on Software composition
Knowledge annotations in scientific workflows: an implementation in Kepler

SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
Improving workflow fault tolerance through provenance-based recovery

SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
Search, adapt, and reuse: the future of scientific workflows

ACM SIGMOD Record
SciProv: an architecture for semantic query in provenance metadata on e-science context

ITBAM'11 Proceedings of the Second international conference on Information technology in bio- and medical informatics
Experiment and analysis services in a fingerprint digital library for collaborative research

TPDL'11 Proceedings of the 15th international conference on Theory and practice of digital libraries: research and advanced technology for digital libraries
An executable and testable semantics for iTasks

IFL'08 Proceedings of the 20th international conference on Implementation and application of functional languages
A data management system for ab-initio nuclear physics applications

Proceedings of the 19th High Performance Computing Symposia
Distributed application configuration, management, and visualization with plush

ACM Transactions on Internet Technology (TOIT)
MEDCollector: multisource epidemic data collector

Transactions on large-scale data- and knowledge-centered systems IV
Examples of ecological data synthesis driven by rich metadata, and practical guidelines to use the Ecological Metadata Language specification to this end

International Journal of Metadata, Semantics and Ontologies
Modeling and simulation of distributed computing workflows in heterogeneous network environments

Simulation
Path planning for chaining geospatial web services

W2GIS'06 Proceedings of the 6th international conference on Web and Wireless Geographical Information Systems
Scientific workflow infrastructure for computational chemistry on the grid

ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part III
A three tier architecture for LiDAR interpolation and analysis

ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part III
Workflows for wind tunnel grid applications

ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part III
Integration of compute-intensive tasks into scientific workflows in beesycluster

ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part III
Actor Petri net model for scientific workflows: model, design and system

Proceedings of the 4th International Conference on Ubiquitous Information Management and Communication
High end scientific codes with computational I/O pipelines: improving their end-to-end performance

Proceedings of the 2nd international workshop on Petascale data analytics: challenges and opportunities
Distributed workflow-driven analysis of large-scale biological data using biokepler

Proceedings of the 2nd international workshop on Petascale data analytics: challenges and opportunities
Scientific workflow reuse through conceptual workflows on the virtual imaging platform

Proceedings of the 6th workshop on Workflows in support of large-scale science
Provenance for MapReduce-based data-intensive workflows

Proceedings of the 6th workshop on Workflows in support of large-scale science
Achieving reproducibility by combining provenance with service and workflow versioning

Proceedings of the 6th workshop on Workflows in support of large-scale science
Knowledge generation from digital libraries and persistent archives

ECDL'06 Proceedings of the 10th European conference on Research and Advanced Technology for Digital Libraries
Towards automatic generation of semantic types in scientific workflows

WISE'05 Proceedings of the 2005 international conference on Web Information Systems Engineering
Service oriented architectures for science gateways on grid systems

ICSOC'05 Proceedings of the Third international conference on Service-Oriented Computing
Datagridflows: managing long-run processes on datagrids

DMG 2005 Proceedings of the First VLDB conference on Data Management in Grids
Applications development for the computational grid

APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
SIBIOS ontology: a robust package for the integration and pipelining of bioinformatics services

DILS'06 Proceedings of the Third international conference on Data Integration in the Life Sciences
Collection-Oriented scientific workflows for integrating and analyzing biological data

DILS'06 Proceedings of the Third international conference on Data Integration in the Life Sciences
A scientific workflow system based on GOS

HPCA'09 Proceedings of the Second international conference on High Performance Computing and Applications
Parallel high-resolution climate data analysis using Swift

Proceedings of the 2011 ACM international workshop on Many task computing on grids and supercomputers
A dependency-driven formulation of parareal: parallel-in-time solution of PDEs as a many-task application

Proceedings of the 2011 ACM international workshop on Many task computing on grids and supercomputers
A scientific workflow framework integrated with object deputy model for data provenance

WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
Sustaining the development of cyberinfrastructure: an organization adapting to change

Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work
Modeling and storing scientific protocols

OTM'06 Proceedings of the 2006 international conference on On the Move to Meaningful Internet Systems: AWeSOMe, CAMS, COMINF, IS, KSinBIT, MIOS-CIAO, MONET - Volume Part I
Data integration and workflow solutions for ecology

DILS'05 Proceedings of the Second international conference on Data Integration in the Life Sciences
Actor-oriented design of scientific workflows

ER'05 Proceedings of the 24th international conference on Conceptual Modeling
In-situ I/O processing: a case for location flexibility

Proceedings of the sixth workshop on Parallel Data Storage
Flexible service composition

CIA'06 Proceedings of the 10th international conference on Cooperative Information Agents
Managing rapidly-evolving scientific workflows

IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
Provenance collection support in the Kepler scientific workflow system

IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
A model for user-oriented data provenance in pipelined scientific workflows

IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
A calculus for propagating semantic annotations through scientific workflow queries

EDBT'06 Proceedings of the 2006 international conference on Current Trends in Database Technology
ModeleR: An enviromental model repository as knowledge base for experts

Expert Systems with Applications: An International Journal
Challenges storing and representing biomedical data

USAB'11 Proceedings of the 7th conference on Workgroup Human-Computer Interaction and Usability Engineering of the Austrian Computer Society: information Quality in e-Health
High performance computing techniques for scaling image analysis workflows

PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume 2
A decomposition-based approach for service composition with global QoS guarantees

Information Sciences: an International Journal
AstroShelf: understanding the universe through scalable navigation of a galaxy of annotations

SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
A data dependency based strategy for intermediate data storage in scientific cloud workflow systems

Concurrency and Computation: Practice & Experience
Enabling data and compute intensive workflows in bioinformatics

Euro-Par'11 Proceedings of the 2011 international conference on Parallel Processing - Volume 2
A study on modeling of lightweight scientific workflow systems using XML schema

APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
Using workflows to control the experiment execution in modeling and simulation software

Proceedings of the 5th International ICST Conference on Simulation Tools and Techniques
Workflow management for soft real-time interactive applications in virtualized environments

Future Generation Computer Systems
Challenges and approaches for distributed workflow-driven analysis of large-scale biological data: vision paper

Proceedings of the 2012 Joint EDBT/ICDT Workshops
Toward provenance capturing as cross-cutting concern

TaPP'12 Proceedings of the 4th USENIX conference on Theory and Practice of Provenance
A practical approach to developing a web-based geospatial workflow composition and execution system

Proceedings of the 3rd International Conference on Computing for Geospatial Research and Applications
Using workflows and web services to manage simulation studies (WIP)

Proceedings of the 2012 Symposium on Theory of Modeling and Simulation - DEVS Integrative M&S Symposium
WPS orchestration using the Taverna workbench: The eScience approach

Computers & Geosciences
Parallel software architecture for experimental workflows in computational biology on clouds

PPAM'11 Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part II
Database support for exploring scientific workflow provenance graphs

SSDBM'12 Proceedings of the 24th international conference on Scientific and Statistical Database Management
Towards a Formal Semantics for the Process Model of the Taverna Workbench. Part II

Fundamenta Informaticae
Eco-informatics modelling via semantic inference

Information Systems
Data-intensive architecture for scientific knowledge discovery

Distributed and Parallel Databases
A Distributed Workflow Management System with Case Study of Real-life Scientific Applications on Grids

Journal of Grid Computing
Scripting distributed scientific workflows using Weaver

Concurrency and Computation: Practice & Experience
Data contracts for cloud-based data marketplaces

International Journal of Computational Science and Engineering
Automatic execution of workflows on laser-scanned data for extracting bridge surveying goals

Advanced Engineering Informatics
Enhancing integrated environmental modelling by designing resource-oriented interfaces

Environmental Modelling & Software
Toward self-describing and workflow integrated Earth system models: A coupled atmosphere-ocean modeling system application

Environmental Modelling & Software
CROWN FlowEngine: a GPEL-based grid workflow engine

HPCC'07 Proceedings of the Third international conference on High Performance Computing and Communications
Budget constrained resource allocation for non-deterministic workflows on an IaaS cloud

ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
Extending MPI to better support multi-application interaction

EuroMPI'12 Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface
Scientific workflow management with ADAMS

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Design and implementation of GXP make - A workflow system based on make

Future Generation Computer Systems
Declarative rules for inferring fine-grained data provenance from scientific workflow execution traces

IPAW'12 Proceedings of the 4th international conference on Provenance and Annotation of Data and Processes
Designing and Deploying a Scientific Computing Cloud Platform

GRID '12 Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing
WS-PGRADE/gUSE Generic DCI Gateway Framework for a Large Variety of User Communities

Journal of Grid Computing
Easy Development and Integration of Science Gateways with Vine Toolkit

Journal of Grid Computing
Management and storage of in situ oceanographic data: An ECM-based approach

Information Systems
Foundations and tools for end-user architecting

Proceedings of the 17th Monterey conference on Large-Scale Complex IT Systems: development, operation and management
WorMS - a framework to support workflows in M&S

Proceedings of the Winter Simulation Conference
Using workflows in M&S software

Proceedings of the Winter Simulation Conference
A Scheduling Algorithm for the Distributed Student Registration System in Transaction-Intensive Environment

International Journal of Distance Education Technologies
Towards Next Generation Provenance Systems for e-Science

International Journal of Information System Modeling and Design
DAGwoman: enabling DAGMan-like workflows on non-Condor platforms

Proceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies
A declarative approach to customize workflow provenance

Proceedings of the Joint EDBT/ICDT 2013 Workshops
Scheduling parameter sweep workflow in the Grid based on resource competition

Future Generation Computer Systems
Consequence analysis of complex events on critical U.S. infrastructure

Communications of the ACM
Detecting common scientific workflow fragments using templates and execution provenance

Proceedings of the seventh international conference on Knowledge capture
ReproZip: using provenance to support computational reproducibility

TaPP'13 Proceedings of the 5th USENIX conference on Theory and Practice of Provenance
D-PROV: extending the PROV provenance model with workflow structure

TaPP'13 Proceedings of the 5th USENIX conference on Theory and Practice of Provenance
Seamless coarse grained parallelism integration in intensive bioinformatics workflows

Proceedings of the 20th European MPI Users' Group Meeting
Decentralized orchestration of data-centric workflows in Cloud environments

Future Generation Computer Systems
On an integrated mapping and scheduling solution to large-scale scientific workflows in resource sharing environments

Proceedings of the 46th Annual Simulation Symposium
DynamicCloudSim: simulating heterogeneity in computational clouds

Proceedings of the 2nd ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies
A continuous workflow scheduling framework

Proceedings of the 2nd ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies
Exploiting application dynamism and cloud elasticity for continuous dataflows

SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
SDQuery DSI: integrating data management support with a wide area data transfer protocol

SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
DDS: A deadlock detection-based scheduling algorithm for workflow computations in HPC systems with storage constraints

Parallel Computing
Cloud4SNP: Distributed Analysis of SNP Microarray Data on the Cloud

Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
Improving data transfer performance of web service workflows in the cloud environment

International Journal of Computational Science and Engineering
A case study of trust issues in scientific video collections

Proceedings of the 2nd ACM international workshop on Multimedia analysis for ecological data
A workflow for the prediction of the effects of residue substitution on protein stability

PRIB'13 Proceedings of the 8th IAPR international conference on Pattern Recognition in Bioinformatics
Semantics and provenance for processing element composition in dispel workflows

WORKS '13 Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science
Understanding workflows for distributed computing: nitty-gritty details

WORKS '13 Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science
OpenMOLE, a workflow engine specifically tailored for the distributed exploration of simulation models

Future Generation Computer Systems
An Adaptive Grid Workflow Scheduling Based on Bottleneck Detection and Execution Context

Proceedings of International Conference on Information Integration and Web-based Applications & Services
A roadmap to domain specific programming languages for environmental modeling: key requirements and concepts

Proceedings of the 2013 ACM workshop on Domain-specific modeling
Form-Based Web Service Composition for Domain Experts

ACM Transactions on the Web (TWEB)
Semantics and Planning Based Workflow Composition for Video Processing

Journal of Grid Computing
Exploring Workflow Interoperability for Neuroimage Analysis on the SHIWA Platform

Journal of Grid Computing
Computer-Assisted Scientific Workflow Design

Journal of Grid Computing
Model-as-you-go: An Approach for an Advanced Infrastructure for Scientific Workflows

Journal of Grid Computing
Integration, optimization and usability of enterprise applications

Journal of Network and Computer Applications
A graph distance based metric for data oriented workflow retrieval with variable time constraints

Expert Systems with Applications: An International Journal
An expert system hybrid architecture to support experiment management

Expert Systems with Applications: An International Journal
Modeling and optimizing large-scale data flows

Future Generation Computer Systems
Report from the second workshop on scalable workflow enactment engines and technology (SWEET'13)

ACM SIGMOD Record
Rule-driven service coordination middleware for scientific applications

Future Generation Computer Systems
Towards Scalable and Cost-aware Bioinformatics Workflow Execution in the Cloud - Recent Advances to the Tavaxy Workflow System

Fundamenta Informaticae - Scalable Workflow Enactment Engines and Technology
Hybrid Analytic Flows - the Case for Optimization

Fundamenta Informaticae - Scalable Workflow Enactment Engines and Technology
BeesyBees: A mobile agent-based middleware for a reliable and secure execution of service-based workflow applications in BeesyCluster

Multiagent and Grid Systems - Agent Based Computing: From Model to Implementation

Quantified Score

Hi-index	0.02

Abstract

Many scientific disciplines are now data- and information-driven, and new scientific knowledge is often gained by scientists putting together data analysis and knowledge discovery ‘pipelines’. A related trend is that more and more scientific communities realize the benefits of sharing their data and computational services, and are thus contributing to a distributed data and computational community infrastructure (a.k.a. ‘the Grid’). However, this infrastructure is only a means to an end, and ideally scientists should not have to be too concerned with its existence. The goal is for scientists to focus on the development and use of what we call scientific workflows. These are networks of analytical steps that may involve, e.g., database access and querying steps, data analysis and mining steps, and many other steps, including computationally intensive jobs on high-performance cluster computers. In this paper we describe characteristics of and requirements for scientific workflows as identified in a number of our application projects. We then elaborate on Kepler, a particular scientific workflow system currently under development across a number of scientific data management projects. We describe some key features of Kepler and its underlying Ptolemy II system, planned extensions, and areas of future research. Kepler is a community-driven, open source project, and we always welcome related projects and new contributors to join. Copyright © 2005 John Wiley & Sons, Ltd.