AI-assisted system generates and executes scientific workflows via MCP
Researchers present an AI-assisted scientific workflow management system that combines specification-driven generation, an LLM-based debugging agent, and a Model Context Protocol (MCP) layer integrated with the Pegasus workflow management system to enable non-expert users to construct and execute large-scale pipelines.
Score breakdown
The MCP-integrated, specification-first design removes the need for domain experts to manually author, debug, and submit complex scientific pipelines, making large-scale reproducible workflow execution accessible to non-expert users.
- 01Authors: Komal Thareja, Hamza Safri, Rajiv Mayani
- 02The system introduces a structured specification phase that separates workflow intent, design, and implementation before code generation
- 03An LLM-based debugging agent diagnoses and resolves failures across multiple system layers
Komal Thareja, Hamza Safri, and Rajiv Mayani present an end-to-end AI-assisted framework for scientific workflow management that addresses a core limitation of existing LLM-based approaches: direct code synthesis, which the paper argues limits transparency, reproducibility, and integration with established workflow systems. Their method introduces a structured specification phase that explicitly separates workflow intent, design, and implementation, enabling validation before code generation begins. This separation is intended to make the system's reasoning auditable and to reduce errors that arise when LLMs jump straight to implementation.
The framework also includes an LLM-based debugging agent capable of diagnosing and resolving failures across multiple system layers, reducing the manual effort typically required to troubleshoot complex pipelines.
The framework also includes an LLM-based debugging agent capable of diagnosing and resolving failures across multiple system layers, reducing the manual effort typically required to troubleshoot complex pipelines. For distributed execution and user interaction, the team integrates Pegasus — a widely used scientific workflow management system — with a Model Context Protocol (MCP) layer, creating a unified interface for workflow submission, monitoring, and control.
The approach was evaluated using a federated learning workflow for medical imaging, selected for its parallel, iterative, and dependency-intensive structure. The system successfully generated and executed large-scale workflows containing thousands of jobs, reduced debugging effort, and allowed non-expert users to construct workflows with expert-level design patterns. The authors conclude that end-to-end AI-assisted workflow generation and execution is feasible and point toward AI-driven platforms for managing the full scientific workflow lifecycle.
Key facts
- 01Authors: Komal Thareja, Hamza Safri, Rajiv Mayani
- 02The system introduces a structured specification phase that separates workflow intent, design, and implementation before code generation
- 03An LLM-based debugging agent diagnoses and resolves failures across multiple system layers
- 04Pegasus WMS is integrated with a Model Context Protocol (MCP) layer for workflow submission, monitoring, and control
- 05Evaluated on a federated learning workflow for medical imaging with thousands of jobs
- 06Non-expert users were able to construct workflows with expert-level design patterns
- 07The paper argues direct code synthesis limits transparency, reproducibility, and WMS integration
Topics
Summary and scoring are generated automatically from the original article. We always link back to the publisher and never republish images or paywalled content. Last processed Jun 18, 2026 · 10:40 UTC. How this works →