首页|Making a Pipeline Production-Ready: Challenges and Lessons Learned in the Healthcare Domain

Making a Pipeline Production-Ready: Challenges and Lessons Learned in the Healthcare Domain

来源：

英文摘要

Deploying a Machine Learning (ML) training pipeline into production requires good software engineering practices. Unfortunately, the typical data science workflow often leads to code that lacks critical software quality attributes. This experience report investigates this problem in SPIRA, a project whose goal is to create an ML-Enabled System (MLES) to pre-diagnose insufficiency respiratory via speech analysis. This paper presents an overview of the architecture of the MLES, then compares three versions of its Continuous Training subsystem: from a proof of concept Big Ball of Mud (v1), to a design pattern-based Modular Monolith (v2), to a test-driven set of Microservices (v3) Each version improved its overall extensibility, maintainability, robustness, and resiliency. The paper shares challenges and lessons learned in this process, offering insights for researchers and practitioners seeking to productionize their pipelines.

作者：Roberto Oliveira Bolgheroni、Renato Cordeiro Ferreira、Alfredo Goldman、Marcelo Finger、Daniel Angelo Esteves Lawand、Lucas Quaresma Medina Lam

作者单位：

学科分类：医学研究方法

推荐引用：Roberto Oliveira Bolgheroni,Renato Cordeiro Ferreira,Alfredo Goldman,Marcelo Finger,Daniel Angelo Esteves Lawand,Lucas Quaresma Medina Lam.Making a Pipeline Production-Ready: Challenges and Lessons Learned in the Healthcare Domain[EB/OL].(2025-07-06)[2025-07-09].https://arxiv.org/abs/2506.06946.点此复制

Making a Pipeline Production-Ready: Challenges and Lessons Learned in the Healthcare Domain

Making a Pipeline Production-Ready: Challenges and Lessons Learned in the Healthcare Domain

评论