Treffer: Scanflow: an end-to-end agent-based autonomic ML workflow manager for clusters

Title:
Scanflow: an end-to-end agent-based autonomic ML workflow manager for clusters
Publisher Information:
Association for Computing Machinery (ACM) 2021
Document Type:
E-Ressource Electronic Resource
Availability:
Open access content. Open access content
Open Access
Note:
2 p.
application/pdf
English
Other Numbers:
HGF oai:upcommons.upc.edu:2117/359094
Liu, P. [et al.]. Scanflow: an end-to-end agent-based autonomic ML workflow manager for clusters. A: ACM/IFIP International Middleware Conference. "Middleware'21 demos and posters: proceedings of the 2021 International Middleware Conference Demos and Posters: December 6-10, 2021, Virtual event, Canada". New York: Association for Computing Machinery (ACM), 2021, p. 1-2. ISBN 978-1-4503-9154-2. DOI 10.1145/3491086.3492468.
978-1-4503-9154-2
10.1145/3491086.3492468
1298723747
Contributing Source:
UNIV POLITECNICA DE CATALUNYA
From OAIster®, provided by the OCLC Cooperative.
Accession Number:
edsoai.on1298723747
Database:
OAIster

Weitere Informationen

Machine Learning (ML) is more than just training models, the whole life-cycle must be considered. Once deployed, a ML model needs to be constantly managed, supervised and debugged to guarantee its availability, validity and robustness in dynamic contexts. This demonstration presents an agent-based ML workflow manager so-called Scanflow1, which enables autonomic management and supervision of the end-to-end life-cycle of ML workflows on distributed clusters. The case study on a MNIST project2 shows that different teams can collaborate using Scanflow within a ML project at different phases, and the effectiveness of agents to maintain the model accuracy and throughput of the model serving while running in production.
This work was partially supported by Lenovo as part of LenovoBSC 2020 collaboration agreement, by the Spanish Government under contract PID2019-107255GB-C22, and by the Generalitat de Catalunya under contract 2017-SGR-1414 and under grant 2020 FI-B 00257.
Postprint (published version)