Result: Parallelizing RNA-Seq Analysis with BioSkel: A FastFlow Based Prototype

Title:
Parallelizing RNA-Seq Analysis with BioSkel: A FastFlow Based Prototype
Contributors:
Beauvais, Valentin, Tonci, Nicolo, Robert, Sophie, Limet, Sébastien
Publication Year:
2025
Collection:
ARPI - Archivio della Ricerca dell'Università di Pisa
Document Type:
Academic journal; article in journal/newspaper
Language:
English
Relation:
info:eu-repo/semantics/altIdentifier/wos/WOS:001459447800001; volume:53; issue:2; journal:INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING; https://hdl.handle.net/11568/1329049
DOI:
10.1007/s10766-025-00786-3
Accession Number:
edsbas.A3BA256C
Database:
BASE

Further Information

Over the past decade, the widespread adoption of RNA-seq methodology for transcript-level monitoring has resulted in a surge of biological data requiring comprehensive analysis. The BioSkel project aims to develop a framework for RNA sequencing analysis on multi/many-core machines. This framework relies on generic and modular high-level parallel patterns, enabling biologists to customize their data processing to their specific needs while abstracting away the complexities of parallelization. In this study, we introduce the initial prototype of BioSkel for RNA sequencing analysis, which comprises three main steps: sequence alignment, feature counting, and differential expression analysis. This prototype uses FastFlow as a back-end to parallelize execution in either shared- or distributed-memory settings. We validate our approach experimentally across different architectures and dataset sizes. As a valuable byproduct, we introduce a distributed HPC version of the Bowtie2 tool, to our knowledge the first to be publicly available.