Title:
Massively Parallel Implementation of Sequence Alignment with Basic Local Alignment Search Tool Using Parallel Computing in Java Library.
Authors:
Nowicki M (Faculty of Mathematics and Computer Science, Nicolaus Copernicus University in Toruń, Poland); Bzhalava D (Department of Laboratory Medicine, Karolinska Institutet, Stockholm, Sweden); Bała P (Interdisciplinary Center for Mathematical and Computational Modeling, University of Warsaw, Warsaw, Poland)
Source:
Journal of computational biology : a journal of computational molecular cell biology [J Comput Biol] 2018 Aug; Vol. 25 (8), pp. 871-881. Date of Electronic Publication: 2018 Jul 13.
Publication Type:
Journal Article; Research Support, Non-U.S. Gov't
Language:
English
Journal Info:
Publisher: Mary Ann Liebert, Inc; Country of Publication: United States; NLM ID: 9433358; Publication Model: Print-Electronic; Cited Medium: Internet; ISSN: 1557-8666 (Electronic); Linking ISSN: 1066-5277; NLM ISO Abbreviation: J Comput Biol; Subsets: MEDLINE
Imprint Name(s):
Original Publication: New York, NY : Mary Ann Liebert, Inc., c1994-
Contributed Indexing:
Keywords: BLAST; Java; PCJ; next-generation sequencing; sequence alignment
Entry Date(s):
Date Created: 20180714; Date Completed: 20191023; Latest Revision: 20191023
Update Code:
20250114
DOI:
10.1089/cmb.2018.0079
PMID:
30004240
Database:
MEDLINE

Basic Local Alignment Search Tool (BLAST) is an essential algorithm that researchers use for sequence alignment analysis. The National Center for Biotechnology Information (NCBI)-BLAST application is the most popular implementation of the BLAST algorithm. It can run on a single multithreaded node. However, the volume of nucleotide and protein data is growing fast, making a single node insufficient. It is increasingly important to develop high-performance computing solutions that help researchers analyze genetic data in a fast and scalable way. This article presents the execution of the BLAST algorithm on high-performance computing (HPC) clusters and supercomputers in a massively parallel manner using thousands of processors. The Parallel Computing in Java (PCJ) library has been used to implement the optimal splitting of the input queries, the work distribution, and the search management. It is used with the unmodified NCBI-BLAST package, which is an additional advantage for users. The resulting application, PCJ-BLAST, is responsible for reading the query sequences, splitting them up, and starting multiple NCBI-BLAST executables. Since I/O performance could limit sequence analysis performance, the article includes an investigation of this problem. The results show that, using Java and the PCJ library, it is possible to perform sequence analysis on hundreds of nodes in parallel. We have achieved excellent performance and efficiency, and we have significantly reduced the time required for sequence analysis. Our work also shows that the PCJ library can be used as an effective tool for the fast development of scalable applications.
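
The split-and-dispatch idea described in the abstract can be sketched in plain Java. This is only an illustration under stated assumptions, not the PCJ-BLAST implementation: it does not use the PCJ library, the abstract does not say how the "optimal splitting" is computed (simple round-robin assignment of whole FASTA records stands in for it here), and the class and method names (`QuerySplitter`, `splitFasta`, `blastCommand`) are hypothetical. The choice of `blastn` and of the `-query`, `-db`, and `-out` options from the NCBI BLAST+ command line is likewise an assumption for the example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of PCJ-BLAST or the PCJ library.
public class QuerySplitter {

    // Splits the lines of a FASTA file into `parts` chunks.
    // Whole records (a '>' header plus its sequence lines) are assigned
    // round-robin, so no record is ever cut in the middle.
    public static List<List<String>> splitFasta(List<String> lines, int parts) {
        if (parts < 1) {
            throw new IllegalArgumentException("parts must be >= 1");
        }
        List<List<String>> chunks = new ArrayList<>();
        for (int i = 0; i < parts; i++) {
            chunks.add(new ArrayList<>());
        }
        int record = -1; // index of the current record; -1 until the first header
        for (String line : lines) {
            if (line.startsWith(">")) {
                record++;
            }
            if (record < 0 || line.isEmpty()) {
                continue; // skip blank lines and anything before the first header
            }
            chunks.get(record % parts).add(line);
        }
        return chunks;
    }

    // Builds the argument list for one unmodified BLAST+ executable.
    // Assumed options: -query (input chunk), -db (database), -out (result file).
    public static List<String> blastCommand(String queryFile, String database, String outFile) {
        List<String> cmd = new ArrayList<>();
        cmd.add("blastn");
        cmd.add("-query");
        cmd.add(queryFile);
        cmd.add("-db");
        cmd.add(database);
        cmd.add("-out");
        cmd.add(outFile);
        return cmd;
    }
}
```

In such a setup, each worker would write its chunk to a file and pass the resulting argument list to `new ProcessBuilder(cmd).start()`. Because every worker then reads its own query file and the shared database, this is also the point where the I/O load mentioned in the abstract arises.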