Biclustering in bioinformatics using big data and High Performance Computing applications: challenges and perspectives, a review

López Fernández, Aurelio; Gómez-Vela, Francisco Antonio; Delgado Cháves, Fernando M.; Rodríguez Baena, Domingo Savio; González Dominguez, Jorge

Publication:
Biclustering in bioinformatics using big data and High Performance Computing applications: challenges and perspectives, a review

Files

SurveyBiclustering.pdf (1.41 MB)

Identifiers

URI: https://hdl.handle.net/10433/24404

DOI: 10.1007/s11227-025-07563-6

Publication date

2025-07-08

Authors

López Fernández, Aurelio

Gómez-Vela, Francisco Antonio

Delgado Cháves, Fernando M.

Rodríguez Baena, Domingo Savio

González Dominguez, Jorge

Publisher

Elsevier

Export

Abstract

Biclustering is a powerful machine learning technique that simultaneously groups rows and columns in matrix-based datasets. Applied to gene expression data in bioinformatics, its use has expanded alongside the rapid growth of high-throughput sequencing technologies, leading to massive and complex biological datasets. This review aims to examine how biclustering methods and their validation strategies are evolving to meet the demands of High Performance Computing (HPC) and Big Data environments. We present a structured classification of existing approaches based on the computational paradigms they employ, including MPI/OpenMP, Apache Hadoop/Spark, and GPU/CUDA. By synthesising these developments, we highlight current trends and outline key research challenges. The knowledge gathered in this work may support researchers in adapting and scaling biclustering algorithms to analyse large-scale biomedical data more efficiently. Our contribution is intended to bridge the gap between algorithmic innovation and computational scalability in the context of bioinformatics and data-intensive applications.

Keywords

Big Data
Biological Databases
Data Analysis and Big Data
Functional clustering
Protein Databases
Bioinformatics

Bibliographic reference

J Supercomput 81, 1123 (2025).

Collections

DDI - Artículos de revistas

Full item page

Publication:
Biclustering in bioinformatics using big data and High Performance Computing applications: challenges and perspectives, a review

Files

Identifiers

Publication date

Reading date

Event date

Start date of the public exhibition period

End date of the public exhibition period

Authors

Advisors

Authors of photography

Person who provides the photography

Journal Title

Journal ISSN

Volume Title

Publisher

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Doctoral program

Related publication

Research projects

Description

Keywords

Bibliographic reference

Photography rights

Collections

Publication: Biclustering in bioinformatics using big data and High Performance Computing applications: challenges and perspectives, a review

Files

Identifiers

Publication date

Reading date

Event date

Start date of the public exhibition period

End date of the public exhibition period

Authors

Advisors

Authors of photography

Person who provides the photography

Journal Title

Journal ISSN

Volume Title

Publisher

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Doctoral program

Related publication

Research projects

Description

Keywords

Bibliographic reference

Photography rights

Collections

Publication:
Biclustering in bioinformatics using big data and High Performance Computing applications: challenges and perspectives, a review