Publication: Biclustering in bioinformatics using big data and High Performance Computing applications: challenges and perspectives, a review
| dc.contributor.author | López Fernández, Aurelio | |
| dc.contributor.author | Gómez-Vela, Francisco Antonio | |
| dc.contributor.author | Delgado Cháves, Fernando M. | |
| dc.contributor.author | Rodríguez Baena, Domingo Savio | |
| dc.contributor.author | González Dominguez, Jorge | |
| dc.date.accessioned | 2025-07-10T12:39:15Z | |
| dc.date.available | 2025-07-10T12:39:15Z | |
| dc.date.issued | 2025-07-08 | |
| dc.description.abstract | Biclustering is a powerful machine learning technique that simultaneously groups rows and columns in matrix-based datasets. Applied to gene expression data in bioinformatics, its use has expanded alongside the rapid growth of high-throughput sequencing technologies, leading to massive and complex biological datasets. This review aims to examine how biclustering methods and their validation strategies are evolving to meet the demands of High Performance Computing (HPC) and Big Data environments. We present a structured classification of existing approaches based on the computational paradigms they employ, including MPI/OpenMP, Apache Hadoop/Spark, and GPU/CUDA. By synthesising these developments, we highlight current trends and outline key research challenges. The knowledge gathered in this work may support researchers in adapting and scaling biclustering algorithms to analyse large-scale biomedical data more efficiently. Our contribution is intended to bridge the gap between algorithmic innovation and computational scalability in the context of bioinformatics and data-intensive applications. | |
| dc.description.sponsorship | Universidad Pablo de Olavide de Sevilla, Departamento de deporte e informática | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.citation | J Supercomput 81, 1123 (2025). | |
| dc.identifier.doi | 10.1007/s11227-025-07563-6 | |
| dc.identifier.uri | https://hdl.handle.net/10433/24404 | |
| dc.language.iso | en | |
| dc.publisher | Elsevier | |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | en |
| dc.rights.accessRights | open access | |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.subject | Big Data | |
| dc.subject | Biological Databases | |
| dc.subject | Data Analysis and Big Data | |
| dc.subject | Functional clustering | |
| dc.subject | Protein Databases | |
| dc.subject | Bioinformatics | |
| dc.title | Biclustering in bioinformatics using big data and High Performance Computing applications: challenges and perspectives, a review | |
| dc.title.alternative | Biclustering in bioinformatics using big data and High Performance Computing applications: challenges and perspectives, a review | |
| dc.type | journal article | |
| dc.type.hasVersion | VoR | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 5205a971-aeb9-4488-a278-e61cadd3b544 | |
| relation.isAuthorOfPublication | d1d327f0-daff-46c1-af17-bd2b79390ed7 | |
| relation.isAuthorOfPublication | d1ce7621-7f35-4889-935c-e3088a94c1c5 | |
| relation.isAuthorOfPublication | fcc78511-f641-4285-9e74-d071e3e3c0e3 | |
| relation.isAuthorOfPublication.latestForDiscovery | 5205a971-aeb9-4488-a278-e61cadd3b544 |
Files
Original bundle
1 - 1 of 1

