Publication:
Biclustering in bioinformatics using big data and High Performance Computing applications: challenges and perspectives, a review

dc.contributor.authorLópez Fernández, Aurelio
dc.contributor.authorGómez-Vela, Francisco Antonio
dc.contributor.authorDelgado Cháves, Fernando M.
dc.contributor.authorRodríguez Baena, Domingo Savio
dc.contributor.authorGonzález Dominguez, Jorge
dc.date.accessioned2025-07-10T12:39:15Z
dc.date.available2025-07-10T12:39:15Z
dc.date.issued2025-07-08
dc.description.abstractBiclustering is a powerful machine learning technique that simultaneously groups rows and columns in matrix-based datasets. Applied to gene expression data in bioinformatics, its use has expanded alongside the rapid growth of high-throughput sequencing technologies, leading to massive and complex biological datasets. This review aims to examine how biclustering methods and their validation strategies are evolving to meet the demands of High Performance Computing (HPC) and Big Data environments. We present a structured classification of existing approaches based on the computational paradigms they employ, including MPI/OpenMP, Apache Hadoop/Spark, and GPU/CUDA. By synthesising these developments, we highlight current trends and outline key research challenges. The knowledge gathered in this work may support researchers in adapting and scaling biclustering algorithms to analyse large-scale biomedical data more efficiently. Our contribution is intended to bridge the gap between algorithmic innovation and computational scalability in the context of bioinformatics and data-intensive applications.
dc.description.sponsorshipUniversidad Pablo de Olavide de Sevilla, Departamento de deporte e informática
dc.format.mimetypeapplication/pdf
dc.identifier.citationJ Supercomput 81, 1123 (2025).
dc.identifier.doi10.1007/s11227-025-07563-6
dc.identifier.urihttps://hdl.handle.net/10433/24404
dc.language.isoen
dc.publisherElsevier
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectBig Data
dc.subjectBiological Databases
dc.subjectData Analysis and Big Data
dc.subjectFunctional clustering
dc.subjectProtein Databases
dc.subjectBioinformatics
dc.titleBiclustering in bioinformatics using big data and High Performance Computing applications: challenges and perspectives, a review
dc.title.alternativeBiclustering in bioinformatics using big data and High Performance Computing applications: challenges and perspectives, a review
dc.typejournal article
dc.type.hasVersionVoR
dspace.entity.typePublication
relation.isAuthorOfPublication5205a971-aeb9-4488-a278-e61cadd3b544
relation.isAuthorOfPublicationd1d327f0-daff-46c1-af17-bd2b79390ed7
relation.isAuthorOfPublicationd1ce7621-7f35-4889-935c-e3088a94c1c5
relation.isAuthorOfPublicationfcc78511-f641-4285-9e74-d071e3e3c0e3
relation.isAuthorOfPublication.latestForDiscovery5205a971-aeb9-4488-a278-e61cadd3b544

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
SurveyBiclustering.pdf
Size:
1.41 MB
Format:
Adobe Portable Document Format