Titel
Cloud-based Approach on Genetic Data Imputation Parameters’ Optimization
Abstract
The imputation process for genetic data is cost and time-intensive, primarily due to the high complexity of the methods involved, and the substantial volume of data processed. A thorough performance evaluation of the imputation algorithms such as Beagle, AlphaPlantImpute, LinkImputeR, MACH and others shows that while some algorithms are highly accurate, they are often computationally expensive. Being widely used, they have multiple input parameters which impact the quality and accuracy of the imputation. Traditional machine learning techniques for parameter optimization like grid search and randomized search become inefficient in high-dimensional parameter spaces, leading to prohibitive computational costs, especially in large-scale applications. Our study proposes the cloud-based approach for input parameters optimization by using Bayesian optimization with consecutive Domain Reduction Transformer (DRT). Described algorithm and developed library allow users to find the optimal input parameters for the data imputation in a more flexible way.
Stichwort
Bayesian optimizationparameters optimizationBeaglecloud technologiesdistributed calculationsbioinformatics
Objekt-Typ
Sprache
Englisch [eng]
Persistent identifier
Erschienen in
Titel
CEUR Workshop Proceedings (CEUR-WS.org)
Band
3892
ISSN
1613-0073
Erscheinungsdatum
07.01.2025
Seitenanfang
279
Seitenende
286
Publication
CEUR-WS Team, Redaktion Sun SITE, Informatik V, RWTH Aachen
Erscheinungsdatum
07.01.2025
Zugänglichkeit
Rechteangabe
© 2024 Copyright for this paper by its authors

Herunterladen

Universität Wien | Universitätsring 1 | 1010 Wien | T +43-1-4277-0