Presently, imputation is widely established for genetic data, specifically, Single Nucleotide Polymorphisms (SNPs). No analogue currently exists for other omics layers, such as the epigenetic modification, DNA methylation (DNAm). DNAm is critical to gene regulation and is also able to accurately track lifestyle behaviours and environmental exposures.
We will use the world’s largest saliva- and blood-based DNAm datasets with replication of our findings across other cohorts of diverse ancestries, ages, and backgrounds. By using a variety of statistical ML and AI approaches, we will create a robust imputation process for DNAm that will be implemented via a front-end server. This will enable the global research community to save £millions from generating new array data and lead to new discoveries in association studies within biomedical research.