Treffer: Genomic prediction using mCADD scores as prior information in a mouse population.
Weitere Informationen
Although standard genomic prediction (GP) models such as genomic best linear unbiased prediction (GBLUP) assume that single-nucleotide polymorphisms (SNPs) contribute equally to genetic variation, some SNPs may be more informative than others because they are more closely linked to causal variants. GP models could therefore be finetuned by incorporating biological annotations. Here, we used combined annotation dependent depletion (CADD) scores, which reflect the likelihood of a genetic variant being deleterious, as prior information in genomic prediction. Our objective was to determine the benefit of using CADD scores to select or weigh SNPs in genomic prediction. We analyzed 10 traits in a dataset of 835 mice from the diversity outbred (DO) mouse population. For selecting or weighing SNPs, we either used the CADD scores at the exact position of SNPs (CADD-SNP) or the maximum CADD score in a predefined window around the SNPs (CADD-window). In addition, we employed 5 GP models (GBLUP, BayesA, BayesB, BayesC, and BayesR) to analyze different sets of selected SNPs, and a weighted GBLUP model for weighing scenarios. The results showed that selecting SNPs based on CADD-SNP did not improve prediction accuracy. In contrast, compared to using all SNPs, selecting the top 40% of SNPs based on CADD-window was the optimal scenario. This approach effectively removed noninformative SNPs and improved prediction accuracy for at least 6 out of 10 traits. The improvements among these traits ranged from an average of 0.014 for body weight at 10 weeks to 0.094 for bone mineral density across 5 GP models. Weighing (selected) SNPs based on either CADD-SNP or CADD-window had little impact on accuracy. In conclusion, using CADD-window scores to select SNPs improved prediction accuracy, but the benefit depended on the trait of interest and the GP model that was used, while using CADD scores to weigh SNPs did not improve prediction accuracy. [ABSTRACT FROM AUTHOR]
Copyright of Genetics is the property of Oxford University Press / USA and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)