Assessing Runs of Homozygosity: A comparison of SNP Array and Whole Genome Sequence low coverage data
Assessing Runs of Homozygosity: A comparison of SNP Array and Whole Genome Sequence low coverage data
Abstract Runs of Homozygosity (ROH) are sequences that arise when identical haplotypes are inherited from each parent. Since their first detection due to technological advances in the late 1990s, ROHs have been shedding light on human population history and deciphering the genetic basis of monogenic and complex traits and diseases. ROH studies have predominantly exploited SNP array data, but are gradually moving to whole genome sequence (WGS) data as it becomes available. WGS data, covering more genetic variability, can add value to ROH studies, but require additional considerations during analysis. Using SNP array and low coverage WGS data from 1885 individuals from 20 world populations, our aims were to compare ROH from the two datasets and to establish software conditions to get comparable results, thus providing guidelines for combining disparate datasets in joint ROH analyses. Using the PLINK Homozygosity functions, we found that by allowing 3 heterozygous SNPs per window when dealing with WGS low coverage data, it is possible to establish meaningful comparisons between data using the two technologies.
Ceballos Francisco C.、Ramsay Mich¨¨le、Hazelhurst Scott
Sydney Brenner Institute for Molecular Bioscience, Faculty of Health Sciences, University of the WitwatersrandSydney Brenner Institute for Molecular Bioscience, Faculty of Health Sciences, University of the Witwatersrand||Division of Human Genetics, School of Pathology, Faculty of Health Sciences, University of the WitwatersrandSydney Brenner Institute for Molecular Bioscience, Faculty of Health Sciences, University of the Witwatersrand||School of Electrical & Information Engineering, University of the Witwatersrand
遗传学生物科学研究方法、生物科学研究技术
Runs of HomozygosityROHSNP array dataWGS low coverage data
Ceballos Francisco C.,Ramsay Mich¨¨le,Hazelhurst Scott.Assessing Runs of Homozygosity: A comparison of SNP Array and Whole Genome Sequence low coverage data[EB/OL].(2025-03-28)[2025-06-08].https://www.biorxiv.org/content/10.1101/160705.点此复制
评论