Multi-platform discovery of haplotype-resolved structural variation in human genomes
PublisherNature Publishing Group
MetadataShow full item record
AbstractThe incomplete identification of structural variants (SVs) from whole-genome sequencing data limits studies of human genetic diversity and disease association. Here, we apply a suite of long-read, short-read, strand-specific sequencing technologies, optical mapping, and variant discovery algorithms to comprehensively analyze three trios to define the full spectrum of human genetic variation in a haplotype-resolved manner. We identify 818,054 indel variants (<50 bp) and 27,622 SVs (≥50 bp) per genome. We also discover 156 inversions per genome and 58 of the inversions intersect with the critical regions of recurrent microdeletion and microduplication syndromes. Taken together, our SV callsets represent a three to sevenfold increase in SV detection compared to most standard high-throughput sequencing studies, including those from the 1000 Genomes Project. The methods and the dataset presented serve as a gold standard for the scientific community allowing us to make recommendations for maximizing structural variation sensitivity for future genome sequencing studies. © 2019, The Author(s).
SponsorsFunding for this research project by the Human Genome Structural Variation Consortium (HGSVC).
Rights/TermsAttribution-NonCommercial 3.0 United States
Identifier to cite or link to this itemhttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85060084825&origin=inward; http://hdl.handle.net/10713/8870
The following license files are associated with this item:
Except where otherwise noted, this item's license is described as Attribution-NonCommercial 3.0 United States