The sequence data in this collection was generated by the Kwiatkowski group at the Wellcome Trust Sanger Institute (WTSI) and the Nuffield Department of Medicine, University of Oxford. Please see the WTSI data sharing policy for details on the use of this data, in addition to respecting Fort Lauderdale principles and reading the accompanying Data Reuse policy.
“The purpose of the project is to support the discovery and understanding of genetic variants that influence human disease. Specifically defined goals are (a) the discovery of single nucleotide variants at frequencies of 1% or higher in diverse populations (b) even more comprehensive discovery (variants down to frequencies of 0.1 - 0.5%) in functional gene regions and (c) discovery of structural variants, such as copy number variants, other insertions and deletions, and inversions, including sequence-level understanding of breakpoints.”
The sequence data is 4x coverage of individuals from four Gambian populations.
The data is from the following European Nucleotide Archive (ENA) study IDs:
ERP001420 - Fula
ERP001781 - Jola
ERP002150 - Woloff
ERP002385 - Mandinka
If you have any questions please email email@example.com.