The International Genome Sample Resource

The 1000 Genomes Project created a catalogue of common human genetic variation, using openly consented samples from people who declared themselves to be healthy. The reference data resources generated by the project remain heavily used by the biomedical science community.

The International Genome Sample Resource (IGSR) maintains and shares the human genetic variation resources built by the 1000 Genomes Project. We also update the resources to the current reference assembly, add new data sets generated from the 1000 Genomes Project samples and add data from projects working with other openly consented samples.

Latest Announcements

Thursday September 30, 2021

A variation call set obtained from the analysis of Gambian Genome Variation Project samples on GRCh38

We have recently published a Data Note describing our analysis of 505 samples from four Gambian populations in the Gambian Genome Variation Project (GGVP) on GRCh38.

For the analysis we have used a multi-caller site discovery approach along with imputation and phasing to produce a phased biallelic single nucleotide variant (SNV) and insertion/deletion (INDEL) call set. Variation had not previously been explored on the GRCh38 human genome assembly for 387 of the samples. Compared to our previous work with the 1000 Genomes Project data on GRCh38 described here, we identified over nine million novel SNVs and over 870 thousand novel INDELs.

The files generated in this analysis can be accessed from our FTP. Including the alignment files used in the variant identification http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data_collections/gambian_genome_variation_project/data/ and the call set itself http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data_collections/gambian_genome_variation_project/release/20200217_biallelic_SNV/

More information on the samples analysed in this work can be found in the IGSR portal.

Frequency distributions and genotypes are available in Ensembl.

All announcements