Skip to contents

Sample, pedigree, and population data for 2,504 samples in the Phase 3 release of the 1000 Genomes Project data.

Usage

kgp3

Format

A tibble with 2504 rows and 10 columns:

fid

Family ID

id

Individual ID

pid

Paternal ID

mid

Maternal ID

sex

Sex (1=Male, 2=Female)

sexf

Sex as a factor

pop

Short population code

reg

Short region code

population

Long population description

region

Long region description

References

Byrska-Bishop, Marta, et al. "High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios." Cell 185.18 (2022): 3426-3440.

1000 Genomes Project Consortium. "A global reference for human genetic variation." Nature 526.7571 (2015): 68.

License information is available at https://github.com/igsr/1000Genomes_data_indexes/blob/master/LICENSE. The 1000 Genomes data is made publicly available according to the Fort Lauderdale Agreement (https://www.genome.gov/Pages/Research/WellcomeReport0303.pdf).