Sample, pedigree, and population data for 2,504 samples in the Phase 3 release of the 1000 Genomes Project data.
Format
A tibble with 2504 rows and 10 columns:
- fid
Family ID
- id
Individual ID
- pid
Paternal ID
- mid
Maternal ID
- sex
Sex (1=Male, 2=Female)
- sexf
Sex as a factor
- pop
Short population code
- reg
Short region code
- population
Long population description
- region
Long region description
References
Byrska-Bishop, Marta, et al. "High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios." Cell 185.18 (2022): 3426-3440.
1000 Genomes Project Consortium. "A global reference for human genetic variation." Nature 526.7571 (2015): 68.
License information is available at https://github.com/igsr/1000Genomes_data_indexes/blob/master/LICENSE. The 1000 Genomes data is made publicly available according to the Fort Lauderdale Agreement (https://www.genome.gov/Pages/Research/WellcomeReport0303.pdf).