Sample, pedigree, and population data for 3,202 samples in the expanded 1000 Genomes Project data.
Format
A tibble with 3202 rows and 11 columns:
- fid
Family ID
- id
Individual ID
- pid
Paternal ID
- mid
Maternal ID
- sex
Sex (1=Male, 2=Female)
- sexf
Sex as a factor
- pop
Short population code
- reg
Short region code
- population
Long population description
- region
Long region description
- phase3
Logical; indicates whether this sample is included in the Phase 3 release data
References
Byrska-Bishop, Marta, et al. "High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios." Cell 185.18 (2022): 3426-3440.
1000 Genomes Project Consortium. "A global reference for human genetic variation." Nature 526.7571 (2015): 68.
License information is available at https://github.com/igsr/1000Genomes_data_indexes/blob/master/LICENSE. The 1000 Genomes data is made publicly available according to the Fort Lauderdale Agreement (https://www.genome.gov/Pages/Research/WellcomeReport0303.pdf).