The Wellcome Trust Sanger Institute [Cambridge], Centre National de Génotypage (CNG), Commissariat à l'énergie atomique et aux énergies alternatives (CEA), University of Michigan [Ann Arbor], University of Michigan System, The 1000 Genomes Project Consortium, Harvard University--MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology. Department of Biology, Lander, Eric S., Altshuler, David, Daly, Mark J., Grossman, Sharon Rachel, Jaffe, David B., Korn, Joshua M., Dermitzakis, Emmanouil
المصدر:
Nature Nature, 2010, 467 (7319), pp.1061-1073. ⟨10.1038/nature09534⟩ Nature; Vol 467 PMC ResearcherID Nature, 467(7319), 1061-1073 Durbin, R M, Altshuler, D, Abecasis, G R, Bentley, D R, Chakravarti, A, Clark, A G, Collins, F S, de la Vega, F M, Donnelly, P, Egholm, M, Flicek, P, Gabriel, S B, Gibbs, R A, Knoppers, B M, Lander, E S, Lehrach, H, Mardis, E R, McVean, G A, Nickerson, D A, Peltonen, L, Schafer, A J, Sherry, S T, Wang, J, Wilson, R K, Li, R & The 1000 Genomes Project Consortium 2010, ' A map of human genome variation from population-scale sequencing ', Nature, vol. 467, no. 7319, pp. 1061–1073 . https://doi.org/10.1038/nature09534Test Nature, Vol. 467, No 7319 (2010) pp. 1061-73
April 1, 2011 The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype. Here we present results of the pilot phase of the project, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms. We undertook three projects: low-coverage whole-genome sequencing of 179 individuals from four populations; high-coverage sequencing of two mother–father–child trios; and exon-targeted sequencing of 697 individuals from seven populations. We describe the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants, most of which were previously undescribed. We show that, because we have catalogued the vast majority of common variation, over 95% of the currently accessible variants found in any individual are present in this data set. On average, each person is found to carry approximately 250 to 300 loss-of-function variants in annotated genes and 50 to 100 variants previously implicated in inherited disorders. We demonstrate how these results can be used to inform association and functional studies. From the two trios, we directly estimate the rate of de novo germline base substitution mutations to be approximately 10[superscript −8] per base pair per generation. We explore the data with regard to signatures of natural selection, and identify a marked reduction of genetic variation in the neighbourhood of genes, due to selection at linked sites. These methods and public data will support the next phase of human genetic research. Ministry of Science and Technology of the People's Republic of China (973 program no. 2011CB809200) National Natural Science Foundation (China) (30725008) National Natural Science Foundation (China) (30890032) National Natural Science Foundation (China) (30811130531) National Natural Science Foundation (China) (30221004) China (Chinese 863 program 2006AA02Z177) China (Chinese 863 program 2006AA02Z334) China (Chinese 863 program 2006AA02A302) China (Chinese 863 program 2009AA022707) National Institutes of Health (U.S.) (Grant U54HG2750) National Institutes of Health (U.S.) (Grant U01HG5208) National Institutes of Health (U.S.) (Grant U54HG3067)