Data from: Remarkably divergent regions punctuate the genome assembly of the Caenorhabditis elegans Hawaiian strain CB4856

التفاصيل البيبلوغرافية
العنوان: Data from: Remarkably divergent regions punctuate the genome assembly of the Caenorhabditis elegans Hawaiian strain CB4856
المؤلفون: Thompson, Owen A., Snoek, L. Basten, Nijveen, Harm, Sterken, Mark G., Volkers, Rita J. M., Brenchley, Rachel, van't Hof, Arjen, Bevers, Roel P. J., Cossins, Andrew R., Yanai, Itai, Hajnal, Alex, Schmid, Tobias, Perkins, Jaryn D., Spencer, David, Kruglyak, Leonid, Andersen, Erik C., Moerman, Donald G., Hillier, LaDeana W., Kammenga, Jan E., Waterston, Robert H.
سنة النشر: 2016
المجموعة: Zenodo
مصطلحات موضوعية: Caenorhabditis elegans, genome assembly
الوصف: The Hawaiian strain (CB4856) of Caenorhabditis elegans is one of the most divergent from the canonical laboratory strain N2 and has been widely used in developmental, population and evolutionary studies. To enhance the utility of the strain, we have generated a draft sequence of the CB4856 genome, exploiting a variety of resources and strategies. The CB4856 genome when compared against the N2 reference has 327,050 single nucleotide variants (SNVs) and 79,529 insertion-deletion events (indels) that result in a total of 3.3 megabasepairs (Mb) of N2 sequence missing from CB4856 and 1.4 Mb of sequence present in CB4856 not present in N2. As previously reported, the density of SNVs varies along the chromosomes, with the arms of chromosomes showing greater average variation than the centers. In addition, we find 61 regions totaling 2.8 Mb, distributed across all six chromosomes, that have a greatly elevated SNV density, ranging from 2% to 16% SNVs. A survey of other wild isolates show that the two alternative haplotypes for each region are widely distributed, suggesting they have been maintained by balancing selection over long evolutionary times. These divergent regions contain an abundance of genes from large rapidly evolving families encoding F-box, MATH, BATH, seven-transmembrane G-coupled receptors, and nuclear hormone receptors suggesting that they provide selective advantages in natural environments. The draft sequence makes available a comprehensive catalog of sequence differences between the CB4856 and N2 strains that will facilitate the molecular dissection of their phenotypic differences. Our work also emphasizes the importance of going beyond simple alignment of reads to a reference genome when assessing differences between genomes. ; File_S1_Celegans_N2_to_CB4856_lastz_alignment.txtLASTZ alignments between C. elegans N2 (WormBase version WS230) and C. elegans Hawaiian CB4856 (Genbank accession: JZEW00000000)
نوع الوثيقة: dataset
اللغة: unknown
العلاقة: https://zenodo.org/communities/dryadTest; https://zenodo.org/record/5024569Test; https://doi.org/10.5061/dryad.1k8kqTest; oai:zenodo.org:5024569
DOI: 10.5061/dryad.1k8kq
الإتاحة: https://doi.org/10.5061/dryad.1k8kqTest
https://doi.org/10.1534/genetics.115.175950Test
https://zenodo.org/record/5024569Test
حقوق: info:eu-repo/semantics/openAccess ; https://creativecommons.org/publicdomain/zero/1.0/legalcodeTest
رقم الانضمام: edsbas.FE63364C
قاعدة البيانات: BASE