Correction: Can we use it? On the utility of de novo and reference-based assembly of Nanopore data for plant plastome sequencing

التفاصيل البيبلوغرافية
العنوان: Correction: Can we use it? On the utility of de novo and reference-based assembly of Nanopore data for plant plastome sequencing
المؤلفون: Marco Dorfner, Christoph Oberprieler, Agnes Scheunert, Thomas Lingl
المصدر: PLoS ONE
PLoS ONE, Vol 15, Iss 3, p e0226234 (2020)
بيانات النشر: Universität Regensburg, 2020.
سنة النشر: 2020
مصطلحات موضوعية: Chloroplasts, CHLOROPLAST DNA, READ ALIGNMENT, GENOME, MITOCHONDRIAL, EVOLUTIONARY, ANGIOSPERMS, POPULATION, REPEAT, REGION, RATES, Sequence assembly, Plant Genomes, Artificial Gene Amplification and Extension, Plant Science, Plant Genetics, Genome, Polymerase Chain Reaction, Nanopores, Database and Informatics Methods, Phylogenomics, Plant Genomics, Genome Sequencing, Phylogeny, Chromosome Mapping, High-Throughput Nucleotide Sequencing, food and beverages, Genomics, 580 Pflanzen (Botanik), Nanopore, Chloroplast DNA, Medicine, Engineering and Technology, Cellular Structures and Organelles, Cellular Types, Sequence Analysis, Research Article, Biotechnology, Bioinformatics, Science, Plant Cell Biology, Bioengineering, Computational biology, Biology, Research and Analysis Methods, Chloroplast Genome, Plant Cells, Consensus sequence, Genetics, ddc:580, Indel, Genome, Chloroplast, Molecular Biology Techniques, Sequencing Techniques, Molecular Biology, Sequence Assembly Tools, Correction, Biology and Life Sciences, Computational Biology, Cell Biology, Comparative Genomics, Genome Analysis, Plant Biotechnology, Nanopore sequencing, Leucanthemum, Sequence Alignment
الوصف: The chloroplast genome harbors plenty of valuable information for phylogenetic research. Illumina short-read data is generally used forde novoassembly of whole plastomes. PacBio or Oxford Nanopore long reads are additionally employed in hybrid approaches to enable assembly across the highly similar inverted repeats of a chloroplast genome. Unlike for PacBio, plastome assemblies based solely on Nanopore reads are rarely found, due to their high error rate and non-random error profile. However, the actual quality decline connected to their use has never been quantified. Furthermore, no study has employed reference-based assembly using Nanopore reads, which is common with Illumina data. UsingLeucanthemumMill. as an example, we compared the sequence quality of seven plastome assemblies of the same species, using combinations of two sequencing platforms and three analysis pipelines. In addition, we assessed the factors which might influence Nanopore assembly quality during sequence generation and bioinformatic processing.The consensus sequence derived fromde novoassembly of Nanopore data had a sequence identity of 99.59% compared to Illumina short-readde novoassembly. Most of the found errors comprise indels (81.5%), and a large majority of them is part of homopolymer regions. The quality of reference-based assembly is heavily dependent upon the choice of a close-enough reference. Using a reference with 0.83% sequence divergence from the studied species, mapping of Nanopore reads results in a consensus comparable to that from Nanoporede novoassembly, and of only slightly inferior quality compared to a reference-based assembly with Illumina data (0.49% and 0.26% divergence from Illuminade novo). For optimal assembly of Nanopore data, appropriate filtering of contaminants and chimeric sequences, as well as employing moderate read coverage, is essential.Based on these results, we conclude that Nanopore long reads are a suitable alternative to Illumina short reads in plastome phylogenomics. Only few errors remain in the finalized assembly, which can be easily masked in phylogenetic analyses without loss in analytical accuracy. The easily applicable and cost-effective technology might warrant more attention by researchers dealing with plant chloroplast genomes.
وصف الملف: application/pdf
DOI: 10.5283/epub.43098
الوصول الحر: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f81d359b80d58e7d29d79ed45f84ac54Test
حقوق: OPEN
رقم الانضمام: edsair.doi.dedup.....f81d359b80d58e7d29d79ed45f84ac54
قاعدة البيانات: OpenAIRE