CaltechAUTHORS
  A Caltech Library Service

A workflow for accurate metabarcoding using nanopore MinION sequencing

Baloğlu, Bilgenur and Chen, Zhewei and Elbrecht, Vasco and Braukmann, Thomas and MacDonald, Shanna and Steinke, Dirk (2021) A workflow for accurate metabarcoding using nanopore MinION sequencing. Methods in Ecology and Evolution, 12 (5). pp. 794-804. ISSN 2041-210X. doi:10.1111/2041-210x.13561. https://resolver.caltech.edu/CaltechAUTHORS:20200526-120017153

[img] PDF - Published Version
Creative Commons Attribution.

1MB
[img] PDF - Submitted Version
Creative Commons Attribution Non-commercial.

477kB
[img] MS Excel (Table S1) - Supplemental Material
Creative Commons Attribution.

42kB
[img] MS Excel (Table S3) - Supplemental Material
Creative Commons Attribution.

145kB
[img] PDF - Supplemental Material
Creative Commons Attribution.

2MB

Use this Persistent URL to link to this item: https://resolver.caltech.edu/CaltechAUTHORS:20200526-120017153

Abstract

1. Metabarcoding has become a common approach to the rapid identification of the species composition in a mixed sample. The majority of studies use established short‐read high‐throughput sequencing platforms. The Oxford Nanopore MinION™, a portable sequencing platform, represents a low‐cost alternative allowing researchers to generate sequence data in the field. However, a major drawback is the high raw read error rate that can range from 10% to 22%. 2. To test whether the MinION™ represents a viable alternative to other sequencing platforms, we used rolling circle amplification (RCA) to generate full‐length consensus DNA barcodes for a bulk mock sample of 50 aquatic invertebrate species with at least 15% genetic distance to each other. By applying two different laboratory protocols, we generated two MinION™ runs that were used to build error‐corrected consensus sequences. A newly developed Python pipeline, ASHURE, was used for data processing, consensus building, clustering and taxonomic assignment of the resulting reads. 3. Our pipeline achieved median accuracies of up to 99.3% for long concatemeric reads (>45 barcodes) and successfully identified all 50 species in the mock community. The use of RCA was integral for increasing consensus accuracy but was also the most time‐consuming step of the laboratory workflow. Most concatemeric reads were skewed towards a shorter read length range with a median read length of up to 1,262 bp. 4. Our study demonstrates that Nanopore sequencing can be used for metabarcoding, but exploration of other isothermal amplification procedures to improve consensus accuracy is recommended.


Item Type:Article
Related URLs:
URLURL TypeDescription
https://doi.org/10.1111/2041-210x.13561DOIArticle
https://doi.org/10.1101/2020.05.21.108852DOIDiscussion Paper
https://publons.com/publon/10.1111/2041-210X.13561/Related ItemPeer Review
https://doi.org/10.5281/zenodo.4450611DOIData
https://github.com/BBaloglu/ASHURERelated ItemCode
ORCID:
AuthorORCID
Baloğlu, Bilgenur0000-0002-4434-5356
Chen, Zhewei0000-0002-7422-095X
Elbrecht, Vasco0000-0003-4672-7099
Braukmann, Thomas0000-0002-2452-3776
Steinke, Dirk0000-0002-8992-575X
Additional Information:© 2021 The Authors. Methods in Ecology and Evolution published by John Wiley & Sons Ltd on behalf of British Ecological Society. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. Issue Online: 04 May 2021; Version of Record online: 18 February 2021; Accepted manuscript online: 28 January 2021; Manuscript accepted: 07 January 2021; Manuscript received: 26 May 2020. We thank all staff at the Centre for Biodiversity Genomics who helped to collect the samples employed to assemble the mock community. We also would like to thank Florian Leese, Arne Beermann, Cristina Hartmann‐Fatu, and Marie Gutgesell for collecting and providing specimens. This study was supported by funding through the Canada First Research Excellence Fund. The funders had no role in study design, data collection, and analysis, decision to publish, or preparation of the manuscript. This work represents a contribution to the University of Guelph Food From Thought research program. Authors' Contributions: B.B., V.E., T.B. and D.S. designed the experiments; B.B. and S.M. assembled the mock community; B.B. did the laboratory work; V.E. did the MiSeq experiment; B.B. and Z.C. analysed the data and built the bioinformatics pipeline; B.B. and D.S. wrote the manuscript, all authors contributed to the manuscript. Peer Review: The peer review history for this article is available at https://publons.com/publon/10.1111/2041‐210X.13561. Data Availability Statement: Software ASHURE is deposited at Zenodo https://doi.org/10.5281/zenodo.4450611 (Baloğlu & Chen, 2021) and is available at Github under https://github.com/BBaloglu/ASHURE. Raw read data are available at the SRA under PRJNA627498 (ONT: Protocols A and B) and SRR9207930 (Illumina MiSeq).
Funders:
Funding AgencyGrant Number
Canada First Research Excellence FundUNSPECIFIED
Subject Keywords:bioinformatics pipeline, metabarcoding, Nanopore sequencing, rolling circle amplification
Issue or Number:5
DOI:10.1111/2041-210x.13561
Record Number:CaltechAUTHORS:20200526-120017153
Persistent URL:https://resolver.caltech.edu/CaltechAUTHORS:20200526-120017153
Official Citation:Baloğlu, B, Chen, Z, Elbrecht, V, Braukmann, T, MacDonald, S, Steinke, D. A workflow for accurate metabarcoding using nanopore MinION sequencing. Methods Ecol Evol. 2021; 12: 794–804. https://doi.org/10.1111/2041-210X.13561
Usage Policy:No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code:103450
Collection:CaltechAUTHORS
Deposited By: Tony Diaz
Deposited On:26 May 2020 19:34
Last Modified:07 May 2021 17:22

Repository Staff Only: item control page