Not logged in
PANGAEA.
Data Publisher for Earth & Environmental Science

Sachs, Maria; Dünn, Manon (2023): Amplicon sequence variant table of benthic heterotrophic protists of the southern Baltic Sea sampled in 2020-2022 [dataset]. PANGAEA, https://doi.org/10.1594/PANGAEA.961796

Always quote citation above when using data! You can download the citation in several formats below.

Published: 2023-09-12DOI registered: 2023-10-11

RIS CitationBibTeX Citation ShareShow MapGoogle Earth

Abstract:
We aimed to explore the community compostion of benthic heterotrophic protists in three different regions of the southern Baltic Sea - Fehmarnbelt, Oderbank and Roennebank. Sediment samples where collected with a multicorer system and sliced into layered depth profile during three cruises in 2020, 2021 and 2022 with research vessels Elisabeth Mann Borgese (2020, EMB238 and 2021, EMB267) and Alkor (2022, AL570). We performed a paired-end NovaSeq sequencing (2 × 150 bp) run of the amplified V9 region of the 18S rDNA. For subsequent quality measures during data analysis, we created an in vitro community, called a "mock community", comprising DNA of nine different protist cultures, adding this mixture to each individual sequencing run. After sequencing, the raw reads were demultiplexed and the barcode and primer sequences were clipped using cutadapt (Martin 2011). The data was further processed using the the dada2 package in R (Callahan et al. 2016). For taxonomic assignment we used the PR2 database (Protist Ribosomal Reference database, Guillou et al. 2012, https://pr2-database.org/ ) updated with 150 sequences obtained from our own collection using usearch_global (v2.18.0, Rognes et al. 2016). We discarded all Metazoa, fungi, autotrophic protists (determined on the basis of taxonomic assignment) and retained only heterotrophic protists' amplicon sequence variants (ASVs) with a pairwise identity of >80% to a reference sequence. For the main dataset of samples, we then chose individual minimum thresholds per sample according to the accompanying mock community on the respective sequencing lane. For calculation of these thresholds, we used the proportion of the lowest read number of an ASV in the mock community data set that could be assigned to the cultured species.
Keyword(s):
Baltic Sea; Brackish waters; protists; sediment
Supplement to:
Sachs, Maria; Dünn, Manon; Arndt, Hartmut (2023): Benthic Heterotrophic Protist Communities of the Southern Baltic Analyzed with the Help of Curated Metabarcoding Studies. Biology, 12(7), 1010, https://doi.org/10.3390/biology12071010
Related to:
Sachs, Maria; Dünn, Manon (2023): Environmental data related to amplicon sequencing of benthic heterotrophic protists of the southern Baltic Sea, in 2020-2022 [dataset]. PANGAEA, https://doi.org/10.1594/PANGAEA.961784
References:
Callahan, Benjamin J; McMurdie, Paul J; Rosen, Michael J; Han, Andrew W; Johnson, Amy Jo A; Holmes, Susan P (2016): DADA2: High-resolution sample inference from Illumina amplicon data. Nature Methods, 13, 581-583, https://doi.org/10.1038/nmeth.3869
Guillou, Laurent; Bachar, Dipankar; Audic, Stephane; Bass, David; Berney, Cédric; Bittner, Lucie; Boutte, Christophe; Burgaud, Gaétan; De Vargas, Colomban; Decelle, Johan; del Campo, Javier; Dolan, John; Dunthorn, Micah; Bente, Edvardsen; Holzmann, Maria; Kooistra, W H C F; Lara, Enrique; Le Bescot, Noan; Logares, Ramiro; Mahé, Frédéric; Massana, Ramón; Montresor, Marina; Morard, Raphael; Not, Fabrice; Pawlowski, Jan; Probert, Ian; Sauvadet, Anne-Laure; Siano, Raffaele; Stoeck, Thorsten; Vaulot, Daniel; Zimmermann, Pascal; Christen, Richard (2012): The Protist Ribosomal Reference database (PR2): a catalog of unicellular eukaryote Small Sub-Unit rRNA sequences with curated taxonomy. Nucleic Acids Research, 41(D1), D597-D604, https://doi.org/10.1093/nar/gks1160
Martin, Marcel (2011): Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal, 17(1), 10, https://doi.org/10.14806/ej.17.1.200
Rognes, Torbjørn; Flouri, Tomáš; Nichols, Ben; Quince, Christopher; Mahé, Frédéric (2016): VSEARCH: a versatile open source tool for metagenomics. PeerJ, 4, e2584, https://doi.org/10.7717/peerj.2584
Funding:
Federal Ministry of Education and Research (BMBF), grant/award no. 03F0848D: DAM sustainMare - MGF Baltic Sea, University of Cologne
Coverage:
Median Latitude: 54.452870 * Median Longitude: 12.593818 * South-bound Latitude: 54.248971 * West-bound Longitude: 10.686013 * North-bound Latitude: 54.773210 * East-bound Longitude: 14.332776
Date/Time Start: 2020-05-27T12:11:00 * Date/Time End: 2022-04-03T14:10:24
Minimum Elevation: -38.0 m * Maximum Elevation: -11.0 m
Event(s):
AL570_75-1 * Latitude: 54.760610 * Longitude: 13.997050 * Date/Time: 2022-04-03T11:05:50 * Elevation: -38.0 m * Location: Baltic Sea * Campaign: AL570 * Basis: Alkor (1990) * Method/Device: MultiCorer (MUC) * Comment: BoKo
AL570_78-1 * Latitude: 54.773210 * Longitude: 14.015560 * Date/Time: 2022-04-03T14:10:24 * Elevation: -38.0 m * Location: Baltic Sea * Campaign: AL570 * Basis: Alkor (1990) * Method/Device: MultiCorer (MUC)
EMB238_2-4 * Latitude: 54.556067 * Longitude: 10.758716 * Date/Time: 2020-05-27T12:11:00 * Elevation: -20.4 m * Location: Baltic Sea * Campaign: EMB238 (MPA-DAM 2020 A) * Basis: Elisabeth Mann Borgese * Method/Device: MultiCorer (MUC)
Comment:
Explanation of data header:
* ASV_ID = amplicon sequence variant ID
* Number_of_reads = total number of reads
* Percentage_Identity = percentage of identity to a sequence from the reference database
* GenBank_Closer_Match = accession number of the closest match in the database
* Taxonomic ranks from Kingdom up to species level
* Sequences = sequence for each ASV
* Seq_length = sequence length for each ASV
* Reads for each ASV in each individual sample (columns X144 to X781C15 which refer to sample IDs)
The environmental data for each sample was archived separately in PANGAEA (Sachs & Dünn, 2023, https://doi.org/10.1594/PANGAEA.961784)
Status:
Curation Level: Enhanced curation (CurationLevelC)
Size:
678.9 kBytes

Download Data

Download dataset