Not logged in
PANGAEA.
Data Publisher for Earth & Environmental Science

De Vargas, Colomban; Bittner, Lucie (2010): Svalbard 2010 mesocosm experiment: Protistan diversity [dataset]. Station Biologique de Roscoff, PANGAEA, https://doi.org/10.1594/PANGAEA.777733, In: Svalbard 2010 team (2010): EPOCA Svalbard 2010 mesocosm experiment in Kongsfjorden, Svalbard, Norway [dataset publication series]. PANGAEA, https://doi.org/10.1594/PANGAEA.769833

Always quote citation above when using data! You can download the citation in several formats below.

RIS CitationBibTeX Citation

Abstract:
Hierarchical clustering. Taxonomic assignment of reads was performed using a preexisting database of SSU rDNA sequences from including XXX reference sequences generated by Sanger sequencing. Experimental amplicons (reads), sorted by abundance, were then concatenated with the reference extracted sequences sorted by decreasing length. All sequences, experimental and referential, were then clustered to 85% identity using the global alignment clustering option of the uclust module from the usearch v4.0 software (Edgar, 2010). Each 85% cluster was then reclustered at a higher stringency level (86%) and so on (87%, 88%,…) in a hierarchical manner up to 100% similarity. Each experimental sequence was then identified by the list of clusters to which it belonged at 85% to 100% levels. This information can be viewed as a matrix with the lines corresponding to different sequences and the columns corresponding to the cluster membership at each clustering level. Taxonomic assignment for a given read was performed by first looking if reference sequences clustered with the experimental sequence at the 100% clustering level. If this was the case, the last common taxonomic name of the reference sequence(s) within the cluster was used to assign the environmental read. If not, the same procedure was applied to clusters from 99% to 85% similarity if necessary, until a cluster was found containing both the experimental read and reference sequence(s), in which case sequences were taxonomically assigned as described above.
Funding:
Seventh Framework Programme (FP7), grant/award no. 211384: European Project on Ocean Acidification
Parameter(s):
#NameShort NameUnitPrincipal InvestigatorMethod/DeviceComment
1Sample IDSample IDBittner, Lucie
2Experimental treatmentExp treatBittner, Lucie
3Experiment dayExp daydayBittner, Lucie
4FractionFractionBittner, Lucie
5Sequence abundancesequence abund#Bittner, Lucie
6Number of sequencesSequence no#Bittner, Lucie
7Fraction of sampleFraction sample#Bittner, Lucie
8KingdomKingdomBittner, Lucie
9PhylumPhylumBittner, Lucie
10ClassClassBittner, Lucie
11OrderOrderBittner, Lucie
12FamilyFamilyBittner, Lucie
Size:
93578 data points

Download Data

Download dataset as tab-delimited text — use the following character encoding:

View dataset as HTML (shows only first 2000 rows)