eDNA sequence annotated only as Eukaryota - include or exclude for GBIF occurrence?

daphnehoh · April 13, 2023, 2:18am

Hello everyone!

I am currently in the process of preparing an occurrence dataset from an eDNA project. However, during species classification, I have encountered some OTU sequences that were only identified up to the domain level - Eukaryota, with no lower-level taxon rank available.

I have noticed that the GBIF backbone does not include the ‘domain’ taxon rank, and I am seeking suggestions on how to proceed with these occurrences. Should I exclude them and only keep those OTUs (and their corresponding occurrences) that have been identified up to at least the kingdom level?

cecsve · April 13, 2023, 8:16am

Hi,

You can include them in the dataset, and if you include sequences as well, then potential future users may be able to re-annotate using an updated database. The interpreted value for scientificName would show up as ‘Incertae sedis’ but the original (verbatim) value would still be associated with each record.

How specific was the primer you used? Could you with relative certainty say it should be kingdom level ‘Animalia’ for example? Or do you expect that some sequences may belong to other kingdoms?

tfroeslev · April 13, 2023, 8:34am

I would add this modification to Cecilies comment: You should preferably include all OTUs and their sequences (using the dna-derived dwc extension). That will make the data interoperable across datasets, with or without annotation.

daphnehoh · April 13, 2023, 8:58am

Thanks @cecsve and @tfroeslev for the suggestions and comments!

We used COI gene, which isn’t specific to any kingdom really, and those annotated ones in the dataset now contains kingdom Animalia, Chromista, Fungi, Plantae and Protozoa - which is actually quite diverse.

I will proceed with including all OTUs as occurrence along with their DNA sequences.

system · May 13, 2023, 6:59pm

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
iNaturalist database not up-to-date?	4	570	February 24, 2022
Sequenced-based data on GBIF - What you need to know before analyzing data - GBIF Data Blog data-blog	5	14260	May 2, 2019
Sibling datasets to overcome star schema limitation Data Publishing	7	729	June 20, 2022
Are taxon exclusions in search possible? Data Use	2	464	October 23, 2023
Investigating taxonomic issues on GBIF.org Data Publishing NodesSupportHour	6	182	February 13, 2025

eDNA sequence annotated only as Eukaryota - include or exclude for GBIF occurrence?

Related topics