The same Occurrence in different Materials of Citations (Books)

skobel2015 · September 14, 2022, 1:23pm

Hi.

We will planned to digitalize occurrences from books. We were worked with department and we were included occurrences about rare species in oboth books (First book was about total list of flora of kurgans, second book was about rare species of region at different territories). So, some occurrences in both books include duplicate (not all, no). Bigger part of information include other occurrences. But we dont want to lose part of occurrences from second book (in this case it will look like incomplete part of book). We will want to made citations to second dataset from first published dataset and also save references from second book.

How I should do made refences on second dataset? Which one of solution is better? Reference to first dataset (or made it for duplicates from first dataset?)
Which one term of DwC is better to will use for reference of first dataset (or duplicates)?
Which one occurrence ID will be for duplicates?
Did You know examples of similar dataset`s?

I would be grateful for the answer

mgrosjean · September 20, 2022, 8:04am

Hi @skobel2015

There is no ideal solution.

If possible I would advise to use the same identifiers for corresponding occurrences between the two datasets. For example, if two occurrence corresponds to the same observations and are published in two different datasets, you can use the same occurrenceID. This should allow our clustering algorithm to put those occurrences together (see this example: Occurrence Detail 2871636339). Note that this would only work of the two occurrences are published in different datasets as our clustering function doesn’t work within datasets.

In addition to that, you could also use the associatedOccurrences and/or resource relationship extension

See more information about how our clustering algorithm works here: Identifying potentially related records - How does the GBIF data-clustering feature work? - GBIF Data Blog

Alternatively, you could also combine the occurrences into one.

system · October 20, 2022, 6:04pm

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How GBIF identifies related occurrence records (GBIF technical support hour for Nodes) Data Publishing NodesSupportHour	2	887	December 14, 2023
Sibling datasets to overcome star schema limitation Data Publishing	7	805	June 20, 2022
Identifying potentially related records - How does the GBIF data-clustring feature work? - GBIF Data Blog Data blog	19	7847	June 1, 2023
Duplicate occurrence records Data Publishing	3	949	January 7, 2023
Sibling datasets to overcome DwCArchive star schema limitation(2) Data Publishing	3	725	August 28, 2022

The same Occurrence in different Materials of Citations (Books)

Related topics