A recent discussion on Twitter prompted this topic. Let’s explore how Bionomia and GBIF might interact & who might use or benefit from sharing this information about some of the people linked to occurrence records.
There are presently 25M links created or displayed in Bionomia between people with either an ORCID ID (living) or a Wikidata Q number (deceased) for 22M occurrence records from 5547 datasets. The majority of these are created by 166 volunteers or by the collectors/determiners of specimens themselves, but a growing proportion of these are also obtained from source datasets that populate newly ratified Darwin Core terms, recordedByID and identifiedByID. Bionomia obtains specimen (or specimen-like) records from GBIF and refreshes these every two weeks. Here is one example download: Download. The GBIF informatics team created a custom download for this based on the Apache Avro file format. The metadata behind these 25M links indicate what was the action executed – either collected, identified, or both –, who made the link, when the link was created, and when it was modified. There are additional metadata derived from ORCID or Wikidata cached in Bionomia that may be valuable such as aliases of the person and their birth/death dates (deceased, from Wikidata). There are also Frictionless Data downloads for each dataset. Here’s an example for the Canadian Museum of Nature Herbarium. Note that the datasetKey in the URL is the same as that generated by GBIF for its presentation there: Canadian Museum of Nature Herbarium.
Taking a prompt from @trobertson’s Twitter posts, here are questions we might explore:
- Should GBIF be merging in the Bionomia links during indexing time?
- If yes to the above, should GBIF additionally record metadata on the link such as who created it and when it was created? [Aside: perhaps not all links are created equal if we assign more weight to links created by collectors/determiners themselves or by individuals with an affiliation with an organization that publishes the dataset]
- Do publishers to GBIF want these links between people and occurrence records? In what form should this take?
- Do users of GBIF data want these links between people and records? In what form should that take?