Darwin Core Half-Million - UPDATE

sharif.islam · November 2, 2022, 9:09pm

We have momentum and interests about DES and the services we are piloting. This will take some time. In the next few months we will be talking with GBIF and iDigBio to see how we can broaden our scope for the pilot.

The community annotation and curation services can provide the following advantages:

The institutions and the CMS can keep their infrastructure and data model but still take advantage of new annotations and enrichment (of course, some adjustments need to be made on the CMS side to receive the data).
Introduction of the persistent identifiers at the digital specimen and annotation level provides granularity and linking of different digital objects.
These objects can provide the base for large scale data quality checks and annotation services. Here’s a test annotation digital object with a PID. Serialisation of such records (JSON or JSON-LD form) can be fed back to the CMS or other systems. Some of the basic data checks and annotations can easily be automated.
We can also use these identifiers for data citation, attribution (we are also thinking about authentication, authorisation, trust and verification methods for these annotation objects which also will not be easy).

However, we still need a few basic things in place.

As @abentley already pointed out – collections staff and the museums do not have the capacity to do some of these data clean up tasks. Automating and opening up the records will help. And it is easy to say FAIR this and FAIR that. But most museums do not have proper data steward roles and relevant data management training. DiSSCo will help with some of these capacity building but each institutions and the funding agencies need to support more data management tasks. With these training and capacity as foundation, we will start seeing the benefit of community curation or annotation at scale.

Topic		Replies	Views
Filtering isn't cleaning Data Use	22	1418	October 3, 2023
The strange case(s) of the missing identity Miscellaneous	23	286	September 8, 2024
Collections catalogue (GRBio) Miscellaneous	52	6512	June 28, 2020
About the Data Publishing category Data Publishing	1	1258	May 3, 2018
Traceability and version control when publishing a curated regional occurrence dataset with mixed original and previously published records Data Publishing data-quality	13	73	May 13, 2026

Darwin Core Half-Million - UPDATE

Related topics