I remember that a long time ago there was a list (published, if memory serves, as a Google Sheets doc) of all cases where two or more institutions used the same identifier. I can’t recall which of the several collection code documentation initiatives active at the time published it (it was a LONG time ago).
Does an equivalent exist for GRSciColl?
Hi @Circeus
You can download any selection of GRSciColl institutions or collections by clicking on the “Download as TSV” button in the web interface:
Here: Data - GRSciColl or here: Data - GRSciColl
The resulting TSV file will contain the concatenated identifiers for each entry. You can then use that information to find the entries that share an identifier.
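For anyone who wants to script that check, here is a minimal Python sketch of the idea. It makes several assumptions that are not confirmed above: the column names `name` and `identifiers` and the `|` separator are hypothetical placeholders, so check the header row and the cell contents of your own download and adjust the parameters accordingly.

```python
import csv
import io
from collections import defaultdict


def find_shared_identifiers(tsv_text, id_column="identifiers",
                            name_column="name", separator="|"):
    """Return {identifier: [entry names]} for identifiers used by 2+ entries.

    The column names and the separator are assumptions (hypothetical
    defaults); adjust them to match the real export.
    """
    entries_by_identifier = defaultdict(list)
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    for row in reader:
        raw = row.get(id_column) or ""
        # Split the concatenated cell and drop blanks and repeats
        # within the same row, so one entry is counted once per identifier.
        identifiers = {part.strip() for part in raw.split(separator)} - {""}
        for identifier in identifiers:
            entries_by_identifier[identifier].append(row.get(name_column) or "")
    # Keep only identifiers that appear on more than one entry.
    return {
        identifier: sorted(names)
        for identifier, names in entries_by_identifier.items()
        if len(names) > 1
    }


# Made-up sample data, for illustration only.
sample = (
    "name\tidentifiers\n"
    "Herbarium A\tX1|X2\n"
    "Museum B\tX2\n"
    "Museum C\tX3\n"
)
shared = find_shared_identifiers(sample)
print(shared)
```

To run it on a real download, read the file into a string first (for example `open("institutions.tsv", encoding="utf-8").read()`, where the file name is again just a placeholder) and pass that to the function.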
We once tried to coordinate the deduplication of GRSciColl entries in this repository: GitHub - gbif/collections-duplicates. Since then, the community has edited and deduplicated a lot of records, but the corresponding issues in the repository haven’t been closed.
We could repurpose this repository and create new issues if that would be useful. What do you think?
I don’t actually have any particular need for it. I was just genuinely curious in general as to whether such a list existed.