1.1. Directory to support the collections community (USE)

dhobern · March 27, 2020, 9:00am

This is topic 1.1. in the Uses section of the Advancing the Catalogue of the World’s Natural History Collections consultation. Use this topic to discuss the questions listed below.

Background
Collections staff and taxonomists collaborate as a truly global community. Valuable specimens are distributed between institutions in all parts of the world. Researchers visit these collections or borrow specimens as part of their work. Index Herbariorum (IH) is the directory of information on the world’s herbaria (addresses, contacts, specialties, size, etc.). It is a well-managed resource and highly regarded as a tool by the botanical community. No full equivalent exists globally for other natural history collections, although national/regional infrastructures such as the Atlas of Living Australia (ALA) Collectory, the iDigBio US Collections List, and the CETAF Registry and Profiles serve similar roles. GBIF has recently integrated the Global Registry of Scientific Collections (GRSciColl) into its registry as a framework that can be extended with richer information curated by collections communities.

Other materials

The following contributed materials are particularly relevant to this topic:

Questions
The focus here is on the catalogue as a directory of known institutions and information required to contact them.

Would the collections community benefit from a comprehensive directory of all natural history collections?
Who would make use of such a directory?

dshorthouse · April 16, 2020, 2:46pm

At the risk of distracting from the intent of these questions, what is the collections community?

If the answer to the first question is a resounding no from some members of the community but not others, what then? Comprehensiveness is a noble goal for the administrators of the registry, useful for someone looking to request a loan, but largely meaningless to a collection manager.

Debbie · April 16, 2020, 7:47pm

Great point, both to “what is the collections community” and the implied, what do we mean by a collection. For starters anyway, the ideas paper does discuss this somewhat. It seems our intent to start with natural science collections, then broaden the scope from there (living collections, zooarchaeological collections, …). Certainly, in many (most) cases the data for such a catalog is going to come from those working in the collections like the collection manager you reference. They will have an interest in the data they need for their work, but what other data might they be able to supply (if they had incentive/time)? And who would make use of that data?

dshorthouse · April 16, 2020, 8:16pm

Surely all this is known through the enduring efforts of IH and the hard lessons learned from GRBio. In start-up mode until there’s a sustainable model, is it not more about the champion in the community who (with a small team) has the drive to see it through than it is about the feature-set, incentives, or who will use the data? Those latter considerations are all nice-to-haves but I expect most of it can be accomplished through responsive, agile development than fleshing out the requirements upfront only to later find or nominate the right team to build it.

Hongfeng-Wang · April 18, 2020, 12:07am

非常能受益。机构管理者在制定发展方向、申请资助、说服公众此机构的重要性、制定吸引研究者等工作中都非常需要此信息。

Debbie · April 18, 2020, 1:14am

欢迎 Welcome Honfeng-Wang. Thanks for your insights about who will benefit and the use cases for those stakeholders.

qgroom · April 18, 2020, 8:10am

It has recognized for a long time that there is a need to develop knowledge on biodiversity in biodiverse countries, particularly in the tropics. Their collections are an important component of that. Yet these collections could remain invisible to the rest of the scientific world; unused and likely to be lost. Being part of such a directory is just one of the many things we can do to make all collections useful to science. It makes it clear that they are part of global biodiversity infrastructure.

Along the same lines one would hope that the directory would be a hub for communication about collections, such as best practices, events, funding etc.

Maofang · April 18, 2020, 4:35pm

In Hongfeng Wang’s post, he said:

The catalogue would be very beneficial for collections community, especially for policy-makers from institutions, who will need it to formulate the directions of development, to apply for funding, to convince the public of the importance of the institution, and to develop ways to attract researchers, etc.

JessUtrup · April 20, 2020, 1:29pm

It would also be useful, I think, to have the acronyms for these institutions and their collections included.

tkarim · April 20, 2020, 3:59pm

Yes! We don’t have a good way to leverage our collective resources right now in terms of policy, funding, etc. A catalogue is the obvious first place to start.

dhobern · April 20, 2020, 11:32pm

Thanks @JessUtrup - Sorry if some of those acronyms were unclear - the text at the start of this page comes from the Ideas Paper where they are all spelled out and there are links to the relevant sites. I have edited this page so it expands some of these abbreviations and links either to presentations on this site or to the relevant project sites. I hope this helps.

JessUtrup · April 21, 2020, 12:21am

Sorry, I was unclear. I thought it would be useful in a directory of institutions to also include their acronym. I have struggled when working with museum specimens sometimes to determine what other museums may have related material if the label only has an acronym. I would love to see a central listing of the acronyms for all institutions. For example, I work in invertebrate paleontology at the Yale Peabody Museum, so we’re YPM or YPM-IP.

dhobern · April 21, 2020, 12:52am

My mistake! Of course, you are most definitely correct. This has traditionally been one of the places where cataloguing efforts start as it helps curators and taxonomists make and use references to collections. It has become even more important in the digital age with such acronyms being offered as pseudo-stable identifiers in specimen records, etc.

abentley · April 22, 2020, 9:37pm

Such a catalogue would benefit the collections community by providing us with a sense of community and an ability to articulate that community for all sorts of reasons - funding, scope, governance, metrics, etc. It will also provide us with a much needed articulation of the non-digitized components of collections that are otherwise not articulated through digitized catalogs published to aggregators. It will also allow individual collections to contact each other and collaborate in new and exciting ways.

abentley · April 22, 2020, 9:53pm

Should have both institutional and collection (and maybe even dataset) acronyms as well as a synonymy system to include old/orphan/unused codes. This would be useful to the collections and research communities if it provided an easy to use lookup system for acronyms to be used in publication - or even an automated system of populating a publication with a list.

elyw · April 24, 2020, 5:38am

Getting back to the original issue: “focus here is on the catalogue as a directory of known institutions and information required to contact them”
Having a directory of known institutions including contact details is certainly worth achieving, and has been done with Index Herbariorum. We can’t dismiss the worth of such a directory as ‘only being good for those who want to borrow material’. Managing loans, exchanges and sending specimens for destructive testing is a significant part of any collection manager’s duties.
The two points I’d like to add:

users need to be able to manage their own institution’s information (i.e. the directory needs to be editable)
if a key use case for the structure of the directory is to support collection managers then ensuring that they are involved making sure that the directory has the fields they need will be important.

clauden · April 24, 2020, 12:59pm

Sounds similar to @elyw comment, on IT compared to collections people for choosing/using data identifiers.

Some users might not know to do directory edits, or how to choose their institutions info., acronym, etc., or it may already be done (and in differing versions/places) by others. Collections people need to be involved with data people, to report and modify (including updates and corrections) directories.

waddink · April 24, 2020, 2:06pm

@elyw would it not make more sense to just use an existing directory like ROR/GRID? That has almost 100.000 research organisations and most institutes are in there already. Then all you need to do is adding the institute identifier (ROR ID) to the collection descriptions and you have a list of institutes with their collections. Up to date information like the institute name, acronym, location and website is then provided as well as links to e.g. wikidata and wikipedia that can provide additional information about the institute.

trobertson · April 24, 2020, 3:22pm

Registries like ROR will surely be the future or at least part of it. Is it ready to use now as the authority do you know? I ask because their own information suggests they are still exploring the processes around data curation and the subsequent lifecycle and we’d need to assess if the content is there now to support our needs. For example, Copenhagen Zoology Museum seemingly not registered but perhaps there are legal aspects and it falls under the university. We may need to consider if it will be accepted/adopted globally in time.

waddink · April 24, 2020, 4:27pm

@trobertson that is a good question. I would say yes, if you take both ROR and Grid into account. See also my blog post
https://dissco.tech/2020/04/11/identifiers-for-our-institutes-grid-and-ror/

ROR on its own is indeed not yet fully developed. It also does not yet have all the metadata that Grid has, but they are working on that. It does have all the records though and they keep the two in sync, at least for now. so there is a 1:1 relationship. The metadata is public domain in both. Grid is the mature brother that has the processes around data curation in place. However it is provided by a company, and may therefore not something to rely on in the long term. Therefore Ror was established, which is community driven and has a pretty strong support from the scholarly research community. I am not entirely confident about their business model though. But even if they fail or do not grow beyond providing an API on top of Grid, you can always fall back on Grid.

All institutes that are part of a RI or deliver data to a science infrastructure like GBIF can be seen as a research organization and are therefore in scope.

However some institutes do not have an identifier yet because they are part of a university that was then given the identifier. They can however ask for their own identifier and we did not experience any hurdles with that with the Synth+ partners in ELViS. Parent-child relations for organisation’s are supported.

The nat hist museum of Copenhagen is here: https://grid.ac/institutes/grid.507616.3 or https://ror.org/040ck2b86

I think Ror id was not yet synchronized back with grid but ror already got the grid id (grid is updated only every three months). They only applied for a grid identifier, a few months ago and already automatically got a Ror id.

Kind regards,

Wouter

Topic		Replies	Views
Collections catalogue (GRBio) Miscellaneous	52	6450	June 28, 2020
Preferred identifiers for GRSciColl entries - Should we mint DOIs for collections? Global Registry of Scientific Collections	17	937	February 11, 2025
Darwin Core Half-Million - UPDATE Data Publishing	11	1187	December 8, 2022
Technical Support hour for Nodes - Global Registry of Scientific Collections (GRSciColl) Data Publishing NodesSupportHour	7	862	July 16, 2023
Attributing work done (Data Attribution) Miscellaneous	19	1813	March 27, 2021

1.1. Directory to support the collections community (USE)

Related topics