Finding citizen science datasets on GBIF - GBIF Data Blog

The short answer is yes, partially.

Citizen science is scientific research conducted, in whole or in part, by amateur (or non-professional) scientists. Citizen science is sometimes described as “public participation in scientific research,” participatory monitoring, and participatory action research (wikipedia definition).


This is a companion discussion topic for the original entry at https://data-blog.gbif.org/post/gbif-citizen-science/
2 Likes

This is very interesting. Around 50% of occurrence records on GBIF are citizen science.

@jwaller - thanks :slight_smile:

cool that you added the machine tags.

Agreed!

I’m wondering if we could use a similarly precise tag for the datasets ID’d in the 2016 study to get a quick look at the overlap…?

1 Like

The 2016 citizen science datasets are already tagged:
I used the datasets from the 2016 study as training set (in addition to the ones I annotated) and I tagged all the datasets annotated as citizen science both from the training set and the prediction.

I could tag the datasets considered citizen science that weren’t included in the 2016 differently, if you are interested.
Otherwise, if you would like to see the evolution of citizen science datasets in GBIF over time, @jwaller is working on a something.

1 Like

Hello - I wonder has there been any continuation on the tagging of citizen science datasets / creation of a list of citizen science datasets? I’ve seen the 2023 blog post from @mgrosjean and others regarding this. The list of citizen science datasets made though seems overly conservative (as pointed out by pserra on that post - also e.g. the Polish Vegetation Database is included although I can’t detect why this would be considered citizen science). Many thanks.

1 Like

Hi @pvichm

My colleague @jwaller has taken over the tagging of citizen science datasets. We have a new field for these tags, you can access the list of citizen science dataset with the API: https://api.gbif.org/v1/dataset/search?category=CitizenScience (you can also use this call to download the list as a TSV file: https://api.gbif.org/v1/dataset/search/export?category=CitizenScience)
You can read about the process of tagging here: GitHub - gbif/dataset-category-management: Creates issues for reviewing GBIF datasets in certain categories · GitHub

Could I ask you where you found the Polish Vegetation Database labelled as citizen science dataset? (As far as I can tell, it doesn’t have the machineTag nor the category) Thanks!

1 Like

Hi @mgrosjean - Thank you for your reply! Great to receive the link, and thank you for flagging where the process of tagging is described, too.

Regarding the Polish Vegetation Database, it was included in the file that downloaded from your April 2023 post here: How to identify citizen science occurrences in GBIF? - #3 by datafixer

Thank you!

1 Like

Thanks @pvichm it looks like the API call in my comment isn’t giving the expected result, I will log the issue.

I have edited the comment on the other post and suggest the approach with the category filter.

Thank you @mgrosjean. From the TSV you sent two comments above: some of the major citizen science datasets are not included (e.g., Artportalen, iNaturalist Research-Grade Observations). Is it possible the API call is acting-up there too? I understand the citizen science tagging can never be complete, but was surprised that datasets from e.g. Chandler et al. were missing from the list. Thank you again!

@pvichm thank you very much for pointing this out. I found a bug wherein some datasets were not getting categories added. I am working on it now.

2 Likes