Modelling imperfect presence data obtained by citizen science

, , , , , , , , , , Davis, Jacqueline, & Hunter, Vanessa (2017) Modelling imperfect presence data obtained by citizen science. Environmetrics, 28(5), Article number: e2446 1-29.

View at publisher

Description

There is growing awareness about the potential benefit of harnessing citizen science for research, particularly in the biological and environmental sciences. Data quality is a major constraint in the use of citizen-science data, in particular, imperfect observations. In this paper, we fit species distribution models to presence-only data (presences and counts, with no absences observed) by exploiting the uncertainty in reported presences, instead of generating pseudo-absences as is common in previous presence-only studies. This approach allowed us to extend the suite of models to include those commonly fit to presence/absence and abundance data. We fit several models to a case study data set of jaguar encounters reported by citizens in the Peruvian Amazon. The true species distribution for the case study data is unknown, and thus we also undertake an extensive simulation study to evaluate model performance. We analyze the sources of error by studying the bias and variance of the models and discuss the predictive performance of each model and its ability to recover the true species distribution. The simulation study shows that, although several approaches are capable of recovering the species distribution, the choice of a modelling approach is a complex one and depends on factors such as inferential aim, model complexity, sample size, and computational resources. This study also addresses some issues in dealing with compound-imperfect observations arising from citizen-science data, and we discuss further steps needed in this research area.

Impact and interest:

20 citations in Scopus
19 citations in Web of Science®
Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

ID Code: 109065
Item Type: Contribution to Journal (Journal Article)
Refereed: Yes
ORCID iD:
Mengersen, Kerrieorcid.org/0000-0001-8625-9168
Peterson, Erinorcid.org/0000-0003-2992-0372
Clifford, Samuelorcid.org/0000-0002-3774-3882
Ye, Nanorcid.org/0000-0001-5971-9202
Brown, Rossorcid.org/0000-0003-0813-7741
Vercelloni, Julieorcid.org/0000-0001-5227-014X
Measurements or Duration: 29 pages
Keywords: Citizen Science, Expert Elicitation, Jaguar Preservation, Uncertainty Modelling, Virtual Reality
DOI: 10.1002/env.2446
ISSN: 1180-4009
Pure ID: 33233109
Divisions: Past > Institutes > Institute for Future Environments
Past > QUT Faculties & Divisions > Science & Engineering Faculty
Current > Research Centres > ARC Centre of Excellence for Mathematical & Statistical Frontiers (ACEMS)
Copyright Owner: Consult author(s) regarding copyright matters
Copyright Statement: This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons License (or other specified license) then refer to the Licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright please provide details by email to qut.copyright@qut.edu.au
Deposited On: 18 Jul 2017 09:03
Last Modified: 25 Oct 2025 09:55