A comparison of statistical methods for deriving occupancy estimates from machine learning outputs.

Journal: Scientific reports

PMID: 40289178

Abstract

The combination of autonomous recording units (ARUs) and machine learning enables scalable biodiversity monitoring. These data are often analysed using occupancy models, yet methods for integrating machine learning outputs with these models are rarely compared. Using the Yucatán black howler monkey as a case study, we evaluated four approaches for integrating ARU data and machine learning outputs into occupancy models: (i) standard occupancy models with verified data, and false-positive occupancy models using (ii) presence-absence data, (iii) counts of detections, and (iv) continuous classifier scores. We assessed estimator accuracy and the effects of decision threshold, temporal subsampling, and verification strategies. We found that classifier-guided listening with a standard occupancy model provided an accurate estimate with minimal verification effort. The false-positive models yielded similarly accurate estimates under specific conditions, but were sensitive to subjective choices including decision threshold. The inability to determine stable parameter choices a priori, coupled with the increased computational complexity of several models (i.e. the detection-count and continuous-score models), limits the practical application of false-positive models. In the case of a high-performance classifier and a readily detectable species, classifier-guided listening paired with a standard occupancy model provides a practical and efficient approach for accurately estimating occupancy.

Authors

Lydia K D Katsis

School of Geography and Environmental Science, University of Southampton, Southampton, UK. L.K.D.Katsis@soton.ac.uk.
Tessa A Rhinehart

Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, USA.
Elizabeth Dorgay

Ya'axché Conservation Trust, Punta Gorda Town, Toledo District, Belize City, Belize.
Emma E Sanchez

Panthera-Belize, Panthera Wild Cat Conservation Belize, 5189 Monday Morning Ave., Cayo District, Belmopan City, Belize.
Jake L Snaddon

School of Geography and Environmental Science, University of Southampton, Southampton, UK.
C Patrick Doncaster

School of Biological Sciences, University of Southampton, Southampton, UK.
Justin Kitzes

Department of Biological Science, University of Pittsburgh, 4200 Fifth Ave, Pittsburgh, PA, 15260, USA.

Keywords

Animals Biodiversity Machine Learning Models, Statistical

External Resources

View on PubMed Access via DOI PubMed (40289178)

A comparison of statistical methods for deriving occupancy estimates from machine learning outputs.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals