A Benford’s law-based framework to determine the threshold of occurrence sites for species distribution modelling from ecological monitoring databases

Taeyong Shim, Zhonghyun Kim, Jinho Jung

Research output: Contribution to journalArticlepeer-review

Abstract

The use of data-based species distribution models (SDMs) has increased significantly in recent years. However, studies of determining the minimum requirements of occurrence sites from ecological monitoring datasets used in species distribution modelling remain insufficient. Therefore, this study proposed a framework to determine the threshold of minimum occurrence sites for SDMs by assessing compliance with Benford’s law. The compliance test verified that the national-scale freshwater fish monitoring dataset was natural and reliable. Results derived from true skill statistics (TSS) determined the minimum number of occurrence sites for reliable species distribution modelling was 20 with a TSS value of 0.793 and an overall accuracy of 0.804. The Benford compliance test has shown to be a useful tool for swift and efficient evaluation of the reliability of species occurrence datasets, or the determination of the threshold of occurrence sites before species distribution modelling. Further studies regarding the evaluation of this method’s transferability to other species and validation using SDM performance are required. Overall, the framework proposed in this study demonstrates that Benford compliance test applied to species monitoring datasets can be used to derive a universal and model-independent minimum occurrence threshold for SDMs.

Original languageEnglish
Article number16777
JournalScientific reports
Volume13
Issue number1
DOIs
Publication statusPublished - 2023 Dec

Bibliographical note

Publisher Copyright:
© 2023, Springer Nature Limited.

ASJC Scopus subject areas

  • General

Fingerprint

Dive into the research topics of 'A Benford’s law-based framework to determine the threshold of occurrence sites for species distribution modelling from ecological monitoring databases'. Together they form a unique fingerprint.

Cite this