dc.contributor.author
Haubrock, Phillip J.
dc.contributor.author
Soto, Ismael
dc.contributor.author
Macêdo, Rafael L.
dc.date.accessioned
2025-09-05T06:20:09Z
dc.date.available
2025-09-05T06:20:09Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/49090
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-48813
dc.description.abstract
1. The preservation of global biodiversity has become challenging due to intensifying anthropogenic pressures. This study addresses the complex challenges associated with long-term monitoring data (i.e. missing years and gap filling) on the accuracy of temporal biodiversity trends.
2. Here, we analysed over 20 years of annual river macroinvertebrate data, simulating missing entries and applying imputation methods with linear and non-linear models to fill gaps. Our findings show that increasing numbers of gaps lead to increased trend variability, lower Akaike Information Criterion scores and higher standard deviation in model-explained deviance, thus suggesting that models fit more easily to datasets with more missing values due to fewer data constraints, while also introducing greater uncertainty and unexplained variability in the inferred trends.
3. When evaluating different gap-filling algorithms, we found that their performance varied considerably, contributing to increased uncertainty in trend estimates. Random Forest Imputations and Random Sample from Observed Values performed best, introducing less variation and aligning more closely with the original trends, whereas Predictive Mean Matching and its weighted variant amplified deviations, particularly with increasing gaps. Importantly, even a small number of missing or imputed values could, in some cases, reverse the trend direction, highlighting the risk of misinterpretation from seemingly minor data loss.
4. Synthesis and applications. In the current era of large-scale biodiversity monitoring, our study highlights the risks of missing data and the need for cautious imputation. We show that, in many cases, retaining gaps may lead to more accurate trend estimates than imputing data. When imputation is unavoidable, methods, such as Random Forest and Random Sampling from observed values performed relatively well in our macroinvertebrate richness case study. However, the choice to impute as well as the method used should be evaluated in light of the biodiversity metric and the type of trend being analysed.
en
dc.format.extent
18 Seiten
dc.rights.uri
https://creativecommons.org/licenses/by/4.0/
dc.subject
biodiversity monitoring
en
dc.subject
biodiversity trends
en
dc.subject
missing data
en
dc.subject
time-series analysis
en
dc.subject.ddc
500 Naturwissenschaften und Mathematik::570 Biowissenschaften; Biologie::570 Biowissenschaften; Biologie
dc.title
To fill or not to fill: Comparing imputation methods for improved riverine long-term biodiversity monitoring
dc.type
Wissenschaftlicher Artikel
dcterms.bibliographicCitation.doi
10.1111/1365-2664.70110
dcterms.bibliographicCitation.journaltitle
Journal of Applied Ecology
dcterms.bibliographicCitation.number
9
dcterms.bibliographicCitation.pagestart
2421
dcterms.bibliographicCitation.pageend
2438
dcterms.bibliographicCitation.volume
62
dcterms.bibliographicCitation.url
https://doi.org/10.1111/1365-2664.70110
refubium.affiliation
Biologie, Chemie, Pharmazie
refubium.affiliation.other
Institut für Biologie

refubium.resourceType.isindependentpub
no
dcterms.accessRights.openaire
open access
dcterms.isPartOf.eissn
1365-2664
refubium.resourceType.provider
WoS-Alert