TY - JOUR
T1 - Leveraging machine learning and citizen science data to describe flowering phenology across South Africa
AU - Stewart, Ross D.
AU - Bard, Nicolas W.
AU - van der Bank, Michelle
AU - Davies, T. Jonathan
N1 - Publisher Copyright:
© 2025 The Author(s). Plants, People, Planet published by John Wiley & Sons Ltd on behalf of New Phytologist Foundation.
PY - 2025
Y1 - 2025
N2 - Societal Impact Statement: Recent shifts in flowering times are an index of, and a response to, human driven climate change. However, most information on these flowering changes is heavily skewed to the northern hemisphere. This imbalance limits our understanding of how climate change is affecting ecosystems, including the mismatches of flowering times between species, increased risks from frost or drought, and shifts in crop growing seasons. Here we show how this knowledge gap can be addressed by combining modern machine learning tools with large-scale data collected by citizen scientists. We present a new map showing flowering patterns across South Africa, revealing complex regional differences and wide variation between plant species. Summary: Phenology—the timing of recurring life history events—is strongly linked to climate. Shifts in phenology have important implications for trophic interactions, ecosystem functioning, and community ecology. However, data on plant phenology can be time consuming to collect and current records are biased across space and taxonomy. Here, we evaluate the performance of convolutional neural networks (CNN), a machine learning tool, for classifying flowering phenology on a very large and taxonomically diverse dataset of citizen science images sourced from the iNaturalist database. Our analysis focuses on plants listed in the National Botanical Gardens (NBGs) of South Africa, a country famed for its floristic diversity (approximately 21,000 species), but poorly represented in phenological databases. We trained our CNN on a dataset of 10,000 images, representing several thousand species across >200 families, and applied the trained model to >1.8 million iNaturalist records. We were able to correctly classify images with >90% accuracy. Using metadata associated with each image, we then reconstructed the timing of peak flower production and length of the flowering season across South Africa and within each National Botanical Garden. Our analysis illustrates how machine learning tools can leverage the vast wealth of citizen science biodiversity data to describe large-scale phenological dynamics. We suggest such approaches may be particularly valuable where data on plant phenology is currently lacking.
AB - Societal Impact Statement: Recent shifts in flowering times are an index of, and a response to, human driven climate change. However, most information on these flowering changes is heavily skewed to the northern hemisphere. This imbalance limits our understanding of how climate change is affecting ecosystems, including the mismatches of flowering times between species, increased risks from frost or drought, and shifts in crop growing seasons. Here we show how this knowledge gap can be addressed by combining modern machine learning tools with large-scale data collected by citizen scientists. We present a new map showing flowering patterns across South Africa, revealing complex regional differences and wide variation between plant species. Summary: Phenology—the timing of recurring life history events—is strongly linked to climate. Shifts in phenology have important implications for trophic interactions, ecosystem functioning, and community ecology. However, data on plant phenology can be time consuming to collect and current records are biased across space and taxonomy. Here, we evaluate the performance of convolutional neural networks (CNN), a machine learning tool, for classifying flowering phenology on a very large and taxonomically diverse dataset of citizen science images sourced from the iNaturalist database. Our analysis focuses on plants listed in the National Botanical Gardens (NBGs) of South Africa, a country famed for its floristic diversity (approximately 21,000 species), but poorly represented in phenological databases. We trained our CNN on a dataset of 10,000 images, representing several thousand species across >200 families, and applied the trained model to >1.8 million iNaturalist records. We were able to correctly classify images with >90% accuracy. Using metadata associated with each image, we then reconstructed the timing of peak flower production and length of the flowering season across South Africa and within each National Botanical Garden. Our analysis illustrates how machine learning tools can leverage the vast wealth of citizen science biodiversity data to describe large-scale phenological dynamics. We suggest such approaches may be particularly valuable where data on plant phenology is currently lacking.
KW - citizen science
KW - climate change
KW - convolutional neural networks
KW - iNaturalist
KW - machine learning
KW - phenology
UR - https://www.scopus.com/pages/publications/105009347234
U2 - 10.1002/ppp3.70059
DO - 10.1002/ppp3.70059
M3 - Article
AN - SCOPUS:105009347234
SN - 2572-2611
JO - Plants People Planet
JF - Plants People Planet
ER -