Pollen identification through convolutional neural networks: First application on a full fossil pollen sequence.

Autor: Durand M; Département de Géographie, Université de Montréal, Montréal, Québec, Canada., Paillard J; Département de Géographie, Université de Montréal, Montréal, Québec, Canada., Ménard MP; Département de Géographie, Université de Montréal, Montréal, Québec, Canada., Suranyi T; Département de Géographie, Université de Montréal, Montréal, Québec, Canada.; Laboratoire Chrono-Environnement, UMR 6249 CNRS, Université de Franche-Comté, Besançon, France., Grondin P; Direction de la recherche forestière, Ministère des Ressources naturelles et des Forêts, Québec City, Québec, Canada., Blarquez O; Département de Géographie, Université de Montréal, Montréal, Québec, Canada.
Jazyk: angličtina
Zdroj: PloS one [PLoS One] 2024 Apr 30; Vol. 19 (4), pp. e0302424. Date of Electronic Publication: 2024 Apr 30 (Print Publication: 2024).
DOI: 10.1371/journal.pone.0302424
Abstrakt: The automation of pollen identification has seen vast improvements in the past years, with Convolutional Neural Networks coming out as the preferred tool to train models. Still, only a small portion of works published on the matter address the identification of fossil pollen. Fossil pollen is commonly extracted from organic sediment cores and are used by paleoecologists to reconstruct past environments, flora, vegetation, and their evolution through time. The automation of fossil pollen identification would allow paleoecologists to save both time and money while reducing bias and uncertainty. However, Convolutional Neural Networks require a large amount of data for training and databases of fossilized pollen are rare and often incomplete. Since machine learning models are usually trained using labelled fresh pollen associated with many different species, there exists a gap between the training data and target data. We propose a method for a large-scale fossil pollen identification workflow. Our proposed method employs an accelerated fossil pollen extraction protocol and Convolutional Neural Networks trained on the labelled fresh pollen of the species most commonly found in Northeastern American organic sediments. We first test our model on fresh pollen and then on a full fossil pollen sequence totalling 196,526 images. Our model achieved an average per class accuracy of 91.2% when tested against fresh pollen. However, we find that our model does not perform as well when tested on fossil data. While our model is overconfident in its predictions, the general abundance patterns remain consistent with the traditional palynologist IDs. Although not yet capable of accurately classifying a whole fossil pollen sequence, our model serves as a proof of concept towards creating a full large-scale identification workflow.
Competing Interests: The authors have declared that no competing interests exist.
(Copyright: © 2024 Durand et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
Databáze: MEDLINE
Nepřihlášeným uživatelům se plný text nezobrazuje