Automatic classification of literature in systematic reviews on food safety using machine learning.

Autor: van den Bulk LM; Wageningen Food Safety Research, Akkermaalsbos 2, 6708, WB, Wageningen, the Netherlands., Bouzembrak Y; Wageningen Food Safety Research, Akkermaalsbos 2, 6708, WB, Wageningen, the Netherlands., Gavai A; Wageningen Food Safety Research, Akkermaalsbos 2, 6708, WB, Wageningen, the Netherlands., Liu N; Wageningen Food Safety Research, Akkermaalsbos 2, 6708, WB, Wageningen, the Netherlands., van den Heuvel LJ; Wageningen Food Safety Research, Akkermaalsbos 2, 6708, WB, Wageningen, the Netherlands., Marvin HJP; Wageningen Food Safety Research, Akkermaalsbos 2, 6708, WB, Wageningen, the Netherlands.
Jazyk: angličtina
Zdroj: Current research in food science [Curr Res Food Sci] 2021 Dec 26; Vol. 5, pp. 84-95. Date of Electronic Publication: 2021 Dec 26 (Print Publication: 2022).
DOI: 10.1016/j.crfs.2021.12.010
Abstrakt: Systematic reviews are used to collect relevant literature to answer a research question in a way that is clear, thorough, unbiased and reproducible. They are implemented as a standard method in the domain of food safety to obtain a literature overview on the state-of-the-art research related to food safety topics of interest. A disadvantage to systematic reviews, however, is that this process is time-consuming and requires expert domain knowledge. The work reported here aims to reduce the time needed by an expert to screen all possible relevant articles by applying machine learning techniques to classify the articles automatically as either relevant or not relevant. Eight different machine learning algorithms and ensembles of all combinations of these algorithms were tested on two different systematic reviews on food safety (i.e. chemical hazards in cereals and leafy greens). The results showed that the best performance was obtained by an ensemble of naive Bayes and a support vector machine, resulting in an average decrease of 32.8% in the amount of articles the expert has to read and an average decrease in irrelevant articles of 57.8% while keeping 95% of the relevant articles. It was concluded that automatic classification of the literature in a systematic literature review can support experts in their task and save valuable time without compromising the quality of the review.
Competing Interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
(© 2021 The Authors.)
Databáze: MEDLINE