Machine learning on spectral data from miniature devices for food quality analysis - a case study

Autor: Asharindavida, Fayas, Nibouche, Omar, Uhomoibhi, James, Liu, Jun, Wang, Hui
Jazyk: angličtina
Rok vydání: 2023
Zdroj: Asharindavida, F, Nibouche, O, Uhomoibhi, J, Liu, J & Wang, H 2023, Machine learning on spectral data from miniature devices for food quality analysis-a case study . in ICMLSC '23: Proceedings of the 2023 7th International Conference on Machine Learning and Soft Computing . The International Conference on Machine Learning and Soft Computing (ICMLSC): Proceedings, ACM, pp. 89–95, ICMLSC 2023: The 7th International Conference on Machine Learning and Soft Computing, Chongqing, China, 05/01/2023 . https://doi.org/10.1145/3583788.3583801
Popis: Food quality analysis can be carried out by spectral data acquired from spectrometers with its advantage of non-destructive way of testing. Portable and miniature spectroscopy can be a suitable solution when it meets the specifications such as portability, cost, and short processing time requirements, to enable ordinary citizens to use such a device in the fight against food fraud. Compared to more expensive, bulky, and non-portable devices, the data collected using miniature and portable spectrometers is of a lower quality and thus adversely affect the quality of the analysis. Research have been carried out to use machine learning (ML) classifiers on spectral data analysis for food quality assessment. The present work focuses on two aspects: firstly, preliminary exploratory statistical analysis is conducted on the real spectral data on different food products including oils, fruits and spices acquired from such miniature devices, which aims to evaluate and illustrate the distinctive characteristics of such spectral data, data distribution and difference in the spectra across multiple data acquisitions etc. along with a summary of the key challenges to face and explore. Secondly, a case study for the differentiation of extra virgin olive from adulterated with vegetable oil is provided to analyze and evaluate how some commonly used ML classifiers can be used for classification, while the impact of different preprocessing methods to improve the accuracy and efficiency is also provided. The case study demonstrates the good potential of using data analytics for spectral data from miniature device, although the overall performance of those ML classifiers is not exceptional (the classification rates of up to 83.32%) which is partially due to the quality of data, and partially due to limiting to only some classifiers. More elaborate data pre-processing and cleaning methods can be used to address the key challenges of the spectral data from miniature device, and other types of classifiers can be also explored further in future work.
Databáze: OpenAIRE