Zero-shot evaluation of ChatGPT for food named-entity recognition and linking.

Autor: Ogrinc M; Jožef Stefan International Postgraduate School, Ljubljana, Slovenia.; Department of Computer Systems, Jožef Stefan Institute, Ljubljana, Slovenia., Koroušić Seljak B; Department of Computer Systems, Jožef Stefan Institute, Ljubljana, Slovenia., Eftimov T; Department of Computer Systems, Jožef Stefan Institute, Ljubljana, Slovenia.
Jazyk: angličtina
Zdroj: Frontiers in nutrition [Front Nutr] 2024 Aug 13; Vol. 11, pp. 1429259. Date of Electronic Publication: 2024 Aug 13 (Print Publication: 2024).
DOI: 10.3389/fnut.2024.1429259
Abstrakt: Introduction: Recognizing and extracting key information from textual data plays an important role in intelligent systems by maintaining up-to-date knowledge, reinforcing informed decision-making, question-answering, and more. It is especially apparent in the food domain, where critical information guides the decisions of nutritionists and clinicians. The information extraction process involves two natural language processing tasks named entity recognition-NER and named entity linking-NEL. With the emergence of large language models (LLMs), especially ChatGPT, many areas began incorporating its knowledge to reduce workloads or simplify tasks. In the field of food, however, we noticed an opportunity to involve ChatGPT in NER and NEL.
Methods: To assess ChatGPT's capabilities, we have evaluated its two versions, ChatGPT-3.5 and ChatGPT-4, focusing on their performance across both NER and NEL tasks, emphasizing food-related data. To benchmark our results in the food domain, we also investigated its capabilities in a more broadly investigated biomedical domain. By evaluating its zero-shot capabilities, we were able to ascertain the strengths and weaknesses of the two versions of ChatGPT.
Results: Despite being able to show promising results in NER compared to other models. When tasked with linking entities to their identifiers from semantic models ChatGPT's effectiveness falls drastically.
Discussion: While the integration of ChatGPT holds potential across various fields, it is crucial to approach its use with caution, particularly in relying on its responses for critical decisions in food and bio-medicine.
Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
(Copyright © 2024 Ogrinc, Koroušić Seljak and Eftimov.)
Databáze: MEDLINE