Assessing the quality of AI information from ChatGPT regarding oral surgery, preventive dentistry, and oral cancer: An exploration study.

Autor: Alsayed AA; Sijam Dental Centre, Riyadh, Saudi Arabia., Aldajani MB; Department of Paediatric Dentistry, Faculty of Dentistry, King Abdulaziz University, Jeddah, Saudi Arabia., Aljohani MH; Oral and Maxillofacial Diagnostic Sciences, College of Dentistry, Taibah University, Madinah city, Saudi Arabia., Alamri H; Department of Preventive Dentistry, College of Dentistry, Majmaah University, Al Majmaah, Saudi Arabia., Alwadi MA; Department of Dental Health, College of Applied Medical Sciences, King Saud University, Riyadh, Saudi Arabia., Alshammari BZ; Ministry of Health, Qassim, Saudi Arabia., Alshammari FR; College of Dentistry, University of Ha'il, Ha'il city, Saudi Arabia.
Jazyk: angličtina
Zdroj: The Saudi dental journal [Saudi Dent J] 2024 Nov; Vol. 36 (11), pp. 1483-1489. Date of Electronic Publication: 2024 Sep 12.
DOI: 10.1016/j.sdentj.2024.09.009
Abstrakt: Aim: Evaluation of the quality of dental information produced by the ChatGPT artificial intelligence language model within the context of oral surgery, preventive dentistry, and oral cancer.
Methodology: This study adopted quantitative methods approach. The experts prepared 50 questions (including dimensions of, risk factors, preventive measures, diagnostic methods, and treatment options) that would be presented to ChatGPT, and its responses were rated for their accuracy, completeness, relevance, clarity or comprehensibility, and possible risks using a standardized rubric. To carry out the assessment of the responses by ChatGPT, a standardized scoring rubric was used. Evaluation process included feedback concerning the strengths, weaknesses, and potential areas of improvement in the responses provided by ChatGPT.
Results: While achieving the highest score for preventive dentistry at 4.3/5 and being able to communicate the complex information coherently, the tool showed lower accuracy for oral surgery and oral cancer, scoring 3.9/5 and 3.6/5, respectively, with several gaps for post-operative instructions, personalized risk assessments, and specialized diagnostic methods. Potential risks, such as a lack of individualized advice, were shown in 53% of the oral cancer and in 40% of the oral surgery. While showing promise in some domains, ChatGPT had important limitations in specialized areas that require nuanced expertise.
Conclusion: The findings point to the need for professional supervision while using AI-generated information and ongoing evaluation as capabilities evolve, for the assurance of responsible implementation in the best interest of patient care.
Competing Interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
(© 2024 THE AUTHORS.)
Databáze: MEDLINE