GPT-4 as a Source of Patient Information for Anterior Cervical Discectomy and Fusion: A Comparative Analysis Against Google Web Search.

Autor:	Mastrokostas PG; College of Medicine, State University of New York (SUNY) Downstate, Brooklyn, NY, USA., Mastrokostas LE; Brooklyn College of the City University of New York, Brooklyn, NY, USA., Emara AK; Department of Orthopaedic Surgery, Cleveland Clinic, Cleveland, OH, USA., Wellington IJ; Department of Orthopaedic Surgery, University of Connecticut, Hartford, CT, USA., Ginalis E; Department of Neurosurgery, Rutgers University, Newark, NJ, USA., Houten JK; Department of Neurosurgery, Mount Sinai School of Medicine, New York, NY, USA., Khalsa AS; Department of Orthopaedic Surgery, University of Pennsylvania, Philadelphia, PA, USA., Saleh A; Department of Orthopaedic Surgery, Maimonides Medical Center, Brooklyn, NY, USA., Razi AE; Department of Orthopaedic Surgery, Maimonides Medical Center, Brooklyn, NY, USA., Ng MK; Department of Orthopaedic Surgery, Maimonides Medical Center, Brooklyn, NY, USA.
Jazyk:	angličtina
Zdroj:	Global spine journal [Global Spine J] 2024 Nov; Vol. 14 (8), pp. 2389-2398. Date of Electronic Publication: 2024 Mar 21.
DOI:	10.1177/21925682241241241
Abstrakt:	Study Design: Comparative study. Objectives: This study aims to compare Google and GPT-4 in terms of (1) question types, (2) response readability, (3) source quality, and (4) numerical response accuracy for the top 10 most frequently asked questions (FAQs) about anterior cervical discectomy and fusion (ACDF). Methods: "Anterior cervical discectomy and fusion" was searched on Google and GPT-4 on December 18, 2023. Top 10 FAQs were classified according to the Rothwell system. Source quality was evaluated using JAMA benchmark criteria and readability was assessed using Flesch Reading Ease and Flesch-Kincaid grade level. Differences in JAMA scores, Flesch-Kincaid grade level, Flesch Reading Ease, and word count between platforms were analyzed using Student's t-tests. Statistical significance was set at the .05 level. Results: Frequently asked questions from Google were varied, while GPT-4 focused on technical details and indications/management. GPT-4 showed a higher Flesch-Kincaid grade level (12.96 vs 9.28, P = .003), lower Flesch Reading Ease score (37.07 vs 54.85, P = .005), and higher JAMA scores for source quality (3.333 vs 1.800, P = .016). Numerically, 6 out of 10 responses varied between platforms, with GPT-4 providing broader recovery timelines for ACDF. Conclusions: This study demonstrates GPT-4's ability to elevate patient education by providing high-quality, diverse information tailored to those with advanced literacy levels. As AI technology evolves, refining these tools for accuracy and user-friendliness remains crucial, catering to patients' varying literacy levels and information needs in spine surgery. Competing Interests: Declaration of Conflicting InterestsThe author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Mitchell K. Ng is a paid consultant at Ferghana Partners. For the remaining authors none were declared.
Databáze:	MEDLINE
Externí odkaz:	Zobrazit plný text záznamu