The impact of language variability on artificial intelligence performance in regenerative endodontics
| dc.contributor.author | Büyüközer Özkan, Hatice | |
| dc.contributor.author | Doğan Çankaya, Tülin | |
| dc.contributor.author | Kölüş, Türkay | |
| dc.date.accessioned | 2025-08-01T11:23:03Z | |
| dc.date.available | 2025-08-01T11:23:03Z | |
| dc.date.issued | May 2025 | |
| dc.department | KMÜ, Ahmet Keleşoğlu Diş Hekimliği Fakültesi, Klinik Bilimler Bölümü | |
| dc.description.abstract | Regenerative endodontic procedures (REPs) are promising treatments for immature teeth with necrotic pulp. Artificial intelligence (AI) is increasingly used in dentistry; thus, this study evaluates the reliability of AI-generated information on REPs, comparing four AI models against clinical guidelines. Methods: ChatGPT-4o, Claude 3.5 Sonnet, Grok 2, and Gemini 2.0 Advanced were tested with 20 REP-related questions from the ESE/AAE guidelines and expert consensus. Questions were posed in Turkish and English, with or without prompts. Two specialists assessed 640 AI-generated answers via a four-point rubric. Inter-rater reliability and response accuracy were statistically analyzed. Results: Inter-rater reliability was high (0.85–0.97). ChatGPT-4o showed higher accuracy with English prompts (p < 0.05). Claude was more accurate than Grok in the Turkish (nonprompted) and English (prompted) conditions (p < 0.05). No model reached ≥80% accuracy. Claude (English, prompted) scored highest; Grok-Turkish (nonprompted) scored lowest. Conclusions: The performance of AI models varies significantly across languages. English queries yield higher accuracy. While AI shows potential for REPs information, current models lack sufficient accuracy for clinical reliance. Cautious interpretation and validation against guidelines are essential. Further research is needed to enhance AI performance in specialized dental fields. | |
| dc.identifier.doi | 10.3390/healthcare13101190 | |
| dc.identifier.issn | 22279032 | |
| dc.identifier.issue | 10 | |
| dc.identifier.pmid | 40428026 | |
| dc.identifier.startpage | 1190 | |
| dc.identifier.uri | https://www.doi.org/10.3390/healthcare13101190 | |
| dc.identifier.uri | https://hdl.handle.net/11492/10898 | |
| dc.identifier.volume | 13 | |
| dc.identifier.wos | WOS:001496290400001 | |
| dc.identifier.wosquality | Q2 | |
| dc.indekslendigikaynak | Scopus | |
| dc.institutionauthor | Kölüş, Türkay | |
| dc.institutionauthorid | Kölüş, Türkay/0000-0002-0840-7126 | |
| dc.language.iso | en | |
| dc.publisher | Multidisciplinary Digital Publishing Institute (MDPI) | |
| dc.relation.ispartof | Healthcare (Switzerland) | |
| dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | |
| dc.rights | info:eu-repo/semantics/openAccess | |
| dc.subject | Artificial İntelligence | |
| dc.subject | Chatgpt | |
| dc.subject | Claude | |
| dc.subject | Dental Education | |
| dc.subject | Endodontics | |
| dc.subject | Gemini | |
| dc.subject | Grok | |
| dc.subject | Regenerative Endodontic Procedures | |
| dc.title | The impact of language variability on artificial intelligence performance in regenerative endodontics | |
| dc.type | Article |












