MULTIDISCIPLINARY EVALUATION OF THE EFFECTIVENESS OF CHATGPT AND DEEPSEEK IN ANSWERING QUESTIONS ABOUT ANTIBIOTIC PROPHYLAXIS IN PERIODONTAL PROCEDURES

Acıpınar Ş., Aydın İ. S., Şahinbaş M.

4. ULUSLARARASI DENTAL ORAL ENFEKSİYONLAR VE 3. AĞIZ MİKROBİYOTASI KONGRESİ, Sakarya, Türkiye, 21-23 February 2025, pp. 132-133 (Abstract)

Publication Type: Conference Paper / Abstract
City of Publication: Sakarya
Country of Publication: Türkiye
Pages: pp. 132-133
Affiliated with Sivas Cumhuriyet University: Yes

Aim: This study evaluated the accuracy and consistency of the answers given by two artificial intelligence applications (ChatGPT 4.0 and DeepSeek) to questions that patients are likely to ask about antibiotic prophylaxis before procedures in a periodontology clinic. The answers were assessed in a multidisciplinary manner, from the perspectives of periodontology and cardiology.

Materials and Method: Thirty questions that patients are likely to ask their dentists about antibiotic prophylaxis before periodontal procedures were identified. All questions were posed to both applications and the answers were recorded. A periodontologist and a cardiologist scored each answer from 1 to 5 on the Global Quality Scale. Differences between the two AI tools and between the two experts were evaluated with the Mann-Whitney U test, and agreement was evaluated with the kappa test.
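The two statistics named in the methods can be illustrated with a minimal Python sketch. It is not the authors' analysis: the scores below are hypothetical, the abstract does not say whether an unweighted or weighted kappa was used (unweighted Cohen's kappa is assumed here), and only the Mann-Whitney U statistic is computed, without a p-value. The function names `cohen_kappa` and `mann_whitney_u` are illustrative.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters scoring the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty item list")
    n = len(rater_a)
    # Proportion of items on which the two raters gave the same score.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance, from each rater's score frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1.0 - expected)


def mann_whitney_u(x, y):
    """Mann-Whitney U statistic (smaller of U_x and U_y), midranks for ties."""
    x, y = list(x), list(y)
    pooled = sorted(x + y)
    ranks = {}
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j] == pooled[i]:
            j += 1
        # Tied values occupy ranks i+1 .. j; assign their average.
        ranks[pooled[i]] = (i + 1 + j) / 2
        i = j
    rank_sum_x = sum(ranks[v] for v in x)
    u_x = rank_sum_x - len(x) * (len(x) + 1) / 2
    u_y = len(x) * len(y) - u_x
    return min(u_x, u_y)


# Hypothetical Global Quality Scale scores (1 to 5) for six answers.
periodontologist = [4, 5, 3, 4, 5, 4]
cardiologist = [5, 5, 4, 4, 5, 5]

kappa = cohen_kappa(periodontologist, cardiologist)
u_stat = mann_whitney_u(periodontologist, cardiologist)
print(f"kappa = {kappa:.3f}, U = {u_stat}")
```

Kappa corrects the raw agreement rate for agreement expected by chance, which is why two experts who both score mostly 4s and 5s can agree on many answers yet still obtain a low kappa.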

Results: For both applications, the lowest score given was 3 and the highest was 5. The overall mean score was 4.56±0.56 for ChatGPT 4.0 and 4.55±0.62 for DeepSeek. When differences between the two tools and between the two physicians were evaluated, a statistically significant difference between the experts was found only for ChatGPT 4.0: the cardiologist's scores for ChatGPT 4.0 were higher than the periodontologist's. The kappa test showed statistically significant, moderate agreement between the experts only for DeepSeek (kappa value: 0.547). The difference between the experts for ChatGPT 4.0 and the negligible kappa agreement (kappa value: 0.146) are thought to result from the periodontologist giving lower scores to this tool's answers to periodontology-specific questions.

Conclusion: According to the expert evaluations in our study, ChatGPT 4.0 and DeepSeek have considerable potential as sources of information on antibiotic prophylaxis in periodontal procedures, from the perspectives of both periodontology and cardiology. However, ChatGPT 4.0 is thought to have shortcomings in informing patients about oral hygiene practices and the precautions to be taken to prevent infection. In conclusion, the responses of these tools do not yet show complete agreement. Technical improvements and large-scale studies are therefore needed to improve their reliability and accuracy.