Evaluating the Clinical Decision-Making Accuracy of Artificial Intelligence in C...

연구 요약

Evaluating the Clinical Decision-Making Accuracy of Artificial Intelligence in Common Geriatric Syndromes Using Evidence-Based Guidelines.

Cureus 학술지에 발표된 이 연구는 Cassar P, Galea F, Ferry P 외 연구팀이 수행하였습니다.

이 연구는 'Evaluating the Clinical Decision-Making Accuracy of Artificial Intelligence in Common Geriatric Syndromes Using Evidence-Based Guidelines.'에 대한 과학적 분석을 제공합니다.

핵심 내용

Background Artificial intelligence (AI) tools such as ChatGPT are increasingly being explored for clinical decision support, yet their role in geriatric medicine remains uncertain due to the complexity of multimorbidity and care planning. This study aimed to evaluate the clinical accuracy, completeness, and guideline alignment of ChatGPT's responses to common geriatric scenarios using standardized vignettes. Methodology Seven standardized vignettes representing common geriatric scenarios, namely, polypharmacy, falls, dementia, delirium, frailty, advance care planning, and urinary incontinence, were submitted to ChatGPT (GPT-5). Responses were evaluated by five independent consultant geriatricians using a standardized rubric across the following five domains: accuracy, completeness, guideline alignment, safety, and clarity (0-2 score per domain). Descriptive statistics summarized performance, and qualitative feedback was thematically analyzed. Inter-rater reliability was assessed using Krippendorff's alpha. Results ChatGPT scored the highest in clarity (66/70) and safety (63/70), with slightly lower performance in accuracy (59/70) and completeness (55/70). Guideline alignment was generally strong (61/70). Advance care planning received the highest domain scores; urinary incontinence scored the lowest. Krippendorff's alpha showed high inter-rater agreement (0.969). Reviewers identified key omissions, such as missing assessments or guideline-recommended tools, in multiple vignettes. Conclusions ChatGPT showed potential as a supportive tool in geriatric care, offering clear and generally safe responses aligned with guidelines. However, it lacked clinical depth and missed key elements in complex scenarios. AI tools such as ChatGPT should be used with caution, under expert oversight, and not as standalone decision makers in clinical practice.

일반인을 위한 해석

구체적인 실천 사항은 담당 의사 또는 약사와 상담하시기 바랍니다.

실천 사항

현재 복용 중인 약물이나 영양제에 대해 궁금한 점이 있다면 담당 의사 또는 약사와 상담하시기 바랍니다
약물이나 영양제의 용법·용량을 임의로 변경하지 마세요
이상 반응이 나타나면 즉시 전문가에게 문의하세요

의사/약사의 전문적 판단을 대체하지 않습니다 (PMID: 41717208)

📄 [전문 보기 (Markdown)](fulltext/41717208-evaluating-the-clinical-decision-making-accuracy-of-artifici.md)

이 정보는 의학 논문의 요약이며, 의사/약사의 전문적 판단을 대체하지 않습니다.