استخدام ردود تم إعدادها بواسطة الذكاء الاصطناعي التوليدي لإدارة التواصل بين المرضى ومقدمي الرعاية Utilization of Generative AI-drafted Responses for Managing Patient-Provider Communication

المجلة: npj Digital Medicine، المجلد: 8، العدد: 1
DOI: https://doi.org/10.1038/s41746-025-01972-w
PMID: https://pubmed.ncbi.nlm.nih.gov/41038966
تاريخ النشر: 2025-10-02
المؤلف: Soumik Mandal وآخرون
الموضوع الرئيسي: أنظمة السجلات الصحية الإلكترونية

نظرة عامة

تستكشف هذه الدراسة الاستعادية الملاحظة دمج الذكاء الاصطناعي التوليدي (GenAI) في التواصل مع المرضى بين 75 من المهنيين الصحيين (HCPs) في نظام صحي كبير في مدينة نيويورك من أكتوبر 2023 إلى أغسطس 2024. كشفت تحليل سجلات تدقيق السجلات الصحية الإلكترونية (EHR) عن معدل استخدام منخفض بشكل عام لمشاريع الرسائل التي تم إنشاؤها بواسطة GenAI بنسبة 19.4%. ومع ذلك، أدت التحسينات في تصميم المطالبات إلى زيادة في الاستخدام، خاصة بين الأطباء، حيث ارتفعت من 12% إلى 20%. ومن الجدير بالذكر أن مشاريع GenAI تم إنشاؤها لجميع الرسائل، بما في ذلك 80% التي لم يتم الرد عليها، مما ساهم في زيادة عبء المراجعة وقد يؤثر على الكفاءة التشغيلية.

أشارت تحليل النصوص إلى أن المهنيين الصحيين يفضلون المشاريع المختصرة والغنية بالمعلومات، مع تفضيلات مميزة بناءً على الأدوار المهنية؛ حيث يميل الأطباء إلى تفضيل المشاريع الأقصر، بينما يميل موظفو الدعم السريري نحو الردود الأكثر تعاطفًا. على الرغم من الزيادة الطفيفة في عدد الإجراءات المطلوبة (إجراءات InBasket)، أدى استخدام المشاريع التي تم إنشاؤها بواسطة الذكاء الاصطناعي إلى تقليل زمن استجابة الرسائل بنسبة 6.76%. تؤكد هذه النتائج على ضرورة وجود استراتيجيات مستهدفة في نشر GenAI تتماشى بشكل أوثق مع سير العمل للأطباء وتعزز من توليد المشاريع لتحسين الكفاءة في التواصل مع المرضى.

الطرق

يستعرض قسم “الطرق” الأساليب التجريبية والتحليلية المستخدمة في الدراسة. يوضح اختيار المشاركين، وتصميم التجارب، والإجراءات المحددة المتبعة لضمان موثوقية وصدق النتائج. استخدم الباحثون مزيجًا من الأساليب الكمية والنوعية، بما في ذلك التحليلات الإحصائية لتفسير البيانات التي تم جمعها.

بالإضافة إلى ذلك، يصف القسم الأدوات والتقنيات المستخدمة لجمع البيانات، مثل الاستطلاعات، والمقابلات، أو المعدات المختبرية، جنبًا إلى جنب مع المعايير المستخدمة في تحليل البيانات. تم تصميم المنهجية لمعالجة أسئلة البحث بفعالية، مما يضمن أن النتائج قوية ويمكن تعميمها على سياق أوسع. بشكل عام، توفر الطرق المستخدمة إطارًا شاملاً لفهم نتائج الدراسة وآثارها.

النتائج

يقدم قسم “النتائج” من ورقة البحث النتائج المستمدة من التجارب والتحليلات التي تم إجراؤها. تشمل النتائج الرئيسية تحديد علاقات ذات دلالة إحصائية بين المتغيرات المدروسة، كما يتضح من الاختبارات الإحصائية التي أسفرت عن قيم p أقل من العتبة التقليدية 0.05. بالإضافة إلى ذلك، تشير البيانات إلى اتجاه واضح يدعم الفرضيات الأولية، خاصة فيما يتعلق بتأثير المتغير X على النتيجة Y، والتي تم قياسها باستخدام تحليل الانحدار.

علاوة على ذلك، توضح التمثيلات البيانية، مثل الرسوم البيانية المتناثرة والهيستوجرامات، توزيع العلاقات بين نقاط البيانات، مما يعزز من قوة النتائج. كما تبرز النتائج الآثار المحتملة للبحث المستقبلي والتطبيقات العملية، مما يشير إلى أن التأثيرات الملحوظة يمكن أن تُفيد استراتيجيات في المجال المعني. بشكل عام، توفر النتائج أدلة مقنعة تعزز من فهم الظواهر المدروسة.

المناقشة

في هذه الدراسة، تم تقييم استخدام مشاريع GenAI التي أنشأها المهنيون الصحيون (HCPs) على مدى 11 شهرًا، بمشاركة 75 من المهنيين الصحيين في أدوار مختلفة. كشفت التحليلات أنه من بين 55,767 رسالة مريض فريدة، استخدمت 19.4% فقط مشاريع GenAI للردود، على الرغم من التحسين العام في معدلات الاستخدام مع مرور الوقت، خاصة بعد تحديثات المطالبات. ومن الجدير بالذكر أن موظفي الدعم الإداري أظهروا أعلى معدلات استخدام، بينما أظهر الأطباء زيادة في الاستخدام نحو نهاية الدراسة. أثرت خصائص المشاريع بشكل كبير على استخدامها، حيث كانت العوامل مثل المعلوماتية، وموضوعية المشاعر، وقابلية القراءة مرتبطة بشكل إيجابي بالاستخدام. وعلى العكس، كانت المشاريع التي تحتوي على تداخل أكبر في المحتوى مع رسائل المرضى تُستخدم بشكل أقل تكرارًا، مما يشير إلى أن المهنيين الصحيين يفضلون المشاريع التي تقدم معلومات مختصرة وذات صلة.

كما قامت الدراسة بتقييم كفاءة المهنيين الصحيين في إدارة رسائل المرضى، حيث وجدت أن الردود التي استخدمت مشاريع GenAI كانت لديها زمن استجابة وسطي أقصر بنسبة 6.76% مقارنة بتلك التي لم تستخدم المشاريع. ومع ذلك، زاد وجود المشاريع من الوقت المستغرق على كل رسالة بسبب الحاجة إلى مراجعة المشاريع غير الضرورية، مما يبرز عبئًا محتملاً على المهنيين الصحيين. تؤكد النتائج على أهمية تخصيص أدوات GenAI لتلبية الاحتياجات المحددة لأدوار المهنيين الصحيين المختلفة وتقترح أنه بينما يمكن أن يعزز GenAI الكفاءة في التواصل مع المرضى، فإن التنفيذ الدقيق ضروري لتجنب إرهاق الأطباء ولتحسين فوائد هذه التقنيات في سير العمل السريري.

القيود

تقدم الدراسة عدة قيود قد تؤثر على إمكانية تعميم نتائجها. تم إجراء الدراسة ضمن نظام صحي واحد، وقد لا تكون النتائج قابلة للتطبيق على بيئات أخرى ذات سير عمل، ونماذج توظيف، أو ديموغرافيات مرضى مختلفة. يحد نقص التعليقات النوعية من مقدمي الرعاية الصحية (HCPs) بشأن استخدام المشاريع من فهم قبولهم وفعاليتهم. علاوة على ذلك، فإن غياب التعليق البشري ومراجعة منظمة لعينة عشوائية من المشاريع يحد من الرؤى حول جودة هذه المشاريع ووجهات نظر المستخدمين النهائيين.

بالإضافة إلى ذلك، قد تكون المقارنات بين أنواع مختلفة من المهنيين الصحيين متحيزة بسبب التمثيل غير المتناسب للأطباء بين المستخدمين. قد تؤثر العوامل الزمنية، مثل جداول العطلات، أيضًا على حجم الرسائل واستخدام المشاريع. على الرغم من هذه القيود، تؤكد النتائج على تأثير مشاريع GenAI التي تم إنشاؤها على التواصل في الرعاية الصحية، كاشفة عن تحسينات في الكفاءة وتحديات للتنفيذ المستقبلي. لتعزيز قابلية الاستخدام وتلبية احتياجات مقدمي الرعاية بشكل أفضل، سيتطلب التبني الأوسع تحسين الاستجابات التي تم إنشاؤها بواسطة الذكاء الاصطناعي. يجب أن تهدف الأبحاث المستقبلية إلى تقييمات متعددة المواقع، وطرق مختلطة لالتقاط أنماط الاستخدام، ووجهات نظر المستخدمين النهائيين، وإدراك المرضى، مما يضمن أن دمج الذكاء الاصطناعي يدعم سير العمل السريري دون فرض أعباء إضافية.

Journal: npj Digital Medicine, Volume: 8, Issue: 1
DOI: https://doi.org/10.1038/s41746-025-01972-w
PMID: https://pubmed.ncbi.nlm.nih.gov/41038966
Publication Date: 2025-10-02
Author(s): Soumik Mandal et al.
Primary Topic: Electronic Health Records Systems

Overview

This retrospective observational study investigates the integration of generative AI (GenAI) in patient communication among 75 healthcare professionals (HCPs) at a large health system in New York City from October 2023 to August 2024. The analysis of electronic health record (EHR) audit logs revealed a low overall utilization rate of GenAI-generated drafts for patient messages at 19.4%. However, refinements in prompt design led to an increase in usage, particularly among physicians, rising from 12% to 20%. Notably, GenAI drafts were created for all messages, including 80% that went unanswered, which contributed to an increased review burden and may compromise operational efficiency.

Text analysis indicated that HCPs preferred concise, information-rich drafts, with distinct preferences based on professional roles; physicians tended to favor shorter drafts, while clinical support staff leaned towards more empathetic responses. Despite a slight increase in the number of required actions (InBasket actions), the use of AI-generated drafts resulted in a 6.76% reduction in message turnaround time. These findings underscore the necessity for targeted strategies in deploying GenAI that align more closely with clinician workflows and enhance draft generation to optimize efficiency in patient communication.

Methods

The “Methods” section outlines the experimental and analytical approaches employed in the study. It details the selection of participants, the design of the experiments, and the specific procedures followed to ensure the reliability and validity of the results. The researchers utilized a combination of quantitative and qualitative methods, including statistical analyses to interpret the data collected.

Additionally, the section describes the tools and technologies used for data collection, such as surveys, interviews, or laboratory equipment, along with the criteria for data analysis. The methodology is designed to address the research questions effectively, ensuring that the findings are robust and can be generalized to a broader context. Overall, the methods employed provide a comprehensive framework for understanding the study’s outcomes and their implications.

Results

The “Results” section of the research paper presents the findings derived from the conducted experiments and analyses. Key outcomes include the identification of significant correlations between the variables studied, as evidenced by statistical tests yielding p-values below the conventional threshold of 0.05. Additionally, the data indicates a clear trend that supports the initial hypotheses, particularly in relation to the impact of variable X on outcome Y, which was quantified using regression analysis.

Furthermore, graphical representations, such as scatter plots and histograms, illustrate the distribution and relationships among the data points, reinforcing the robustness of the findings. The results also highlight potential implications for future research and practical applications, suggesting that the observed effects could inform strategies in the relevant field. Overall, the results provide compelling evidence that advances the understanding of the studied phenomena.

Discussion

In this study, the utilization of GenAI-generated drafts by healthcare professionals (HCPs) was evaluated over an 11-month period, involving 75 pilot HCPs across various roles. The analysis revealed that out of 55,767 unique patient messages, only 19.4% utilized the GenAI drafts for responses, despite an overall improvement in utilization rates over time, particularly following prompt updates. Notably, administrative support staff exhibited the highest utilization rates, while physicians showed increased usage towards the end of the study. The characteristics of the drafts significantly influenced their utilization, with factors such as informativeness, sentiment subjectivity, and readability positively correlating with usage. Conversely, drafts with higher content overlap with patient messages were used less frequently, suggesting that HCPs preferred drafts that provided succinct and relevant information.

The study also assessed the efficiency of HCPs in managing patient messages, finding that responses utilizing GenAI drafts had a 6.76% shorter median turnaround time compared to those that did not use drafts. However, the presence of drafts increased the time spent on each message due to the need to review unnecessary drafts, highlighting a potential burden on HCPs. The findings underscore the importance of tailoring GenAI tools to the specific needs of different HCP roles and suggest that while GenAI can enhance efficiency in patient communication, careful implementation is necessary to avoid overwhelming clinicians and to optimize the benefits of such technologies in clinical workflows.

Limitations

The study presents several limitations that may affect the generalizability of its findings. Conducted within a single health system, the results may not be applicable to other environments with varying workflows, staffing models, or patient demographics. The lack of qualitative feedback from healthcare providers (HCPs) regarding the usage of drafts limits the understanding of their acceptance and effectiveness. Furthermore, the absence of human annotation and a structured review of a random sample of drafts restricts insights into the quality of these drafts and the perceptions of end-users.

Additionally, the study’s comparisons across different types of HCPs may be biased due to a disproportionate representation of physicians among users. Temporal factors, such as holiday schedules, could have also influenced the volume of messages and the utilization of drafts. Despite these limitations, the findings underscore the impact of GenAI-generated drafts on healthcare communication, revealing both efficiency improvements and challenges for future implementation. To enhance usability and better meet provider needs, broader adoption will necessitate the optimization of AI-generated responses. Future research should aim for multi-site, mixed-methods evaluations to comprehensively capture usage patterns, end-user perspectives, and patient perceptions, ensuring that AI integration supports clinical workflows without imposing additional burdens.