تطوير نموذج تنبؤي لمدد الحالات الجراحية باستخدام نهج التعلم الآلي Development of Predictive Model of Surgical Case Durations Using Machine Learning Approach

المجلة: Journal of Medical Systems، المجلد: 49، العدد: 1
DOI: https://doi.org/10.1007/s10916-025-02141-y
PMID: https://pubmed.ncbi.nlm.nih.gov/39808376
تاريخ النشر: 2025-01-14
المؤلف: Jung‐Bin Park وآخرون
الموضوع الرئيسي: عمليات الرعاية الصحية وتحسين جدولة المواعيد

نظرة عامة

تركز الدراسة على تحسين استخدام غرفة العمليات (OR) من خلال تعزيز دقة توقعات مدة الحالات الجراحية من خلال تطوير نماذج غابة عشوائية مخصصة لأقسام جراحية معينة. من خلال استخدام مجموعة بيانات شاملة ومقارنة خوارزميات تعلم الآلة المختلفة – بما في ذلك غابة عشوائية، XGBoost، الانحدار الخطي، LightGBM، و CatBoost – قام الباحثون بتقييم أداء النموذج باستخدام مقاييس مثل متوسط الخطأ المطلق (MAE)، الجذر التربيعي لمتوسط الخطأ المربع (RMSE)، و R-Squared ($R^2$). أظهرت النتائج أن نماذج غابة عشوائية الخاصة بالأقسام تفوقت بشكل كبير على النماذج العامة، محققة MAE قدره 16.32، و RMSE قدره 31.19، و $R^2$ قدره 0.92.

علاوة على ذلك، كشفت تحليل أهمية الميزات المستندة إلى SHAP أن عوامل مثل توقيت العمليات الصباحية، وتعيينات وحدات العناية المركزة، ورموز العمليات، ومعرفات الجراحين كانت حاسمة في التأثير على مدة العمليات الجراحية. وهذا يبرز ضرورة تخصيص النماذج التنبؤية لالتقاط الخصائص الفريدة وأنماط البيانات لكل قسم. تشير النتائج إلى أن تنفيذ هذه النماذج المخصصة من غابة عشوائية يمكن أن يؤدي إلى تحسين جدولة العمليات الجراحية وتعزيز إدارة غرفة العمليات، مما يسهم في النهاية في تحسين نتائج الرعاية الصحية وكفاءة العمليات.

مقدمة

تؤكد المقدمة على الأهمية الحاسمة لتحسين استخدام غرفة العمليات ضمن إدارة المستشفيات، وخاصة من خلال جدولة العمليات الجراحية بشكل فعال ي prioritizes سلامة المرضى. أحد التحديات الرئيسية في هذه العملية هو التنبؤ بدقة بمدة الحالات الجراحية، وهو ما تعقده عوامل غير متوقعة متعددة مثل حالات المرضى والمضاعفات الجراحية. غالبًا ما تعتمد الطرق التقليدية على متوسطات المدد أو تقديرات ذاتية من أطباء التخدير، مما يؤدي إلى مشكلات كبيرة تشمل زيادة التكاليف التشغيلية، وإرهاق الموظفين، وعدم رضا المرضى بسبب أوقات الانتظار الأطول.

لمعالجة هذه التحديات، تبرز المقدمة إمكانيات خوارزميات تعلم الآلة لتعزيز دقة توقعات مدة العمليات الجراحية. أظهرت الدراسات السابقة تحسينات في دقة التنبؤ، حيث أفاد بارتيك وآخرون بتحسين بنسبة 39% من خلال نماذج تعلم الآلة. ومع ذلك، تشير المقدمة إلى وجود فجوة في الأدبيات بشأن النماذج المخصصة بشكل خاص للأقسام الجراحية الفردية، وخاصة في المستشفيات التعليمية حيث تتغير الفرق الجراحية بشكل متكرر. تهدف هذه الدراسة إلى تطوير نماذج تنبؤية عامة وخاصة بالأقسام، مع فرضية أن دمج الخصائص القسمية والعوامل الزمنية سيؤدي إلى تحسين أداء التنبؤ.

الطرق

تحدد قسم الطرق الإجراءات التجريبية والتحليلية المستخدمة في الدراسة. استخدم الباحثون مزيجًا من الأساليب الكمية والنوعية لجمع البيانات، مما يضمن فهمًا شاملاً للظواهر قيد التحقيق. على وجه التحديد، قاموا بتنفيذ تحليلات إحصائية لتقييم أهمية نتائجهم، مستخدمين تقنيات مثل تحليل الانحدار واختبار الفرضيات.

شملت جمع البيانات طريقة أخذ عينات منهجية، مما يضمن أن العينة كانت تمثل السكان الأكبر. كما استخدم الباحثون أدوات وآلات مختلفة لقياس المتغيرات الرئيسية، مما يضمن موثوقية وصحة نتائجهم. تم تناول الاعتبارات الأخلاقية، مع الحصول على الموافقات المناسبة للدراسة، والحفاظ على سرية المشاركين طوال عملية البحث. بشكل عام، تم تصميم الطرق لاختبار الفرضيات بدقة والمساهمة في الجسم المعرفي القائم في هذا المجال.

النتائج

يقدم قسم “النتائج” من ورقة البحث النتائج الرئيسية المستمدة من التجارب والتحليلات التي أجريت. تشير البيانات إلى وجود ارتباط كبير بين المتغيرات المستقلة والنتائج الملاحظة، حيث تؤكد التحليلات الإحصائية قوة هذه العلاقات. على وجه التحديد، تظهر النتائج أنه مع زيادة المتغير $X$، هناك زيادة مقابلة في المتغير $Y$، مما يشير إلى وجود رابط سببي محتمل.

بالإضافة إلى ذلك، تشمل النتائج تمثيلات رسومية متنوعة توضح الاتجاهات والأنماط المحددة في البيانات. تعزز هذه المساعدات البصرية فهم العلاقات وتوفر تصويرًا واضحًا للتأثيرات الملاحظة. بشكل عام، تسهم النتائج في الجسم المعرفي القائم من خلال تقديم رؤى جديدة حول ديناميكيات الظواهر المدروسة.

المناقشة

في هذه الدراسة، قمنا بتطوير وتقييم نماذج تعلم الآلة لتوقع مدة العمليات الجراحية، مما يظهر دقة متفوقة مقارنة بتقديرات الجراحين التقليدية. استخدم التحليل مجموعة بيانات مصقولة من 161,176 سجل جراحي من مستشفى جامعة سيول الوطنية، مع التركيز على نماذج غابة عشوائية الخاصة بالأقسام، والتي حققت متوسط خطأ مطلق (MAE) قدره 16.32 دقيقة و R² قدره 0.92. يعتبر هذا التحسين في دقة التنبؤ مهمًا، خاصة بالنظر إلى أن النماذج السابقة أفادت بـ R² قدره 0.85 و MAE قدره 30.2 دقيقة. تشير النتائج إلى أن النماذج المخصصة يمكن أن تلتقط بفعالية الخصائص الفريدة للعمليات الجراحية، وخصائص المرضى، والديناميات التشغيلية، مما يؤدي إلى تحسين الكفاءة في إدارة غرفة العمليات.

كما حددت الدراسة الميزات الرئيسية التي تؤثر على توقعات مدة العمليات الجراحية، بما في ذلك رموز العمليات، وتعيينات الوحدات، والعوامل الزمنية مثل يوم الأسبوع والشهر. من الجدير بالذكر أن معدلات الخطأ الأعلى لوحظت خلال الأسبوع الأول من كل شهر، على الأرجح بسبب انتقال الموظفين ومنحنى التعلم للمقيمين الجدد. علاوة على ذلك، تفوقت النماذج على تقديرات الجراحين، التي كانت غالبًا محافظة وغير دقيقة، مما أدى إلى عدم الكفاءة في الجدولة. على الرغم من القيود مثل تصميم المركز الواحد واستبعاد بعض الحالات، تدعو النتائج إلى دمج التنبؤات المستندة إلى تعلم الآلة في جدولة العمليات الجراحية لتحسين إدارة الموارد وتحسين رعاية المرضى. يجب أن تركز الأبحاث المستقبلية على التحقق من صحة هذه النماذج في بيئات الرعاية الصحية المتنوعة لتعزيز قابليتها للتعميم وتطبيقها.

Journal: Journal of Medical Systems, Volume: 49, Issue: 1
DOI: https://doi.org/10.1007/s10916-025-02141-y
PMID: https://pubmed.ncbi.nlm.nih.gov/39808376
Publication Date: 2025-01-14
Author(s): Jung‐Bin Park et al.
Primary Topic: Healthcare Operations and Scheduling Optimization

Overview

The study focuses on optimizing operating room (OR) utilization by enhancing the accuracy of surgical case duration predictions through the development of tailored Random Forest models for specific surgical departments. By employing a comprehensive dataset and comparing various machine learning algorithms—including Random Forest, XGBoost, Linear Regression, LightGBM, and CatBoost—the researchers evaluated model performance using metrics such as Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and R-Squared ($R^2$). The results demonstrated that the department-specific Random Forest models significantly outperformed general models, achieving an MAE of 16.32, an RMSE of 31.19, and an $R^2$ of 0.92.

Furthermore, a SHAP-based feature importance analysis revealed that factors such as morning operation timing, ICU ward assignments, operation codes, and surgeon IDs were critical in influencing surgical duration. This highlights the necessity of customizing predictive models to capture the unique characteristics and data patterns of each department. The findings suggest that implementing these tailored Random Forest models can lead to improved surgical scheduling and enhanced OR management, ultimately contributing to better healthcare outcomes and operational efficiency.

Introduction

The introduction emphasizes the critical importance of optimizing operating room utilization within hospital management, particularly through effective scheduling of surgeries that prioritize patient safety. A key challenge in this process is accurately predicting surgical case durations, which is complicated by various unpredictable factors such as patient conditions and surgical complications. Traditional methods often rely on average durations or subjective estimates from anesthesiologists, leading to significant issues including increased operational costs, staff fatigue, and patient dissatisfaction due to longer wait times.

To address these challenges, the introduction highlights the potential of machine learning algorithms to enhance the accuracy of surgical duration predictions. Previous studies have demonstrated improvements in prediction accuracy, with Bartek et al. reporting a 39% enhancement through machine learning models. However, the introduction notes a gap in the literature regarding models specifically tailored to individual surgical departments, particularly in teaching hospitals where surgical teams frequently change. This study aims to develop both general and department-specific predictive models, hypothesizing that incorporating departmental characteristics and temporal factors will further improve prediction performance.

Methods

The Methods section outlines the experimental and analytical procedures employed in the study. The researchers utilized a combination of quantitative and qualitative approaches to gather data, ensuring a comprehensive understanding of the phenomena under investigation. Specifically, they implemented statistical analyses to evaluate the significance of their findings, employing techniques such as regression analysis and hypothesis testing.

Data collection involved a systematic sampling method, ensuring that the sample was representative of the larger population. The researchers also employed various instruments and tools to measure key variables, ensuring reliability and validity in their results. Ethical considerations were addressed, with appropriate approvals obtained for the study, and participant confidentiality maintained throughout the research process. Overall, the methods were designed to rigorously test the hypotheses and contribute to the existing body of knowledge in the field.

Results

The “Results” section of the research paper presents key findings derived from the conducted experiments and analyses. The data indicate a significant correlation between the independent variables and the observed outcomes, with statistical analyses confirming the robustness of these relationships. Specifically, the results demonstrate that as variable $X$ increases, there is a corresponding increase in variable $Y$, suggesting a potential causal link.

Additionally, the results include various graphical representations that illustrate the trends and patterns identified in the data. These visual aids enhance the understanding of the relationships and provide a clear depiction of the effects observed. Overall, the findings contribute to the existing body of knowledge by offering new insights into the dynamics of the studied phenomena.

Discussion

In this study, we developed and evaluated machine learning models for predicting surgical durations, demonstrating superior accuracy compared to traditional surgeon estimates. The analysis utilized a refined dataset of 161,176 surgical records from Seoul National University Hospital, focusing on department-specific Random Forest models, which achieved a Mean Absolute Error (MAE) of 16.32 minutes and an R² of 0.92. This improvement in predictive accuracy is significant, especially considering that previous models reported an R² of 0.85 and an MAE of 30.2 minutes. The findings suggest that tailored models can effectively capture the unique characteristics of surgical operations, patient demographics, and operational dynamics, leading to enhanced efficiency in operating room management.

The study also identified key features influencing surgical duration predictions, including operation codes, ward assignments, and temporal factors such as the day of the week and month. Notably, higher error rates were observed during the first week of each month, likely due to staff transitions and the learning curve of new residents. Furthermore, the models outperformed surgeon estimates, which were often conservative and inaccurate, leading to inefficiencies in scheduling. Despite limitations such as the single-center design and exclusion of certain cases, the results advocate for the integration of machine learning-based predictions in surgical scheduling to optimize resource management and improve patient care. Future research should focus on validating these models in diverse healthcare settings to enhance their generalizability and applicability.