تطبيق الغابات السببية على بيانات التجارب السريرية العشوائية لتحديد تأثيرات العلاج غير المتجانسة: دراسة حالة Application of causal forests to randomised controlled trial data to identify heterogeneous treatment effects: a case study

المجلة: BMC Medical Research Methodology، المجلد: 25، العدد: 1
DOI: https://doi.org/10.1186/s12874-025-02489-2
PMID: https://pubmed.ncbi.nlm.nih.gov/39987431
تاريخ النشر: 2025-02-22
المؤلف: Eleanor Van Vogt وآخرون
الموضوع الرئيسي: تقنيات الاستدلال السببي المتقدمة

نظرة عامة

تناقش هذه الفقرة استخدام أساليب التعلم الآلي السببي غير المعلم، وبالتحديد الغابات السببية (CFs)، لتحديد تأثيرات العلاج غير المتجانسة (HTEs) في التجارب السريرية العشوائية (RCTs). غالبًا ما تعتمد الطرق التقليدية لتحليل المجموعات الفرعية على اختبار التفاعلات بين معدلات تأثير العلاج ومعدلات تعديل تأثير العلاج، مما قد لا يؤدي إلى مجموعات فرعية ذات صلة سريريًا، خاصة مع المتغيرات المستمرة. أجرت الدراسة تحليلًا ثانويًا لتجربة VANISH RCT، مقارنةً بين الفازوبريسين والنورإبينفرين للبقاء على قيد الحياة بدون فشل كلوي في مرضى الصدمة الإنتانية. استخدمت طرقًا كلاسيكية وتقنيات متكيفة مع البيانات وCFs لتحليل HTEs للنتيجة الرئيسية للبقاء على قيد الحياة بعد 28 يومًا.

كشفت النتائج أن الطرق الثلاثة حددت مستويات البوتاسيوم في المصل كعامل تعديل مهم لتأثير العلاج. على وجه التحديد، أشار الانحدار اللوجستي أحادي المتغير إلى نسبة الأرجحية (OR) تبلغ 0.435 (95% CI [0.270, 0.683]، p = 0.0004)، بينما أسفر انحدار لاسو الهرمي عن OR موحد قدره 0.604 (95% CI [0.259, 0.701]). أظهرت طريقة الغابة السببية، على الرغم من أنها أظهرت بعض الأدلة على HTE (p = 0.124)، حدًا متوسطًا لمستوى البوتاسيوم في المصل عند 4.68 مليمول/لتر، مما أدى إلى اختلافات واضحة في المخاطر في البقاء بناءً على هذا الحد. تستنتج الدراسة أن CFs تكمل بفعالية الطرق الكلاسيكية والمتكيفة مع البيانات في تحديد HTEs، مما يوفر نهجًا جديدًا لتعريف الحدود ذات الصلة سريريًا التي قد تعزز فهم تأثيرات العلاج في مجموعات مرضى متنوعة.

مقدمة

تؤكد مقدمة هذه الورقة البحثية على أهمية فهم تأثيرات العلاج غير المتجانسة (HTEs) في التجارب السريرية العشوائية (RCTs)، لا سيما في بيئات الرعاية الحرجة حيث يمكن أن تختلف استجابة المرضى للتدخلات بشكل كبير عن متوسط تأثير العلاج (ATE). يمثل ATE الفرق السببي بين النتائج المتوسطة للمرضى المعالجين وغير المعالجين، ولكن قد يثني ATE الصفري في التجارب الأخيرة عن المزيد من التحقيق في العلاجات المحتملة المفيدة. يجادل المؤلفون بأن فهمًا أعمق لـ HTEs يمكن أن يسهل البحث المستمر في التدخلات التي قد تفيد مجموعات فرعية معينة من المرضى، مما يؤدي إلى تحسين تخصيص الموارد وتحسين رعاية المرضى.

تسلط الورقة الضوء على قيود الطرق التقليدية لتقدير HTEs، والتي غالبًا ما تتطلب أحجام عينات كبيرة وتحليلات مسبقة التخطيط للمجموعات الفرعية، مما يؤدي إلى زيادة مخاطر الأخطاء من النوع الأول عند اختبار خصائص متعددة. في المقابل، يقترح المؤلفون استخدام الغابات السببية (CFs) وأساليب التعلم الآلي السببي الأخرى كطرق مبتكرة لتقدير تأثيرات العلاج المتوسطة الشرطية (CATEs) بناءً على خصائص المرضى. تستخدم طريقة CF نهجًا “صادقًا” لضمان استنتاج صحيح من خلال تقسيم البيانات إلى مجموعات فرعية لتحديد انقسامات الشجرة وتقدير CATEs. يهدف المؤلفون إلى استكشاف فائدة CFs في توليد مجموعات فرعية قابلة للتنفيذ سريريًا ومقارنة هذا النهج باختبارات التفاعل أحادية المتغير التقليدية والطرق المتكيفة مع البيانات، باستخدام تجربة VANISH كدراسة حالة لتحليلهم.

طرق

في الدراسة التي أجراها فان فوغت وآخرون (2025)، كانت النتيجة الرئيسية التي تم تقييمها هي عدد الأيام التي عاشها المرضى بدون فشل كلوي خلال 28 يومًا بعد العشوائية، والتي تجمع بين كل من الوفيات خلال 28 يومًا والأيام بدون إصابة كلوية. يركز هذا البحث على نمذجة تأثير العلاج فيما يتعلق بالنتيجة الثنائية لوفيات 28 يومًا، مقارنةً بين الفازوبريسين والنورإبينفرين كعلاجات خط أول. أشارت النتائج إلى عدم وجود فرق كبير في معدلات الوفيات بين المجموعتين، حيث كانت 30.9% في مجموعة الفازوبريسين و27.5% في مجموعة النورإبينفرين، مما أسفر عن فرق في المخاطر قدره 3.4% (95% CI: [-5.4%، 12.3%]).

يؤكد المؤلفون أن صحة هذه النتيجة الصفرية تعتمد على التحديد الصحيح لنموذج الانحدار اللوجستي الأساسي وفرضية تجانس تأثير العلاج. إذا تم انتهاك هذه الفرضية، فقد تخفي النتيجة الصفرية الملاحظة التباين في استجابات العلاج بين المشاركين. وبالتالي، تبرز الدراسة أهمية استكشاف عدم التجانس في تأثيرات العلاج (HTEs) لتحديد مجموعات فرعية معينة من المرضى قد تكون قد استفادت أو تأثرت سلبًا من العلاجات.

نتائج

في قسم النتائج من تحليل تجربة VANISH، تم تضمين جميع المشاركين، وتم تفصيل الخصائص الأساسية في الجدول 1. بعد استبعاد المتغيرات ذات الفقدان الكبير (أكثر من 30%) والارتباط العالي (أكثر من 0.7)، تبقى ما مجموعه 49 متغيرًا أساسيًا للتقييم. أشار التحليل إلى أن الخصائص كانت متوازنة جيدًا عبر أذرع العلاج، مما يتماشى مع التوقعات التي وضعتها عملية العشوائية.

مناقشة

في هذا التحليل الثانوي لتجربة VANISH، استكشف الباحثون عدم تجانس تأثير العلاج (HTE) المتعلق بمستويات البوتاسيوم في المصل، كاشفين أن المستويات الأعلى من البوتاسيوم كانت مرتبطة باستجابات علاجية أسوأ. تتماشى هذه النتيجة مع الفهم بأن فرط بوتاسيوم الدم يحدث غالبًا في المرضى الذين يعانون من خلل كلوي متقدم، مما يشير إلى أن أولئك الذين يعانون من تعفن الدم الشديد قد لا يستفيدون من علاج الفازوبريسين المبكر. حددت طرق تحليلية متنوعة، بما في ذلك الانحدار أحادي المتغير، لاسو الهرمي، والغابات السببية (CF)، بشكل متسق مستوى البوتاسيوم في المصل كعامل مهم يؤثر على HTE. من الجدير بالذكر أن نهج CF حدد مجموعتين فرعيتين متميزتين بناءً على مستويات البوتاسيوم، مما يبرز حدًا ضمن النطاق الطبيعي كان يمكن أن يتم تجاهله دون هذا التحليل المتقدم.

تؤكد الدراسة على مزايا استخدام الغابات السببية مقارنةً بأساليب الانحدار التقليدية، لا سيما في تحديد المجموعات الفرعية المعرفة بواسطة المتغيرات المستمرة. بينما أشارت تقنيات الانحدار إلى علاقة سلبية بين مستويات البوتاسيوم واستجابة العلاج، قدمت CF فهمًا أكثر دقة من خلال الكشف عن اختلافات المجموعات الفرعية التي يمكن أن توجه اتجاهات البحث المستقبلية. ومع ذلك، يحذر المؤلفون من أن هذه النتائج الاستكشافية يجب ألا تؤثر مباشرة على الممارسة السريرية دون مزيد من التحقق، حيث قد تكون المجموعات الفرعية المحددة غير قوية وتتطلب مزيدًا من التحقيق لتأسيس فترات ثقة قوية. بشكل عام، تسلط التحليل الضوء على إمكانيات الأساليب المعتمدة على البيانات لكشف رؤى ذات صلة سريريًا قد تفوتها الأساليب التقليدية.

Journal: BMC Medical Research Methodology, Volume: 25, Issue: 1
DOI: https://doi.org/10.1186/s12874-025-02489-2
PMID: https://pubmed.ncbi.nlm.nih.gov/39987431
Publication Date: 2025-02-22
Author(s): Eleanor Van Vogt et al.
Primary Topic: Advanced Causal Inference Techniques

Overview

This section discusses the use of non-parametric causal machine learning approaches, specifically causal forests (CFs), to identify heterogeneous treatment effects (HTEs) in randomized controlled trials (RCTs). Traditional methods for subgroup analysis often rely on testing interactions between treatment effect modifiers and treatment effects, which may not yield clinically relevant subgroups, especially with continuous covariates. The study conducted a secondary analysis of the VANISH RCT, comparing vasopressin and norepinephrine for renal failure-free survival in septic shock patients. It employed classical methods, data-adaptive techniques, and CFs to analyze HTEs for the primary outcome of survival at 28 days.

The findings revealed that all three methods identified serum potassium levels as a significant modifier of treatment effect. Specifically, univariable logistic regression indicated an odds ratio (OR) of 0.435 (95% CI [0.270, 0.683], p = 0.0004), while hierarchical lasso regression yielded a standardized OR of 0.604 (95% CI [0.259, 0.701]). The causal forest approach, while showing some evidence of HTE (p = 0.124), identified a mean threshold of serum potassium at 4.68 mmol/L, leading to distinct risk differences in survival based on this threshold. The study concludes that CFs effectively complement classical and data-adaptive methods in identifying HTEs, offering a novel approach to defining clinically relevant thresholds that may enhance the understanding of treatment effects in diverse patient populations.

Introduction

The introduction of this research paper emphasizes the importance of understanding heterogeneous treatment effects (HTEs) in randomized controlled trials (RCTs), particularly in critical care settings where patient responses to interventions can vary significantly from the average treatment effect (ATE). The ATE represents the causal difference between the mean outcomes of treated and untreated patients, but a null ATE in recent trials may discourage further investigation into potentially beneficial treatments. The authors argue that a deeper understanding of HTEs could facilitate continued research into interventions that may benefit specific patient subgroups, thereby optimizing resource allocation and improving patient care.

The paper highlights the limitations of classical methods for estimating HTEs, which often require large sample sizes and pre-planned subgroup analyses, leading to increased risks of type I errors when testing multiple characteristics. In contrast, the authors propose the use of causal forests (CFs) and other causal machine learning methods as innovative approaches to estimate conditional average treatment effects (CATEs) based on patient characteristics. The CF method employs an “honest” approach to ensure valid inference by splitting data into subsets for determining tree splits and estimating CATEs. The authors aim to explore the utility of CFs in generating clinically actionable subgroups and compare this approach to traditional univariable interaction tests and data-adaptive methods, using the VANISH trial as a case study for their analysis.

Methods

In the study conducted by Van Vogt et al. (2025), the primary outcome assessed was the number of days alive and free from kidney failure within 28 days post-randomization, which combines both 28-day mortality and days without kidney injury. The focus of this paper is on modeling the treatment effect concerning the binary outcome of 28-day mortality, comparing early vasopressin to norepinephrine as first-line treatments. The results indicated no significant difference in mortality rates between the two groups, with 30.9% in the vasopressin group and 27.5% in the norepinephrine group, yielding a risk difference of 3.4% (95% CI: [-5.4%, 12.3%]).

The authors emphasize that the validity of this null result hinges on the correct specification of the underlying logistic regression model and the assumption of treatment effect homogeneity. If this assumption is violated, the observed null effect may mask variability in treatment responses among participants. Consequently, the study highlights the importance of exploring heterogeneity in treatment effects (HTEs) to identify specific patient subgroups that may have either benefited from or been adversely affected by the treatments.

Results

In the results section of the VANISH trial analysis, all participants were included, and baseline characteristics are detailed in Table 1. Following the exclusion of variables with significant missingness (greater than 30%) and high correlation (greater than 0.7), a total of 49 baseline variables remained for evaluation. The analysis indicated that the characteristics were well balanced across the treatment arms, which aligns with the expectations set by the randomization process.

Discussion

In this secondary analysis of the VANISH trial, the researchers explored treatment effect heterogeneity (HTE) related to serum potassium levels, revealing that higher potassium levels were associated with poorer treatment responses. This finding aligns with the understanding that hyperkalaemia often occurs in patients with advanced renal dysfunction, suggesting that those with severe sepsis may not benefit from early vasopressin treatment. Various analytical methods, including univariable regression, hierarchical lasso, and causal forests (CF), consistently identified serum potassium as a significant factor influencing HTE. Notably, the CF approach identified two distinct subgroups based on potassium levels, highlighting a cutoff within the normal range that could have been overlooked without this advanced analysis.

The study emphasizes the advantages of using causal forests over traditional regression methods, particularly in identifying subgroups defined by continuous covariates. While the regression techniques indicated a negative relationship between potassium levels and treatment response, the CF provided a more nuanced understanding by revealing subgroup differences that could inform future research directions. However, the authors caution that these exploratory findings should not directly inform clinical practice without further validation, as the subgroups identified may be underpowered and require additional investigation to establish robust confidence intervals. Overall, the analysis underscores the potential of data-driven methods to uncover clinically relevant insights that traditional approaches may miss.