التنبؤ بالهشاشة لدى كبار السن في المجتمع بناءً على التعلم الآلي: مراجعة منهجية وتحليل ميتا Prediction of frailty in community older adults based on machine learning: a systematic review and meta-analysis

المجلة: Frontiers in Public Health، المجلد: 13
DOI: https://doi.org/10.3389/fpubh.2025.1667792
PMID: https://pubmed.ncbi.nlm.nih.gov/41602088
تاريخ النشر: 2026-01-12
المؤلف: Yifan Ou وآخرون
الموضوع الرئيسي: الهشاشة لدى كبار السن

نظرة عامة

تقييمت هذه المراجعة المنهجية والتحليل التلوي أداء نماذج التعلم الآلي (ML) التي تم تطويرها للتنبؤ بالهشاشة لدى كبار السن الذين يعيشون في المجتمع. تم تحديد 10 دراسات من خلال بحث شامل عبر عدة قواعد بيانات، مما أدى إلى تطوير 45 نموذجًا من نماذج التعلم الآلي، حيث خضعت 36 منها للتحقق الداخلي و9 للتحقق الخارجي. أظهر التحليل منطقة تحت المنحنى المجمعة (AUC) تبلغ 0.878 (95% CI 0.799، 0.958) لتنبؤ الهشاشة الأساسية و0.730 (95% CI 0.670، 0.790) للتنبؤ الطولي. كانت منطقة AUC المجمعة عبر جميع الدراسات 0.786 (95% CI 0.697، 0.875)، مما يشير إلى أداء تنبؤي مرضٍ.

على الرغم من هذه النتائج الواعدة، وُجد أن جودة وملاءمة النماذج كانت غير كافية، حيث أظهرت العديد من الدراسات خطرًا عاليًا من التحيز وعدم كفاية التحقق الخارجي. يوصي المؤلفون بأن تلتزم الأبحاث المستقبلية بإرشادات TRIPOD+AI وPROBAST+AI لتعزيز جودة وملاءمة النماذج. تشمل التوصيات الرئيسية الإبلاغ الشامل عن مصادر البيانات، وتعريفات المتنبئين، والتعامل مع البيانات المفقودة، ومنهجيات التحقق الصارمة. من الضروري التأكيد على الشفافية الكاملة والصرامة المنهجية لتحسين موثوقية وقابلية إعادة إنتاج النماذج التنبؤية في هذا المجال.

مقدمة

تسلط المقدمة الضوء على التحول الديموغرافي العالمي الكبير نحو زيادة عدد السكان المسنين، مع توقعات تشير إلى أنه بحلول عام 2050، سيكون هناك 2.1 مليار فرد فوق سن 60. يرافق هذا الاتجاه الديموغرافي زيادة في انتشار الهشاشة، والتي تُعرف بانخفاض الاحتياطيات الفسيولوجية وتقليل المقاومة للضغوط، مما يؤثر على حوالي 43.3% من كبار السن في البيئات المجتمعية.

كما تؤكد هذه الفقرة على المرحلة الناشئة من البحث الذي يستخدم التعلم الآلي للتنبؤ بالهشاشة بين كبار السن الذين يعيشون في المجتمع. تشير منطقة تحت المنحنى المجمعة (AUC) التي تبلغ 0.786 (95% CI 0.697، 0.875) إلى قدرات تنبؤية واعدة، على الرغم من وجود تباين كبير بين الدراسات. تشير المخاوف المتعلقة بجودة وتحامل الأبحاث الحالية، كما حددها إطار عمل PROBAST+AI، إلى أن النتائج قد تكون مبالغ فيها، مما يستلزم تعزيز التوحيد القياسي في الدراسات المستقبلية بشأن السكان ومصادر البيانات والعوامل التنبؤية وطرق التحليل لتحسين القوة العامة وقابلية التعميم.

النتائج

في هذه الدراسة، أسفر بحث منهجي عبر 10 قواعد بيانات عن إجمالي 8,210 مقالة، بالإضافة إلى تحديد 4 دراسات إضافية من خلال مراجعات قوائم المراجع. بعد إزالة التكرارات باستخدام EndNote X9، تبقى 4,836 مقالة للتقييم الإضافي. أسفر الفحص الأولي بناءً على العناوين والملخصات عن استبعاد 4,812 مقالة، مما قلص الاختيار إلى 24 مقالة لتقييم الأهلية للنص الكامل. بعد مراجعة دقيقة لهذه النصوص، تم استبعاد 14 مقالة لأسباب متنوعة، كما هو موضح في الشكل 1. في النهاية، تم اعتبار 10 مقالات مناسبة للإدراج في الدراسة، كما هو موضح في مخطط تدفق PRISMA 2020 المقدم في الشكل 1.

المناقشة

توضح فقرة المناقشة في ورقة البحث المنهجية والنتائج من مراجعة منهجية لنماذج التعلم الآلي (ML) المستخدمة للتنبؤ بالهشاشة لدى كبار السن الذين يعيشون في المجتمع. تم تحديد عشرة دراسات من خلال بحث شامل عبر عدة قواعد بيانات، نشرت بين عامي 2022 و2025، وكانت في الغالب قائمة على المجموعات، مع أحجام عينات تتراوح من 214 إلى 14,540 مشاركًا. استخدمت الدراسات أدوات تقييم الهشاشة المختلفة، حيث كانت ظاهرة فريد ومؤشر الهشاشة الأكثر شيوعًا. أظهر التحليل أن نماذج التنبؤ الأساسية (AUC = 0.878) تفوقت على النماذج الطولية (AUC = 0.730)، وذلك بسبب القياس المتزامن للمتنبئين وحالة الهشاشة في النماذج الأساسية، مما يمكن أن يؤدي إلى مبالغات في الأداء بسبب مشاكل الارتباط الداخلي.

كما سلطت المراجعة الضوء على المخاوف المتعلقة بجودة وملاءمة الدراسات المدرجة، كما تم تقييمه بواسطة أداة PROBAST+AI. أظهرت عدة دراسات خطرًا عاليًا من التحيز، ويرجع ذلك أساسًا إلى التصاميم الاسترجاعية وعدم كفاية الإبلاغ عن طرق جمع البيانات. يوصي المؤلفون بإعطاء الأولوية للدراسات متعددة المراكز والمستقبلية لتعزيز قابلية تعميم النماذج وصلاحيتها. بالإضافة إلى ذلك، يبرز التباين في معدلات انتشار الهشاشة عبر الدراسات الحاجة إلى أدوات تقييم موحدة وتعريفات واضحة لنتائج الهشاشة. تؤكد النتائج على أهمية تصميم النماذج بعناية والتعامل مع البيانات لتحسين القوة وقابلية التفسير للنماذج التنبؤية في أبحاث الهشاشة.

القيود

في هذه المراجعة المنهجية والتحليل التلوي لطرق التعلم الآلي لتنبؤ خطر الهشاشة لدى كبار السن، تم تحديد عدة قيود قد تؤثر على النتائج وملاءمتها في الممارسة السريرية. أولاً، أدت قلة التحقق الخارجي الكافي عبر معظم الدراسات المدرجة إلى عرقلة القدرة على إجراء تحليل تلوي لمنطقة تحت المنحنى المجمعة (AUC) لمجموعات التحقق الخارجي، مما قد يحد من قابلية تعميم النماذج التنبؤية المطورة.

ثانيًا، يُدخل التباين في طرق التقييم المستخدمة للهشاشة ومؤشراتها بين الدراسات تحيزًا محتملاً، مما يعقد تجميع النتائج. أخيرًا، كشفت التحليلات التلوية عن تباين كبير داخلي، ناتج عن اختلافات في خصائص المشاركين وتصاميم الدراسات ومجموعات المتنبئين وتقنيات تحليل البيانات. تؤكد هذه العوامل مجتمعة على الحاجة إلى مزيد من الأساليب الموحدة وعمليات التحقق القوية في الأبحاث المستقبلية لتعزيز موثوقية نماذج التنبؤ بخطر الهشاشة.

Journal: Frontiers in Public Health, Volume: 13
DOI: https://doi.org/10.3389/fpubh.2025.1667792
PMID: https://pubmed.ncbi.nlm.nih.gov/41602088
Publication Date: 2026-01-12
Author(s): Yifan Ou et al.
Primary Topic: Frailty in Older Adults

Overview

This systematic review and meta-analysis evaluated the performance of machine learning (ML) models developed for predicting frailty in community-dwelling older adults. A comprehensive search across multiple databases identified 10 studies, resulting in the development of 45 ML models, with 36 undergoing internal validation and 9 external validation. The analysis revealed a pooled area under the curve (AUC) of 0.878 (95% CI 0.799, 0.958) for baseline frailty prediction and 0.730 (95% CI 0.670, 0.790) for longitudinal prediction. The overall pooled AUC across all studies was 0.786 (95% CI 0.697, 0.875), indicating satisfactory predictive performance.

Despite these promising results, the quality and applicability of the models were found to be inadequate, with many studies exhibiting a high risk of bias and insufficient external validation. The authors recommend that future research adhere to the TRIPOD+AI and PROBAST+AI guidelines to enhance model quality and applicability. Key recommendations include thorough reporting of data sources, predictor definitions, handling of missing data, and rigorous validation methodologies. Emphasizing complete transparency and methodological rigor is essential for improving the reliability and reproducibility of predictive models in this field.

Introduction

The introduction highlights the significant global demographic shift towards an aging population, with projections indicating that by 2050, there will be 2.1 billion individuals over the age of 60. This demographic trend is accompanied by an increasing prevalence of frailty, which is defined by diminished physiological reserves and reduced resistance to stressors, affecting approximately 43.3% of older adults in community settings.

The section also emphasizes the nascent stage of research utilizing machine learning to predict frailty among community-dwelling older adults. A pooled area under the curve (AUC) of 0.786 (95% CI 0.697, 0.875) indicates promising predictive capabilities, although substantial heterogeneity exists among studies. Concerns regarding the quality and bias of current research, as identified by the PROBAST+AI framework, suggest that findings may be overestimated, necessitating enhanced standardization in future studies regarding populations, data sources, predictive factors, and analytical methods to improve robustness and generalizability.

Results

In this study, a systematic search across 10 databases yielded a total of 8,210 articles, supplemented by the identification of 4 additional studies through reference list reviews. After deduplication with EndNote X9, 4,836 articles remained for further evaluation. An initial screening based on titles and abstracts resulted in the exclusion of 4,812 articles, narrowing the selection to 24 articles for full-text eligibility assessment. Following a thorough review of these texts, 14 articles were excluded for various reasons, as detailed in Figure 1. Ultimately, 10 articles were deemed suitable for inclusion in the study, as illustrated in the PRISMA 2020 flowchart presented in Figure 1.

Discussion

The discussion section of the research paper outlines the methodology and findings from a systematic review of machine learning (ML) models used to predict frailty in community-dwelling older adults. A comprehensive search across multiple databases identified ten studies published between 2022 and 2025, primarily cohort-based, with sample sizes ranging from 214 to 14,540 participants. The studies employed various frailty assessment tools, with the Fried Phenotype and Frailty Index being the most common. The analysis revealed that baseline prediction models (AUC = 0.878) outperformed longitudinal models (AUC = 0.730), attributed to the simultaneous measurement of predictors and frailty status in baseline models, which can lead to overestimation of performance due to endogeneity issues.

The review also highlighted concerns regarding the quality and applicability of the included studies, as assessed by the PROBAST+AI tool. Several studies exhibited high risk of bias, primarily due to retrospective designs and inadequate reporting of data collection methods. The authors recommend prioritizing multicenter, prospective studies to enhance model generalizability and validity. Additionally, the variability in frailty prevalence rates across studies underscores the need for standardized assessment tools and clear definitions of frailty outcomes. The findings emphasize the importance of careful model design and data handling to improve the robustness and interpretability of predictive models in frailty research.

Limitations

In this systematic review and meta-analysis of machine learning methods for predicting frailty risk in older adults, several limitations were identified that may impact the findings and their applicability in clinical practice. Firstly, the lack of sufficient external validation across most included studies hindered the ability to conduct a meta-analysis of the pooled area under the curve (AUC) for external validation sets, potentially limiting the generalizability of the predictive models developed.

Secondly, the heterogeneity in assessment methods used for frailty and its predictors among the studies introduces potential bias, complicating the synthesis of results. Lastly, the meta-analyses revealed significant in-house heterogeneity, stemming from variations in participant characteristics, study designs, predictor sets, and data analysis techniques. These factors collectively underscore the need for more standardized approaches and robust validation processes in future research to enhance the reliability of frailty risk prediction models.