تحسين التنبؤات بفترة ما بعد الوفاة من خلال الميتابولوم البشري وتعلم الآلة The human metabolome and machine learning improves predictions of the post-mortem interval

المجلة: Nature Communications، المجلد: 17، العدد: 1
DOI: https://doi.org/10.1038/s41467-026-69158-w
PMID: https://pubmed.ncbi.nlm.nih.gov/41673021
تاريخ النشر: 2026-02-11
المؤلف: Rasmus Magnusson وآخرون
الموضوع الرئيسي: علم الحشرات الجنائي ودراسات ذبابة

طرق

في هذا القسم، يقوم المؤلفون بالتحقيق في القدرات التنبؤية لنماذج التعلم الآلي الخاضعة للإشراف المختلفة لتوقع فترة ما بعد الوفاة (PMI) (من المحتمل أن تكون مقياسًا أو مؤشرًا محددًا). قاموا بإنشاء مجموعة من النماذج، بما في ذلك الانحدار المتدرج، LASSO، الانحدار باستخدام دعم المتجهات (SVR)، الانحدار باستخدام الغابة العشوائية، الانحدار باستخدام تعزيز التدرج، وأقرب الجيران (k-NN). تم تدريب النماذج على 80% من مجموعة البيانات (3,907 ملفات تعريف) واختبارها على 10% (471 ملف تعريف)، مما كشف أن خمسة من أصل ستة نماذج أظهرت قوة تنبؤية كبيرة.

تفوق الشبكة العصبية الأمامية (FFNN) على النماذج الأخرى، حيث حققت متوسط خطأ مطلق أقل من البقية وقيمة $R^2$ تبلغ 0.56. تلتها نماذج أخرى مع LASSO عند 0.49، وتعزيز التدرج عند 0.46، والغابة العشوائية عند 0.40، وSVR عند 0.39. أظهرت K-NN والانحدار المتدرج أداءً أضعف، مع قيم $R^2$ تبلغ 0.18 و0.13، على التوالي. أشار التحليل الإحصائي باستخدام اختبار ويلكوكسون ذو الجانبين إلى وجود اختلافات كبيرة في الأداء بين معظم النماذج (p < 0.05)، باستثناء مقارنات محددة تتعلق بـ LASSO وSVR، وLASSO والغابة العشوائية، وLASSO وتعزيز التدرج، وSVR وتعزيز التدرج. بشكل عام، تشير النتائج إلى أنه بينما يمكن للنماذج الأبسط توقع PMI بفعالية، تظل FFNN الأكثر دقة بين تلك التي تم اختبارها.

نتائج

تظهر النتائج فعالية نموذج الشبكة العصبية الأمامية (FFNN) في توفير تقديرات موثوقة لفترة ما بعد الوفاة (PMI) باستخدام بيانات الميتابولوم من 4,876 عينة دم فخذي. تراوحت قيم PMI من 1 إلى 67 يومًا، حيث كانت 97% من القيم تقع بين 1 و13 يومًا ومتوسط PMI يبلغ 5 أيام. تم تقسيم مجموعة البيانات إلى مجموعات تدريب (80%)، والتحقق (10%)، والاختبار (10%)، وخضع النموذج لتحسين المعلمات الفائقة، واختبار تكوينات مختلفة من الطبقات المخفية ومعدلات التسرب. من الجدير بالذكر أن تضمين آلية انتباه مصممة خصيصًا عزز بشكل كبير أداء النموذج، حيث حقق النموذج الأفضل أداءً متوسط خطأ مطلق (MAE) قدره 1.45 يومًا ومتوسط خطأ مطلق قدره 1.03 يومًا.

عند مقارنته بنموذج تقدير PMI القائم على البوتاسيوم، أظهر نموذج FFNN قدرات تنبؤية متفوقة. على سبيل المثال، عند PMI الفعلي البالغ 3 أيام، قدم نموذج FFNN فترة توقع بنسبة 95% تتراوح بين 2.5-5.9 أيام، وعند 7 أيام، كانت الفترة 4.2-8.3 أيام. بالمقابل، أنتج النموذج القائم على البوتاسيوم فترات أوسع وأقل عملية، مما يشير إلى أن نموذج FFNN يوفر أداة أكثر دقة وموثوقية لتقدير PMI في السياقات الجنائية.

مناقشة

تظهر الدراسة فعالية بيانات الميتابولوم المدمجة مع تقنيات التعلم الآلي، وتحديدًا الشبكة العصبية الأمامية (FFNN)، في توقع فترة ما بعد الوفاة (PMI) بدقة ملحوظة. باستخدام بيانات مطيافية الكتلة عالية الدقة من الحالات الجنائية، حققت FFNN متوسط خطأ مطلق (MAE) قدره 1.45 يومًا، وهو ما يقارن بعدم اليقين الكامن في تقديرات PMI. تم التحقق من قدرة النموذج التنبؤية عبر بيانات اختبار مستقلة، محتفظًا بمتوسط خطأ مطلق قدره 1.78 يومًا على الرغم من التحديات المرتبطة بتباين المنصات. من الجدير بالذكر أن النموذج أظهر دقة أقل عند قيم PMI القصوى، حيث كان يبالغ في تقدير الفترات القصيرة (أقل من 3 أيام) ويقلل من تقدير الفترات الأطول (أكثر من 11 يومًا)، مما يبرز الحاجة إلى مجموعة بيانات تدريب أكثر توازنًا.

بالإضافة إلى ذلك، حددت الدراسة تغييرات ميتابولومية مميزة على مر الزمن من خلال تجميع السلاسل الزمنية الزائفة، كاشفة عن ثلاث مجموعات من المستقلبات: تلك التي تزداد وفرتها، وتلك التي تنخفض وفرتها، وتلك التي تظهر أنماط متقلبة. كانت المستقلبات الرئيسية مثل ليسوفوسفاتيديل كولين وكارنيتينات الأسيل مرتبطة بعمليات بيولوجية مثل تدهور الغشاء وفقدان النشاط الإنزيمي، بينما أشارت الزيادات في الأحماض الأمينية إلى تدهور الأنسجة. تشير النتائج إلى أنه حتى عدد محدود من العينات (بقدر 256) يمكن أن ينتج تقديرات موثوقة لـ PMI، مما يبرز الإمكانية لتنفيذ نماذج قائمة على الميتابولوم في التحقيقات الجنائية. ستركز الأعمال المستقبلية على تعزيز قوة النموذج، ودمج بيانات وصفية إضافية، وتطوير أدوات سهلة الاستخدام للتطبيق الروتيني في البيئات الجنائية.

Journal: Nature Communications, Volume: 17, Issue: 1
DOI: https://doi.org/10.1038/s41467-026-69158-w
PMID: https://pubmed.ncbi.nlm.nih.gov/41673021
Publication Date: 2026-02-11
Author(s): Rasmus Magnusson et al.
Primary Topic: Forensic Entomology and Diptera Studies

Methods

In this section, the authors investigate the predictive capabilities of various supervised machine learning models for predicting PMI (presumably a specific metric or index). They constructed a compendium of models, including ridge regression, LASSO, support vector regression (SVR), random forest regression, gradient boosting regression, and K-nearest neighbors (k-NN). The models were trained on 80% of the dataset (3,907 profiles) and tested on 10% (471 profiles), revealing that five out of the six models demonstrated significant predictive power.

The feedforward neural network (FFNN) outperformed the other models, achieving a mean absolute error lower than the rest and an $R^2$ value of 0.56. Other models followed with LASSO at 0.49, gradient boosting at 0.46, random forest at 0.40, and SVR at 0.39. K-NN and ridge regression exhibited weaker performance, with $R^2$ values of 0.18 and 0.13, respectively. Statistical analysis using the two-sided Wilcoxon signed-rank test indicated significant differences in performance among most models (p < 0.05), except for specific comparisons involving LASSO and SVR, LASSO and random forest, LASSO and gradient boosting, and SVR and gradient boosting. Overall, the findings suggest that while simpler models can predict PMI effectively, the FFNN remains the most accurate among those tested.

Results

The results demonstrate the efficacy of a feed-forward neural network (FFNN) model in providing reliable post-mortem interval (PMI) estimations using metabolomics data from 4,876 femoral blood samples. The PMI values ranged from 1 to 67 days, with 97% of the values falling between 1 and 13 days and a median PMI of 5 days. The dataset was divided into training (80%), validation (10%), and test (10%) sets, and the model underwent hyperparameter optimization, testing various configurations of hidden layers and dropout rates. Notably, the inclusion of a custom-built attention mechanism significantly enhanced model performance, with the best-performing model achieving a mean absolute error (MAE) of 1.45 days and a median absolute error of 1.03 days.

When benchmarked against an established potassium-based PMI estimation model, the FFNN model demonstrated superior predictive capabilities. For example, at an actual PMI of 3 days, the FFNN model provided a 95% prediction interval of 2.5-5.9 days, and at 7 days, the interval was 4.2-8.3 days. In contrast, the potassium-based model yielded broader and less practical intervals, indicating that the FFNN model offers a more precise and reliable tool for PMI estimation in forensic contexts.

Discussion

The study demonstrates the efficacy of metabolomics data combined with machine learning techniques, specifically a feedforward neural network (FFNN), in predicting the post-mortem interval (PMI) with notable accuracy. Utilizing high-resolution mass spectrometry data from forensic cases, the FFNN achieved a mean absolute error (MAE) of 1.45 days, comparable to the inherent uncertainties in PMI estimates. The model’s predictive capability was validated across independent test data, retaining an MAE of 1.78 days despite challenges associated with cross-platform variability. Notably, the model exhibited less accuracy at extreme PMI values, overestimating short intervals (less than 3 days) and underestimating longer ones (greater than 11 days), highlighting the need for a more balanced training dataset.

Additionally, the study identified distinct metabolomic changes over time through pseudo-time series clustering, revealing three groups of metabolites: those with decreasing abundance, those with increasing abundance, and those exhibiting fluctuating patterns. Key metabolites such as lysophosphatidylcholines and acylcarnitines were linked to biological processes like membrane degradation and loss of enzymatic activity, while increases in amino acids indicated tissue breakdown. The findings suggest that even a limited number of samples (as few as 256) can yield reliable PMI predictions, emphasizing the potential for implementing metabolomics-based models in forensic investigations. Future work will focus on enhancing model robustness, integrating additional metadata, and developing user-friendly tools for routine application in forensic settings.