التعلم الديناميكي لنقل المعرفة عبر المجالات لرصد تعب السائق: دمج مستشعرات متعددة النماذج مع تخصيصات شخصية في الوقت الحقيقي Dynamic cross-domain transfer learning for driver fatigue monitoring: multi-modal sensor fusion with adaptive real-time personalizations

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-92701-6
PMID: https://pubmed.ncbi.nlm.nih.gov/40328844
تاريخ النشر: 2025-05-06
المؤلف: S. S. Aravinth وآخرون
الموضوع الرئيسي: النوم والإرهاق المرتبط بالعمل

نظرة عامة

تقدم ورقة البحث إطار عمل جديد لتعلم النقل الديناميكي عبر المجالات يهدف إلى تعزيز مراقبة تعب السائق من خلال دمج بيانات المستشعرات متعددة الأنماط. يتناول الإطار تحديات حاسمة مثل تعميم المجال، وتنوع المستشعرات، والتخصيص في الوقت الحقيقي، والتي تتواجد بشكل شائع في أنظمة الكشف عن التعب الحالية. تشمل الابتكارات الرئيسية استخدام شبكة عصبية معادية للمجال لاستخراج الميزات من مدخلات EEG وECG والفيديو، محققة فجوة تكيف المجال أقل من 5% وتحسين دقة النقل عبر المجالات بنسبة تصل إلى 15%. يسهل ASF-Transformer دمج الميزات عبر الأنماط بشكل فعال، مما يؤدي إلى تحسينات في الدقة تتراوح بين 5-8% في ظل ظروف فقدان النمط. بالإضافة إلى ذلك، يقوم شبكة اختيار النمط المغلقة (GMSN) بتقييم جودة المستشعرات ديناميكيًا، مما يحافظ على انخفاضات الأداء أقل من 5% حتى مع البيانات المفقودة أو المزعجة، بينما يسمح آلية التخصيص الدقيقة عبر الإنترنت (OPFT) بالتكيف في الوقت الحقيقي مع السائقين الفرديين، مما يعزز الدقة بنسبة 5-7% خلال ساعتين.

تظهر التقييمات التجريبية على مجموعات بيانات SEED VIG وPhysioNet Driving Fatigue قوة الإطار، محققة دقة متوسطة عبر المجالات تبلغ 87.8%، متفوقة بشكل كبير على الطرق الأساسية. حافظت GMSN على دقة متوسطة تبلغ 83.0% تحت فقدان 20% من المستشعرات، بينما حسنت OPFT الدقة بنسبة 2.5% بعد ساعتين من النشر مع زمن تأخير ضئيل. تؤكد النتائج على التطبيق العملي للإطار في ظروف القيادة الواقعية، مقدمة تقدمًا كبيرًا في سلامة السائق من خلال الكشف المبكر عن التعب. تشمل اتجاهات البحث المستقبلية دمج أنماط إضافية، والتكيف مع سياقات متعددة السائقين، وتطوير هياكل خفيفة الوزن تعتمد على الحافة، وإجراء دراسات طولية لتعزيز القابلية للتعميم عبر ديموغرافيات وبيئات قيادة متنوعة.

مقدمة

تستعرض مقدمة الورقة عقدًا من البحث حول الكشف عن تعب السائق، مع تسليط الضوء على الحاجة المتزايدة لتعزيز سلامة الطرق ومنع الحوادث. تلخص النتائج من 50 دراسة حديثة تستخدم نهج بيانات فسيولوجية وسلوكية ومتعددة الأنماط، إلى جانب تقنيات التعلم الآلي والتعلم العميق المتقدمة. تشمل الطرق الرئيسية التي تم مناقشتها الكشف عن التعب القائم على EEG، وتحليل حركة العين، ومراقبة زاوية التوجيه، مع مساهمات ملحوظة من باحثين مثل جياو وآخرين وسوباسي وآخرين، الذين حققوا موثوقية ودقة عالية في الكشف عن التعب من خلال تصميمات أنظمة مبتكرة.

تؤكد المراجعة على هيمنة الإشارات الفسيولوجية، مثل تقلب معدل ضربات القلب (HRV) والنشاط الكهربائي للجلد (EDA)، في الكشف عن التعب، بينما تعترف أيضًا بالاهتمام المتزايد بالطرق السلوكية، وخاصة تلك التي تستخدم تقنيات قائمة على الرؤية. على سبيل المثال، قام لي وآخرون بنمذجة التعب بفعالية من خلال تغييرات زاوية عجلة القيادة باستخدام الشبكات العصبية المتكررة (RNNs). علاوة على ذلك، تسلط الورقة الضوء على إمكانيات النهج متعددة الأنماط التي تدمج كل من الإشارات الفسيولوجية والسلوكية، والتي أظهرت أنها تعزز دقة الكشف والموثوقية. على الرغم من التقدم، تشير المراجعة إلى القيود في القابلية للتعميم والمتطلبات الحسابية لبعض الطرق، مما يشير إلى الحاجة إلى مزيد من البحث لتحسين التطبيقات الواقعية لهذه الأنظمة للكشف عن التعب.

الطرق

تسلط الطرق المستخدمة في هذه الدراسة الضوء على التحديات المرتبطة بالتحقق التجريبي، لا سيما في سياق التعرف على التعب باستخدام تخطيط الدماغ الكهربائي (EEG). تستخدم الأبحاث مجموعات بيانات EEG صغيرة، مما يظهر الكشف الفعال عن التعب حتى مع عينات بيانات تدريبية قليلة. ومع ذلك، يُلاحظ أن دقة نظام التعرف على التعب تتناقص عند تطبيقه على مجموعات بيانات أكبر، مما يشير إلى قيود محتملة في قابلية تعميم النموذج وموثوقيته عبر ظروف البيانات المتنوعة. وهذا يشير إلى الحاجة إلى مزيد من التحقيق والتحقق باستخدام مجموعات بيانات أكثر شمولاً لتعزيز موثوقية النتائج.

المناقشة

تسلط قسم المناقشة في ورقة البحث الضوء على التقدم والتحديات في الكشف عن تعب السائق من خلال منهجيات مختلفة، مع التركيز بشكل خاص على دمج تقنيات التعلم الآلي (ML) والمراقبة الفسيولوجية. تشير النتائج الرئيسية إلى أنه بينما تتفوق الطرق التقليدية، مثل الأنظمة القائمة على EEG، في التقاط التغيرات في موجات الدماغ المرتبطة بالتعب، فإنها غالبًا ما تفتقر إلى القابلية للتوسع والموثوقية في التطبيقات الواقعية. بالمقابل، توفر نهج رؤية الكمبيوتر (CV) حلولًا أكثر قابلية للنشر ولكن قد تواجه صعوبات في الظروف القاسية، مثل الإضاءة الشديدة. تحدد الورقة تقنيات دمج الأنماط المتعددة كأكثر الاتجاهات الواعدة، حيث تجمع بين نقاط القوة في أنماط مختلفة لتعزيز دقة الكشف والموثوقية.

يعالج النموذج المقترح، تعلم النقل الديناميكي عبر المجالات لمراقبة تعب السائق، عدم الكفاءة الحالية من خلال دمج تصميم معياري يتضمن مكونات مثل الشبكة العصبية المعادية متعددة الأنماط (MM-DANN) ومحولات دمج المستشعرات التكيفية (ASF-Transformer). تعمل هذه المكونات بشكل متكامل لضمان استخراج ميزات غير متعلقة بالمجال وتخصيص ديناميكي في الوقت الحقيقي، مما يحسن من قدرة النظام على التكيف مع السائقين الفرديين وظروف المستشعر المتغيرة. يتم تقييم فعالية الإطار من خلال إعداد تجريبي شامل، مما يظهر دقة عالية وزمن تأخير منخفض مناسب للتطبيقات في الوقت الحقيقي، مع معالجة التحديات المتعلقة بتنوع المستشعرات وجودة البيانات. يُقترح أن تركز الأعمال المستقبلية على تحسين الكفاءة الحسابية وتعزيز قابلية تفسير النموذج، لضمان إمكانية نشر النظام المقترح بفعالية في بيئات القيادة الديناميكية لتحسين سلامة الطرق.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-92701-6
PMID: https://pubmed.ncbi.nlm.nih.gov/40328844
Publication Date: 2025-05-06
Author(s): S. S. Aravinth et al.
Primary Topic: Sleep and Work-Related Fatigue

Overview

The research paper presents a novel Dynamic Cross-Domain Transfer Learning framework aimed at enhancing driver fatigue monitoring through the fusion of multi-modal sensor data. The framework addresses critical challenges such as domain generalization, sensor variability, and real-time personalization, which are prevalent in existing fatigue detection systems. Key innovations include the use of a domain adversarial neural network for feature extraction from EEG, ECG, and video inputs, achieving a domain adaptation gap of less than 5% and improving cross-domain accuracy by up to 15%. The ASF-Transformer facilitates effective cross-modal feature fusion, yielding accuracy improvements of 5-8% under modality dropout conditions. Additionally, the Gated Modality Selection Network (GMSN) dynamically assesses sensor quality, maintaining performance drops below 5% even with noisy or missing data, while the Online Personalized Fine-Tuning (OPFT) mechanism allows for real-time adaptation to individual drivers, enhancing accuracy by 5-7% within two hours.

Experimental evaluations on the SEED VIG and PhysioNet Driving Fatigue datasets demonstrate the framework’s robustness, achieving a mean cross-domain accuracy of 87.8%, significantly outperforming baseline methods. The GMSN maintained an average accuracy of 83.0% under 20% sensor dropout, while OPFT improved accuracy by 2.5% after two hours of deployment with minimal latency. The findings underscore the framework’s practical applicability in real-world driving conditions, offering substantial advancements in driver safety through early fatigue detection. Future research directions include the integration of additional modalities, adaptation for multi-driver contexts, development of lightweight edge-based architectures, and conducting longitudinal studies to enhance generalizability across diverse demographics and driving environments.

Introduction

The introduction of the paper reviews a decade’s worth of research on driver fatigue detection, highlighting the increasing urgency to enhance road safety and prevent accidents. It synthesizes findings from 50 recent studies that employ various physiological, behavioral, and multimodal data approaches, alongside advanced machine learning and deep learning techniques. Key methods discussed include EEG-based fatigue detection, eye movement analysis, and steering angle monitoring, with notable contributions from researchers such as Jiao et al. and Subasi et al., who achieved high reliability and accuracy in fatigue detection through innovative system designs.

The review emphasizes the predominance of physiological signals, such as heart rate variability (HRV) and electrodermal activity (EDA), in fatigue detection, while also acknowledging the growing interest in behavioral methods, particularly those utilizing vision-based techniques. For instance, Li et al. effectively modeled fatigue through steering wheel angle variations using recurrent neural networks (RNNs). Furthermore, the paper highlights the potential of multimodal approaches that integrate both physiological and behavioral signals, which have shown to enhance detection accuracy and robustness. Despite the advancements, the review notes limitations in generalizability and computational demands of certain methods, indicating a need for further research to optimize real-world applications of these fatigue detection systems.

Methods

The methods employed in this study highlight the challenges associated with experimental validation, particularly in the context of fatigue recognition using electroencephalography (EEG). The research utilizes small EEG datasets, demonstrating effective fatigue detection even with minimal training data samples. However, it is noted that the accuracy of the fatigue recognition system diminishes when applied to larger-scale datasets, indicating potential limitations in the model’s generalizability and robustness across diverse data conditions. This suggests a need for further investigation and validation with more extensive datasets to enhance the reliability of the findings.

Discussion

The discussion section of the research paper highlights advancements and challenges in driver fatigue detection through various methodologies, particularly emphasizing the integration of machine learning (ML) and physiological monitoring techniques. Key findings indicate that while traditional methods, such as EEG-based systems, excel in capturing brainwave changes associated with fatigue, they often lack scalability and robustness in real-world applications. In contrast, computer vision (CV) approaches provide more deployable solutions but may struggle under adverse conditions, such as extreme lighting. The paper identifies multimodal fusion techniques as the most promising direction, combining the strengths of different modalities to enhance detection accuracy and robustness.

The proposed model, Dynamic Cross-Domain Transfer Learning for Driver Fatigue Monitoring, addresses existing inefficiencies by incorporating a modular design that includes components like the Multi-Modal Domain-Adversarial Neural Network (MM-DANN) and Adaptive Sensor Fusion Transformer (ASF-Transformer). These components work synergistically to ensure domain-invariant feature extraction and adaptive real-time personalization, thus improving the system’s adaptability to individual drivers and varying sensor conditions. The framework’s effectiveness is evaluated through a comprehensive experimental setup, demonstrating high accuracy and low latency suitable for real-time applications, while also addressing the challenges of sensor variability and data quality. Future work is suggested to focus on optimizing computational efficiency and enhancing model interpretability, ensuring the proposed system can be effectively deployed in dynamic driving environments to improve road safety.