التنبؤ المكاني الزمني لجودة المياه وتقييم المخاطر البيئية في حوض نهر باستخدام T-GCN بناءً على قيود طوبولوجيا شبكة النهر Spatiotemporal prediction of water quality and ecological risk assessment in a river basin using T-GCN based on river network topology constraints

المجلة: Scientific Reports، المجلد: 16، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-34722-9
PMID: https://pubmed.ncbi.nlm.nih.gov/41545421
تاريخ النشر: 2026-01-16
المؤلف: Li Li
الموضوع الرئيسي: جودة المياه وتقييم التلوث

نظرة عامة

تتناول ورقة البحث القيود المفروضة على طرق التنبؤ بجودة المياه الحالية التي غالبًا ما تتجاهل الخصائص الديناميكية لتدفق الشبكات النهرية، مما يؤدي إلى عدم الدقة في نمذجة هجرة الملوثات والتقييمات اللاحقة للمخاطر البيئية. للتغلب على هذه التحديات، يقترح المؤلفون نموذج شبكة الالتفاف الزمني (T-GCN) الذي يدمج القيود الطوبولوجية لشبكة الأنهار وآلية الاتصال الديناميكية المدفوعة بالتدفق. يلتقط هذا النموذج بفعالية تأثير الظروف الهيدرولوجية المتغيرة على نقل الملوثات، مما يعزز دقة توقعات جودة المياه في حوض النهر وموثوقية تقييمات المخاطر البيئية.

تظهر النتائج التجريبية أن نموذج T-GCN يتفوق بشكل كبير على النماذج التقليدية للرسوم البيانية الزمانية المكانية، مثل DCRNN وGraph WaveNet وAGCRN، عبر أربعة مؤشرات لجودة المياه: الأكسجين المذاب (DO) والنيتروجين الأموني (NH₃-N) والفوسفور (TP) وpH. حقق النموذج أخطاء متوسطة مربعة (MSE) أقل لهذه المؤشرات، مع قيمة R² تبلغ 0.884 لـ DO ومعاملات كابا عالية لتمييز المخاطر البيئية عند مستويات مختلفة من المخاطر (0.863 للمنخفض، 0.826 للمتوسط، و0.763 للمرتفع). تشير النتائج إلى أن T-GCN يوفر أداة قوية لإدارة أحواض المياه الذكية والوقاية من المخاطر البيئية، مما يقدم إطارًا كميًا لإدارة فعالة لموارد المياه وأنظمة الإنذار المبكر.

مقدمة

في المقدمة، تتناول الورقة التعقيد المتزايد لتلوث المياه وضرورة التنبؤ الدقيق بجودة المياه وتقييمات المخاطر البيئية في أحواض الأنهار. تنتقد نماذج التنبؤ بجودة المياه الحالية، التي تستخدم في الغالب طرقًا إحصائية أو هياكل تعلم عميقة عامة، لعدم قدرتها على نمذجة الاعتماد المكاني بفعالية على نطاق حوض النهر. يبرز المؤلفون أن العديد من الدراسات تستخدم مصفوفات مجاورة ثابتة أو روابط افتراضية تعتمد على المسافة الإقليدية، متجاهلة عوامل حاسمة مثل الاتجاه، وفارق الزمن، والاعتماد على التدفق، مما يؤدي إلى عدم الدقة في تمثيل عمليات نقل الملوثات.

تؤكد الورقة على الحاجة إلى نهج نمذجة يدمج الخصائص الطوبولوجية لشبكات الأنهار مع الديناميات الزمنية لتعزيز دقة توقعات جودة المياه وتقييمات المخاطر البيئية. تشير إلى تطور نمذجة جودة المياه في الأحواض من التقنيات الإحصائية التقليدية، مثل تحليل المكونات الرئيسية (PCA) وتحليل التجمعات (CA)، إلى نماذج أكثر تقدمًا تعتمد على البيانات، بما في ذلك الشبكات العصبية المتكررة (RNNs) ومتغيراتها. تشمل أمثلة التقدمات الحديثة استخدام شبكات الذاكرة طويلة وقصيرة الأجل (LSTM) مع آليات الانتباه ووحدات التكرار المغلقة (GRUs) لالتقاط الديناميات الزمنية لجودة المياه بشكل أفضل، مما يبرز التحول نحو طرق النمذجة الذكية في هذا المجال.

طرق

في هذا القسم، يصف المؤلفون المنهجية المستخدمة لتقييم فعالية نموذج T-GCN في توقع جودة المياه في الأحواض وتقييم المخاطر البيئية. تركز الدراسة على منطقتين محددتين: قسم تشونغتشينغ من نهر اليانغتسي وقسم ييبين من حوض نهر مينجيانغ، باستخدام بيانات من يناير 2022 إلى ديسمبر 2024. تم الحصول على بيانات جودة المياه، بما في ذلك معلمات مثل الأكسجين المذاب (DO) والنيتروجين الأموني (NH₃-N) والفوسفور الكلي (TP) وpH، من نظام المراقبة التلقائية لجودة المياه السطحية الوطنية وتقارير البيئة المحلية. تم الحصول على بيانات هيدرولوجية، بما في ذلك تدفق المياه اليومي وهطول الأمطار، من الوكالات الهيدرولوجية والأرصاد الجوية ذات الصلة. تم تحليل استخدام الأراضي في منطقة الدراسة باستخدام بيانات تغطية الأراضي العالمية Globeland30 لعام 2023، مما يكشف أن الأراضي الحرجية تسود (62.3%)، تليها الأراضي المزروعة (21.8%) وأراضي البناء (8.5%).

شملت معالجة البيانات محاذاة سلسلة مراقبة جودة المياه إلى شبكة زمنية يومية موحدة ومعالجة البيانات المفقودة من خلال طريقة استيفاء مقيدة جسديًا للحفاظ على الحفاظ على الكتلة. تم توحيد المقاييس باستخدام طريقة Z-score لضمان القابلية للمقارنة عبر مؤشرات مختلفة. تم رسم أقسام المراقبة إلى مقاطع نهرية، وتم بناء مصفوفة مجاورة ديناميكية بناءً على نموذج هيدرولوجي لحوض النهر. تم تقييم التعميم الزمني للنموذج باستخدام نهج تقييم بدء متداول، مع تحديد مجموعات التدريب والتحقق والاختبار عبر أرباع مختلفة من عام 2023. شمل الإعداد التجريبي استخدام مُحسِّن آدم، ومعدل تعلم قدره 0.001، واستراتيجية إيقاف مبكر لتقليل الإفراط في التكيف. تم تحسين المعلمات الرئيسية من خلال بحث الشبكة، مع تكوينات محددة لأبعاد الطبقات المخفية، وترتيب الانتشار، ومعدلات التسرب لتعزيز أداء النموذج.

مناقشة

تسلط قسم المناقشة في الورقة الضوء على قيود طرق نمذجة الاعتماد المكاني التقليدية في تقييم المخاطر البيئية وتوقع جودة المياه للأنهار الحضرية. تؤكد على الحاجة إلى منهجيات متقدمة تأخذ في الاعتبار الطوبولوجيا المعقدة وغير الإقليدية لأنظمة الأنهار. يقترح المؤلفون نهجًا جديدًا باستخدام شبكة الالتفاف الزمني (T-GCN) التي تدمج قيود طوبولوجيا شبكة الأنهار مع العمليات الهيدرولوجية الديناميكية. يقدم هذا النموذج مصفوفة مجاورة ديناميكية تعكس ظروف التدفق في الوقت الحقيقي، مما يسمح بتوقعات أكثر دقة لهجرة الملوثات وتقييم المخاطر البيئية.

تشمل الابتكارات الرئيسية لنموذج T-GCN دمج العلاقات المكانية الهرمية بين أقسام المراقبة، مما يعزز توصيف الاعتماد المكاني في مؤشرات جودة المياه. تعدل آلية مصفوفة المجاورة الديناميكية نسب تحويل التدفق وأوقات السفر بناءً على البيانات الهيدرولوجية الملاحظة، مما يعالج أوجه القصور في النماذج الثابتة. تظهر النتائج انخفاضًا في متوسط الخطأ المربع (MSE) ومتوسط الخطأ المطلق (MAE) بنسبة 3.0% و2.1%، على التوالي، مقارنة بالنماذج الحالية، مما يحسن من توقيت وموثوقية توقعات جودة المياه وتقييمات المخاطر البيئية في أحواض الأنهار. تؤكد هذه الدراسة على أهمية دمج القيود الفيزيائية والديناميات الهيدرولوجية في النمذجة الزمانية المكانية لتعزيز الفهم وإدارة جودة المياه في أنظمة الأنهار الحضرية.

القيود

تسلط قسم القيود في ورقة البحث الضوء على عدة مجالات حاسمة لتحسين نموذج T-GCN الديناميكي الذي تم تطويره لتحليل الاتصال في شبكة الأنهار وديناميات التلوث. أولاً، تعتمد دقة مصفوفة المجاورة الديناميكية على بيانات التدفق النادرة، مما يهدد دقة النموذج، خاصة في الروافد التي تفتقر إلى المراقبة المستمرة. قد تعيق هذه القيود حساسية النموذج للتغيرات الهيدرولوجية قصيرة الأجل. يُشجع البحث المستقبلي على دمج بيانات هيدرولوجية متعددة المصادر ونماذج موزعة لتعزيز اكتمال البيانات ودقتها.

بالإضافة إلى ذلك، يقدم هيكل النموذج تحديات، خاصة في اعتماده على LSTM لتوقعات التدفق، مما قد يؤدي إلى تراكم الأخطاء في مصفوفة المجاورة. للتخفيف من ذلك، يمكن أن تستكشف الأعمال المستقبلية نماذج توقع التدفق الهجينة التي تدمج المبادئ الفيزيائية. كما أن الالتفاف المكاني الثابت في T-GCN يحد من قابليته للتكيف مع مقاييس الأحداث المتغيرة، مما يشير إلى الحاجة إلى آليات انتشار أكثر مرونة. علاوة على ذلك، فإن قابلية تطبيق النموذج محدودة بسبب استبعاده لمداخل التلوث المفاجئة من المصادر غير النقطية وافتقاره إلى متغيرات التحكم لتدابير الإدارة. يجب أن تهدف التطورات المستقبلية إلى إنشاء هيكل توقع ثنائي النمط للأحداث غير الطبيعية وتوسيع النموذج ليشمل تحليل السيناريوهات لتقييم السياسات الفعالة واتخاذ القرارات.

Journal: Scientific Reports, Volume: 16, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-34722-9
PMID: https://pubmed.ncbi.nlm.nih.gov/41545421
Publication Date: 2026-01-16
Author(s): Li Li
Primary Topic: Water Quality and Pollution Assessment

Overview

The research paper addresses the limitations of existing water quality prediction methods that often overlook the dynamic flow characteristics of river networks, leading to inaccuracies in pollutant migration modeling and subsequent ecological risk assessments. To overcome these challenges, the authors propose a Temporal Graph Convolutional Network (T-GCN) model that integrates river network topological constraints and a flow-driven dynamic connection mechanism. This model effectively captures the influence of changing hydrological conditions on pollutant transport, enhancing the accuracy of watershed water quality predictions and the reliability of ecological risk assessments.

Experimental results demonstrate that the T-GCN model significantly outperforms traditional spatiotemporal graph models, such as DCRNN, Graph WaveNet, and AGCRN, across four water quality indicators: Dissolved Oxygen (DO), Ammonia Nitrogen (NH₃-N), Phosphorus (TP), and pH. The model achieved lower mean squared errors (MSE) for these indicators, with an R² value of 0.884 for DO and high Kappa coefficients for ecological risk discrimination at various risk levels (0.863 for low, 0.826 for medium, and 0.763 for high). The findings indicate that T-GCN provides a robust tool for smart watershed management and ecological risk prevention, offering a quantitative framework for effective water resources management and early warning systems.

Introduction

In the introduction, the paper addresses the growing complexity of water pollution and the necessity for accurate predictions of water quality and ecological risk assessments in river basins. It critiques existing water quality prediction models, which predominantly utilize statistical methods or general deep learning architectures, for their inability to effectively model spatial dependencies at the river basin scale. The authors highlight that many studies employ static adjacency matrices or virtual links based on Euclidean distance, neglecting critical factors such as directionality, time lag, and flow dependence, which leads to inaccuracies in representing pollutant transfer processes.

The paper emphasizes the need for a modeling approach that integrates the topological characteristics of river networks with temporal dynamics to enhance the accuracy of water quality predictions and ecological risk assessments. It notes the evolution of watershed water quality modeling from traditional statistical techniques, like principal component analysis (PCA) and cluster analysis (CA), to more advanced data-driven models, including recurrent neural networks (RNNs) and their variants. Examples of recent advancements include the use of Long Short-Term Memory (LSTM) networks with attention mechanisms and Gated Recurrent Units (GRUs) to better capture the temporal dynamics of water quality, underscoring the shift towards intelligent modeling methods in this field.

Methods

In this section, the authors describe the methodology employed to evaluate the T-GCN model’s effectiveness in predicting watershed water quality and assessing ecological risks. The study focuses on two specific regions: the Chongqing section of the Yangtze River and the Yibin section of the Minjiang River basin, utilizing data from January 2022 to December 2024. Water quality data, including parameters such as dissolved oxygen (DO), ammonia nitrogen (NH₃-N), total phosphorus (TP), and pH, were sourced from the National Surface Water Quality Automatic Monitoring and Real-time Release System and local environmental reports. Hydrological data, including daily streamflow and precipitation, were obtained from relevant hydrological and meteorological agencies. The study area’s land use was analyzed using the 2023 Globeland30 global land cover data, revealing that forest land predominates (62.3%), followed by cultivated land (21.8%) and construction land (8.5%).

The data preprocessing involved aligning the water quality monitoring series to a uniform daily time grid and addressing missing data through a physically constrained interpolation method to maintain mass conservation. The metrics were standardized using the Z-score method to ensure comparability across different indicators. The monitoring sections were mapped to river segments, and a dynamic adjacency matrix was constructed based on the watershed hydrological model. The model’s temporal generalization was assessed using a rolling start evaluation approach, with training, validation, and test sets defined across different quarters of 2023. The experimental setup included the use of the Adam optimizer, a learning rate of 0.001, and an early stopping strategy to mitigate overfitting. Key hyperparameters were optimized through grid search, with specific configurations for hidden layer dimensions, diffusion order, and dropout rates to enhance model performance.

Discussion

The discussion section of the paper highlights the limitations of traditional spatial dependency modeling methods in ecological risk assessment and water quality prediction for urban rivers. It emphasizes the need for advanced methodologies that account for the complex, non-Euclidean topology of river systems. The authors propose a novel approach using a Temporal Graph Convolutional Network (T-GCN) that integrates river network topology constraints with dynamic hydrological processes. This model introduces a dynamic adjacency matrix that reflects real-time flow conditions, allowing for more accurate predictions of pollutant migration and ecological risk assessment.

Key innovations of the T-GCN model include the incorporation of hierarchical spatial relationships among monitoring sections, which enhances the characterization of spatial dependencies in water quality indicators. The dynamic adjacency matrix mechanism adjusts flow diversion ratios and travel times based on observed hydrological data, addressing the shortcomings of static models. The results demonstrate a reduction in mean squared error (MSE) and mean absolute error (MAE) by 3.0% and 2.1%, respectively, compared to existing models, thereby improving the timeliness and reliability of water quality predictions and ecological risk assessments in river basins. This research underscores the importance of integrating physical constraints and hydrological dynamics in spatiotemporal modeling to enhance the understanding and management of water quality in urban river systems.

Limitations

The section on limitations in the research paper highlights several critical areas for improvement in the dynamic T-GCN model developed for analyzing river network connectivity and pollution dynamics. Firstly, the model’s reliance on sparse flow data compromises the accuracy of the dynamic adjacency matrix, particularly in tributaries lacking continuous monitoring. This limitation may hinder the model’s sensitivity to short-term hydrological changes. Future research is encouraged to integrate multi-source hydrological data and distributed models to enhance data completeness and accuracy.

Additionally, the model’s structure presents challenges, particularly in its dependency on LSTM for flow predictions, which can lead to error accumulation in the adjacency matrix. To mitigate this, future work could explore hybrid flow prediction models that incorporate physical principles. The fixed-depth spatial convolution in the T-GCN also restricts its adaptability to varying event scales, suggesting a need for more flexible diffusion mechanisms. Furthermore, the model’s applicability is limited by its exclusion of sudden pollution inputs from non-point sources and its lack of control variables for management measures. Future developments should aim to create a dual-modal prediction structure for abnormal events and extend the model to include scenario analysis for effective policy evaluation and decision-making.