تحديد عدم اليقين وانتشاره في الاستدلال البايزي القائم على البدائل Uncertainty quantification and propagation in surrogate-based Bayesian inference

المجلة: Statistics and Computing، المجلد: 35، العدد: 3
DOI: https://doi.org/10.1007/s11222-025-10597-8
تاريخ النشر: 2025-03-13
المؤلف: Philipp Reiser وآخرون
الموضوع الرئيسي: تصميم هندسي احتمالي وقوي

نظرة عامة

تقدم الورقة إطار عمل بايزي قابل للتوسع لنمذجة البدائل يركز على قياس عدم اليقين، وانتشاره، والتحقق منه. تعمل نماذج البدائل كتقريب لمحاكاة معقدة، ويبرز المؤلفون أهمية الأخذ بعين الاعتبار عدم اليقين الناجم عن ميزانيات المحاكاة المحدودة وأخطاء تقريب البدائل. يقدمون ثلاث طرق للاستدلال البايزي باستخدام نماذج البدائل، موضحين نهجهم من خلال دراسات حالة تشمل سيناريوهات حقيقية خطية وغير خطية. تشير النتائج إلى أن انتشار عدم اليقين الفعال يعزز موثوقية نماذج البدائل، وهو أمر حاسم لاتخاذ قرارات مستنيرة عبر تطبيقات متنوعة.

في الخاتمة، يحدد المؤلفون إجراءً من خطوتين للاستدلال البايزي القائم على البدائل الذي ينشر بفعالية كل من عدم اليقين العشوائي والمعرفي من مرحلة تدريب البدائل إلى استدلال البيانات الحقيقية. يقترحون توسيع المعايرة القائمة على المحاكاة لتقييم معايرة عدم اليقين للاستدلال الناتج. ومن الجدير بالذكر أن طريقتين لانتشار عدم اليقين، E-Lik وE-Post، أظهرتا تحسينات كبيرة في المعايرة مقارنة بالطرق التقليدية التي تتجاهل عدم اليقين. يناقش المؤلفون أيضًا آثار نتائجهم على المشكلات عالية الأبعاد ويقترحون اتجاهات بحث مستقبلية، بما في ذلك تحسين نمذجة أخطاء تقريب البدائل واستكشاف تقنيات استدلال بديلة لمعالجة التحديات الحسابية في التطبيقات واسعة النطاق.

مقدمة

تؤكد مقدمة هذه الورقة البحثية على أهمية المحاكاة في العلوم الطبيعية والهندسة، لا سيما لفهم الأنظمة المعقدة، والتنبؤ بالسيناريوهات المستقبلية، وإجراء تقييمات المخاطر. غالبًا ما تعتمد المحاكاة التقليدية على معادلات تفاضلية عادية وجزئية معقدة تفتقر إلى حلول مغلقة، مما يتطلب طرق عددية مكلفة. للتخفيف من الأعباء الحسابية، ظهرت نماذج البدائل، التي تقدم تقريبًا فعالًا من خلال تقنيات مثل توسيع الفوضى المتعددة الحدود، والعمليات الغاوسية، والشبكات العصبية. لقد اكتسبت هذه النماذج زخمًا عبر مجالات متنوعة، بما في ذلك الهيدرولوجيا، وديناميكا السوائل، وتنبؤ المناخ، وعلم الأحياء النظامي.

على الرغم من التقدم في نمذجة البدائل، لا يزال هناك تحدٍ حاسم يتمثل في ضمان موثوقية هذه النماذج، لا سيما فيما يتعلق بقياس عدم اليقين (UQ). تبرز الورقة أهمية انتشار عدم اليقين (UP) في معالجة عدم اليقين في البدائل، خاصة في كل من المشكلات الأمامية والعكسية. تشير إلى أن الطرق الحالية غالبًا ما تبسط أو تتجاهل عدم اليقين في معلمات البدائل، مما يؤدي إلى عدم دقة محتملة في مهام الاستدلال. يهدف المؤلفون إلى معالجة هذه القضايا من خلال اقتراح إطار بايزي شامل لـ UP في المشكلات العكسية الاحتمالية، مع التركيز على معالجة أكثر صرامة لعدم اليقين وتعزيز الأساليب التشخيصية للتحقق من نتائج الاستدلال.

النتائج

في قسم النتائج، يقدم المؤلفون تحليلًا مقارنًا لأربع طرق مختلفة من I-Step بناءً على I-posteriors الخاصة بهم، باستخدام معلمات مختلفة لخطأ تقريب البدائل $\sigma_A = \{0.1, 0.5, 1\}$. تشير النتائج، الملخصة في الجدول 2 والمصورة في الشكل 3، إلى أن طريقة Point تعطي نتائج غير متأثرة بالانحراف المعياري لـ T-likelihood، حيث إنها لا تأخذ في الاعتبار عدم اليقين المعرفي T. على العكس، تظهر طريقتا E-Lik وE-Post زيادة في عدم اليقين في I-posteriors الخاصة بهما مع ارتفاع عدم اليقين T-step، مما يعكس تراجع موثوقية نموذج البديل. ومن الجدير بالذكر أن طريقة E-Log-Lik تظهر سلوكًا غير متوقع حيث يصبح I-posterior أكثر يقينًا على الرغم من زيادة عدم اليقين في T-posterior، كما تدعمه المعادلة (39).

تحليل إضافي، موضح في الشكل 8، يحدد المعلمات المدخلة الحقيقية إلى $\omega^*_I = (\beta^*, \gamma^*) = (1.6, 0.4)$، مما يكشف أن كل من طريقتي Point وE-Log-Lik تعطيان I-posteriors مفرطة الثقة في بعد واحد. في المقابل، تحافظ طريقتا E-Lik وE-Post على كتلة احتمالية كبيرة حول قيم المعلمات الحقيقية. تظهر الرسوم البيانية لتوزيعات I-posterior المشتركة أنه بينما تفشل Point وE-Log-Lik في تضمين القيمة الحقيقية $\omega^*_I$، فإن E-Post وE-Lik تغطيانها بفعالية. إن زيادة عدم اليقين الملحوظة في E-Post وE-Lik تعكس تلك الموجودة في T-posterior، والتي تعزى إلى البيانات التدريبية المحدودة المتاحة لنموذج البديل.

المناقشة

في هذا القسم، يقترح المؤلفون إطار عمل شامل لانتشار عدم اليقين (UP) في الاستدلال البايزي القائم على البدائل، القابل للتطبيق على نماذج بدائل متنوعة مثل الانحدار الخطي، وتوسيع الفوضى المتعددة الحدود، والعمليات الغاوسية، والشبكات العصبية. يتكون الإطار من إجراء من خطوتين: خطوة تدريب البديل (T-Step)، حيث يتم توليد بيانات التدريب من محاكي معقد، وخطوة الاستدلال (I-Step)، حيث يتم استخدام نموذج البديل المدرب لاستدلال الكميات ذات الأهمية. يميز المؤلفون بين عدم اليقين العشوائي (غير القابل للاختزال) والمعرفي (القابل للاختزال) ويُلخصون عدم اليقين ذي الصلة في جدول.

تشمل خطوة T-Step تدريب نموذج بديل باستخدام بيانات من محاكي، قد يكون معقدًا وربما عشوائيًا. يهدف نموذج البديل إلى تقريب المحاكي مع كونه فعالًا من الناحية الحسابية. يقدم المؤلفون خطأ تقريب البدائل لأخذ قيود نموذج البديل في الاعتبار، مما يؤدي إلى توزيع مشترك لاحق على معلمات البدائل. تركز خطوة I-Step على استدلال المعلمات المجهولة باستخدام بيانات القياس من العالم الحقيقي، حيث يحل نموذج البديل محل المحاكي المعقد. يقدم المؤلفون أربع طرق لانتشار عدم اليقين—تقدير النقطة، والاحتمال المتوقع (E-Post)، والاحتمال المتوقع (E-Lik)، والاحتمال المتوقع للوغاريتم (E-Log-Lik)—لاشتقاق الاحتمال القائم على البدائل. تتضمن كل طريقة عدم اليقين من خطوة T-Step بشكل مختلف، حيث تستفيد E-Post وE-Lik من الاحتمال الكامل أو العينات من الاحتمال، بينما تدمج E-Log-Lik عبر اللوغاريتم، مما يقدم نهجًا جديدًا لانتشار عدم اليقين. تختتم القسم بمناقشة حول الكفاءة الحسابية، والتوازي، وأهمية معايرة عدم اليقين في الاستدلال البايزي.

Journal: Statistics and Computing, Volume: 35, Issue: 3
DOI: https://doi.org/10.1007/s11222-025-10597-8
Publication Date: 2025-03-13
Author(s): Philipp Reiser et al.
Primary Topic: Probabilistic and Robust Engineering Design

Overview

The paper presents a scalable Bayesian framework for surrogate modeling that emphasizes uncertainty quantification, propagation, and validation. Surrogate models serve as approximations for complex simulations, and the authors highlight the importance of accounting for uncertainty arising from limited simulation budgets and surrogate approximation errors. They introduce three methods for Bayesian inference with surrogate models, demonstrating their approach through case studies involving both linear and nonlinear real-world scenarios. The findings indicate that effective uncertainty propagation enhances the reliability of surrogate models, which is crucial for informed decision-making across various applications.

In the conclusion, the authors outline a two-step procedure for surrogate-based Bayesian inference that effectively propagates both aleatoric and epistemic uncertainties from the surrogate training phase to real-data inference. They propose an extension of simulation-based calibration to assess the uncertainty calibration of the resulting inference. Notably, two methods for uncertainty propagation, E-Lik and E-Post, showed significant improvements in calibration compared to traditional methods that neglect uncertainty. The authors also discuss the implications of their findings for high-dimensional problems and suggest future research directions, including refining the modeling of surrogate approximation errors and exploring alternative inference techniques to address computational challenges in large-scale applications.

Introduction

The introduction of this research paper emphasizes the significance of simulations in natural sciences and engineering, particularly for understanding complex systems, predicting future scenarios, and conducting risk assessments. Traditional simulations often rely on complex ordinary and partial differential equations that lack closed-form solutions, necessitating costly numerical methods. To mitigate computational burdens, surrogate models have emerged, offering efficient approximations through techniques such as polynomial chaos expansion, Gaussian processes, and neural networks. These models have gained traction across various fields, including hydrology, fluid dynamics, climate prediction, and systems biology.

Despite advancements in surrogate modeling, a critical challenge remains in ensuring the trustworthiness and reliability of these models, particularly concerning uncertainty quantification (UQ). The paper highlights the importance of uncertainty propagation (UP) in addressing surrogate uncertainties, especially in both forward and inverse problems. It notes that existing methods often simplify or neglect the uncertainty of surrogate parameters, leading to potential inaccuracies in inference tasks. The authors aim to tackle these issues by proposing a comprehensive Bayesian framework for UP in probabilistic inverse problems, focusing on a more rigorous treatment of uncertainties and enhancing diagnostic methods to validate inference outcomes.

Results

In the results section, the authors present a comparative analysis of four different I-Step methods based on their I-posteriors, utilizing varying surrogate approximation error parameters $\sigma_A = \{0.1, 0.5, 1\}$. The findings, summarized in Table 2 and illustrated in Figure 3, indicate that the Point method yields results that are unaffected by the standard deviation of the T-likelihood, as it does not account for T-epistemic uncertainty. Conversely, the E-Lik and E-Post methods exhibit increased uncertainty in their I-posteriors as the T-step uncertainty rises, reflecting the diminishing trustworthiness of the surrogate model. Notably, the E-Log-Lik method displays an unexpected behavior where its I-posterior becomes more certain despite increasing uncertainty in the T-posterior, as supported by Equation (39).

Further analysis, depicted in Figure 8, sets the true input parameters to $\omega^*_I = (\beta^*, \gamma^*) = (1.6, 0.4)$, revealing that both the Point and E-Log-Lik methods yield overconfident I-posteriors in one dimension. In contrast, E-Lik and E-Post methods maintain substantial probability mass around the true parameter values. The scatter plots of the joint I-posterior distributions show that while Point and E-Log-Lik fail to encompass the true value $\omega^*_I$, E-Post and E-Lik effectively cover it. The heightened uncertainty observed in E-Post and E-Lik mirrors that of the T-posterior, attributed to the limited training data available for the surrogate model.

Discussion

In this section, the authors propose a comprehensive framework for uncertainty propagation (UP) in surrogate-based Bayesian inference, applicable to various surrogate models such as linear regression, polynomial chaos expansion, Gaussian processes, and neural networks. The framework consists of a two-step procedure: the Surrogate-Training Step (T-Step), where training data is generated from a complex simulator, and the Inference Step (I-Step), where the trained surrogate model is utilized to infer quantities of interest. The authors differentiate between aleatoric (irreducible) and epistemic (reducible) uncertainties and summarize the relevant uncertainties in a table.

The T-Step involves training a surrogate model using data from a simulator, which may be complex and potentially stochastic. The surrogate model aims to approximate the simulator while being computationally efficient. The authors introduce surrogate approximation error to account for the limitations of the surrogate model, leading to a joint posterior distribution over surrogate parameters. The I-Step focuses on inferring unknown parameters using real-world measurement data, where the surrogate model replaces the complex simulator. The authors present four uncertainty propagation methods—Point Estimate, Expected-Posterior (E-Post), Expected-Likelihood (E-Lik), and Expected-Log-Likelihood (E-Log-Lik)—to derive the surrogate-based posterior. Each method incorporates uncertainty from the T-Step differently, with E-Post and E-Lik leveraging the full posterior or samples from the posterior, while E-Log-Lik integrates over the log-likelihood, offering a novel approach to uncertainty propagation. The section concludes with a discussion on computational efficiency, parallelization, and the importance of uncertainty calibration in Bayesian inference.