التعلم المنقول لتشتت النيوترينو: التكيف مع المجال باستخدام الشبكات التنافسية التوليدية Transfer learning for neutrino scattering: Domain adaptation with generative adversarial networks

المجلة: Physical review. D/Physical review. D.، المجلد: 113، العدد: 5
DOI: https://doi.org/10.1103/kwjj-wp1c
تاريخ النشر: 2026-02-02
المؤلف: J. L. Bonilla وآخرون
الموضوع الرئيسي: الفيزياء الفلكية والظواهر الكونية

نظرة عامة

في هذا القسم، يستكشف المؤلفون تطبيق التعلم بالنقل (TL) للاستفادة من شبكة الخصومة التوليدية (GAN) المدربة على بيانات تشتت النيوترينو-الكربون الاصطناعية للتنبؤ بالتفاعلات المتعلقة بالنيوترينو، وبشكل خاص عمليات النيوترينو-الأرجون والنيوترينو المضاد-الكربون. تدرس الدراسة مدى اتساق ديناميات اللبتون-النواة عبر أهداف وتفاعلات مختلفة، بالإضافة إلى فعالية TL عندما تأتي بيانات التدريب من نموذج تفاعل نيوترينو-نواة مختلف.

تشير النتائج إلى أن TL يلتقط بنجاح الميزات الأساسية لديناميات اللبتون، مثل قمم الكوانسيلاستيك و$\Delta$-الرنين، ويظهر أداءً متفوقًا مقارنة بالنماذج التوليدية المدربة من الصفر. ومن الجدير بالذكر أنه حتى مع مجموعات بيانات محدودة من 10,000 و100,000 حدث، يحافظ TL على دقة عالية، مما يبرز إمكاناته كإطار عمل قوي وفعال لنمذجة تفاعلات (مضاد) النيوترينو-نواة. هذه الطريقة مفيدة بشكل خاص لتطوير مولدات أحداث تشتت النيوترينو من الجيل التالي في السيناريوهات التي تكون فيها البيانات التجريبية محدودة.

مقدمة

تناقش مقدمة هذه الورقة البحثية التقدمات الكبيرة في كل من الدراسات التجريبية والنظرية لتفاعلات (مضاد) النيوترينو-نواة، خاصة في سياق تجارب تذبذب النيوترينو القادمة مثل هايبر-كاميكاندي وDUNE. أدت هذه الجهود إلى تطوير مولدات أحداث مونت كارلو (MC) التي تحاكي مثل هذه التفاعلات، على الرغم من أن هذه النماذج غالبًا ما تعتمد على تقريبات معقدة تحد من دقتها. لمعالجة هذه القيود، هناك اهتمام متزايد في الاستفادة من تقنيات الذكاء الاصطناعي (AI)، وبشكل خاص الشبكات التوليدية الخصومية (GANs)، لتعزيز محاكاة تفاعلات النيوترينو مع النوى.

يقدم المؤلفون عملهم في استخدام GANs لمحاكاة عمليات تشتت النيوترينو-الميوون ذات التيار المشحون (CC). يستكشفون قدرات التعميم للنماذج التوليدية العميقة من خلال التعلم بالنقل (TL)، موضحين أن نموذجًا مدربًا على بيانات تشتت الإلكترون-الكربون الشاملة يمكن تكييفه للتنبؤ بمقاطع العرض لمختلف النوى. تستخدم الدراسة نموذج GAN مدرب مسبقًا يعتمد على بيانات اصطناعية من مولد MC NuWro، بهدف التقاط الميزات العالمية لديناميات اللبتون-النواة. توضح الورقة نهج التكيف مع المجال لضبط النموذج لعمليات التشتت المختلفة، بما في ذلك تفاعلات النيوترينو-الميوون مع نوى الأرجون والكربون، وتبرز قدرة النموذج على الحفاظ على الهياكل الحركية الأساسية مع تحقيق دقة عالية مع بيانات تدريب محدودة. تؤكد هذه الدراسة على إمكانات TL كطريقة فعالة حسابيًا وقابلة للتفسير لتطوير نماذج تشتت (مضاد) النيوترينو-نواة.

النتائج

في هذا القسم، يقدم المؤلفون نتائج عددية من دراستهم حول التعلم بالنقل (TL) المطبق على الشبكات التوليدية الخصومية (GANs) لأحداث تشتت النيوترينو-الكربون الشاملة. قاموا بإنشاء نموذج أساسي مدرب على أربعة ملايين حدث عبر نطاق طاقة النيوترينو من 300 ميغا إلكترون فولت إلى 10 جيغا إلكترون فولت. وُجدت مقاييس الأداء لهذا النموذج الأساسي GAN منخفضة، مشابهة للنتائج السابقة. ثم استكشف المؤلفون سيناريوهين تدريبيين باستخدام مجموعات بيانات من 10,000 و100,000 حدث، كاشفين أن النماذج المدربة على مجموعة البيانات الأصغر واجهت صعوبات كبيرة، مما يعكس التحديات التي تطرحها البيانات التجريبية المحدودة. ومن الجدير بالذكر أن نهج TL أظهر ميزة ملحوظة، محققًا مقاييس أداء أفضل وتوافق أسرع مقارنة بالنماذج المدربة من الصفر.

تشير النتائج إلى أنه بينما يؤدي زيادة حجم مجموعة بيانات التدريب إلى تحسين أداء النموذج، فإن TL يتفوق باستمرار على النماذج المدربة من الصفر، خاصة في التقاط الهيكل الصحيح لتوزيعات الأحداث. على سبيل المثال، أعاد نموذج GAN المدرب باستخدام TL إنتاج القمم في توزيعات الكتلة الثابتة الهدرونية المعاد بناؤها بدقة، بينما فشل النموذج المدرب من الصفر في القيام بذلك عبر طاقات نيوترينو مختلفة. تؤكد النتائج على فعالية TL في تعزيز تعلم الأنماط العالمية في تفاعلات النيوترينو، مما يشير إلى إمكاناته في ضبط النماذج التوليدية باستخدام قياسات تجريبية حقيقية. يخلص المؤلفون إلى أنه حتى مع مجموعات بيانات أكبر، يظل TL متفوقًا، مما يبرز أهميته في سياق السيناريوهات ذات البيانات المحدودة النموذجية في الفيزياء التجريبية.

المناقشة

في هذا القسم، يناقش المؤلفون تطبيق التعلم بالنقل (TL) في محاكاة تفاعلات (مضاد) النيوترينو-نواة، مستفيدين من شبكة الخصومة التوليدية (GAN) المدربة في البداية على بيانات اصطناعية من مولد مونت كارلو NuWro. يسمح نهج TL بتكييف نموذج مدرب مسبقًا للتنبؤ بعمليات تشتت مختلفة مع الحد الأدنى من البيانات الإضافية، وهو ما يعد مفيدًا بشكل خاص نظرًا لندرة البيانات التجريبية في هذا المجال. يوضح المؤلفون أن GAN يمكن أن تولد نماذج فعالة لتشتت النيوترينو-الأرجون والنيوترينو المضاد-الكربون بناءً على نموذج مدرب على تفاعلات النيوترينو-الكربون، مما يبرز عالمية الميزات التي تعلمها GAN.

كما تؤكد المناقشة على بنية GAN، التي تشمل مولدًا ومميزًا، وكلاهما يتم ضبطه لاستيعاب بيانات جديدة مع الاحتفاظ بالأنماط الأساسية التي تم تعلمها من التدريب الأصلي. يذكر المؤلفون أن النماذج التي تم تهيئتها من GAN المدرب مسبقًا تتفوق باستمرار على تلك المدربة من الصفر، خاصة في السيناريوهات ذات البيانات المحدودة. تؤكد هذه الميزة في الأداء على إمكانات TL في تعزيز دقة المحاكاة لتفاعلات (مضاد) النيوترينو، مما يمهد الطريق لأبحاث مستقبلية تهدف إلى توسيع هذه الأساليب لتفاعلات الهدرونات النهائية الأكثر تعقيدًا.

Journal: Physical review. D/Physical review. D., Volume: 113, Issue: 5
DOI: https://doi.org/10.1103/kwjj-wp1c
Publication Date: 2026-02-02
Author(s): J. L. Bonilla et al.
Primary Topic: Astrophysics and Cosmic Phenomena

Overview

In this section, the authors explore the application of transfer learning (TL) to leverage a Generative Adversarial Network (GAN) trained on synthetic neutrino-carbon scattering data for predicting related neutrino interactions, specifically neutrino-argon and antineutrino-carbon processes. The study examines the extent to which lepton-nucleus dynamics are consistent across different targets and interactions, as well as the effectiveness of TL when the training data originates from a different neutrino-nucleus interaction model.

The results indicate that TL successfully captures essential features of lepton kinematics, such as the quasielastic and $\Delta$-resonance peaks, and demonstrates superior performance compared to generative models trained from scratch. Notably, even with limited datasets of 10,000 and 100,000 events, TL maintains high accuracy, underscoring its potential as a robust and efficient framework for modeling (anti)neutrino-nucleus interactions. This approach is particularly advantageous for developing next-generation neutrino-scattering event generators in scenarios where experimental data are limited.

Introduction

The introduction of this research paper discusses the significant advancements in both experimental and theoretical studies of (anti)neutrino-nucleus interactions, particularly in the context of upcoming neutrino oscillation experiments like Hyper-Kamiokande and DUNE. These efforts have led to the development of Monte Carlo (MC) event generators that simulate such interactions, although these models often rely on complex approximations that limit their accuracy. To address these limitations, there is increasing interest in leveraging artificial intelligence (AI) techniques, specifically generative adversarial networks (GANs), to enhance the simulation of neutrino interactions with nuclei.

The authors present their work on using GANs to simulate charged-current (CC) muon-neutrino scattering processes. They explore the generalization capabilities of deep generative models through transfer learning (TL), demonstrating that a model trained on inclusive electron-carbon scattering data can be adapted to predict cross sections for various nuclei. The study employs a pre-trained GAN model based on synthetic data from the NuWro MC generator, aiming to capture universal features of lepton-nucleus dynamics. The paper outlines a domain adaptation approach to fine-tune the model for different scattering processes, including muon-neutrino interactions with argon and carbon nuclei, and highlights the model’s ability to maintain essential kinematic structures while achieving high accuracy with limited training data. This research underscores the potential of TL as a computationally efficient and interpretable method for developing (anti)neutrino-nucleus scattering models.

Results

In this section, the authors present numerical results from their study on transfer learning (TL) applied to generative adversarial networks (GANs) for inclusive neutrino-carbon scattering events. They established a baseline model trained on four million events across a neutrino energy range of 300 MeV to 10 GeV. Performance metrics for this baseline GAN were found to be low, similar to previous findings. The authors then explored two training scenarios using datasets of 10,000 and 100,000 events, revealing that models trained on the smaller dataset struggled significantly, reflecting the challenges posed by limited experimental data. Notably, the TL approach demonstrated a marked advantage, achieving better performance metrics and faster convergence compared to models trained from scratch.

The results indicate that while increasing the training dataset size improves model performance, TL consistently outperforms the scratch-trained models, particularly in capturing the correct structure of event distributions. For instance, the GAN model trained with TL accurately reproduced the peaks in the reconstructed hadronic invariant mass distributions, while the scratch-trained model failed to do so across various neutrino energies. The findings underscore the effectiveness of TL in enhancing the learning of universal patterns in neutrino interactions, suggesting its potential for tuning generative models with real experimental measurements. The authors conclude that even with larger datasets, TL remains superior, highlighting its importance in the context of limited data scenarios typical in experimental physics.

Discussion

In this section, the authors discuss the application of transfer learning (TL) in simulating (anti)neutrino-nucleus interactions, leveraging a Generative Adversarial Network (GAN) initially trained on synthetic data from the NuWro Monte Carlo generator. The TL approach allows for the adaptation of a pre-trained model to predict various scattering processes with minimal additional data, which is particularly beneficial given the scarcity of experimental data in this field. The authors demonstrate that the GAN can effectively generate models for neutrino-argon and antineutrino-carbon scattering based on a model trained on neutrino-carbon interactions, highlighting the universality of the features learned by the GAN.

The discussion also emphasizes the architecture of the GAN, which includes a generator and discriminator, both of which are fine-tuned to accommodate new data while retaining the foundational patterns learned from the original training. The authors report that models initialized from the pre-trained GAN consistently outperform those trained from scratch, especially in scenarios with limited data. This performance advantage underscores the potential of TL in enhancing the accuracy of simulations for (anti)neutrino interactions, paving the way for future research that aims to extend these methods to more complex final-state hadron interactions.