تحسين توليدي تطوري: نحو تحسين تطوري مدفوع بالكامل بالبيانات عبر التعلم التوليدي Evolutionary Generative Optimization: Towards Fully Data-Driven Evolutionary Optimization via Generative Learning

المجلة: IEEE Transactions on Evolutionary Computation
DOI: https://doi.org/10.1109/tevc.2026.3664432
تاريخ النشر: 2026-01-01
المؤلف: Zhenyun Du وآخرون
الموضوع الرئيسي: خوارزميات تحسين متعددة الأهداف المتقدمة

نظرة عامة

يقدم القسم تحسين الجيل التطوري (EvoGO)، وهو إطار عمل جديد يعتمد على البيانات يهدف إلى تعزيز الخوارزميات التطورية (EAs) من خلال القضاء على الاعتماد على المشغلين المصنوعين يدويًا. يعمل EvoGO من خلال ثلاث مراحل رئيسية: إعداد البيانات، تدريب النموذج، وتوليد السكان. في مرحلة إعداد البيانات، يتم بناء مجموعة بيانات ثنائية لتعزيز تنوع التدريب دون تكبد تكاليف تقييم إضافية. تستخدم مرحلة تدريب النموذج نموذجًا توليديًا مصممًا خصيصًا يحول الحلول الضعيفة إلى حلول متفوقة، بينما تستخدم مرحلة توليد السكان آلية توليد قابلة للتوسع لاستبدال المشغلين التقليديين للتكاثر. تشير التجارب الواسعة عبر معايير مختلفة إلى أن EvoGO يحقق التقارب في غضون 10 أجيال فقط، متفوقًا بشكل كبير على EAs التقليدية، وتحسين بايزي، وطرق التعلم المعزز.

في الختام، يعالج EvoGO القيود الحرجة للنماذج القائمة على الجيل الحالية، مثل الاعتماد على الاستدلالات المصنوعة يدويًا وعدم التوافق بين النماذج التوليدية والأهداف التطورية. من خلال دمج بنية توليد مزدوجة واستخدام دالة خسارة مركبة تشمل إعادة البناء والتوجيه الاتجاهي، يسهل EvoGO التعلم القوي من البداية إلى النهاية من أجل التحسين. يتم تعزيز أداء الإطار بشكل أكبر من خلال استدلال متوازي باستخدام وحدات معالجة الرسوميات، مما يحقق تسريعًا يصل إلى 134× في المهام المعقدة. ستركز الأبحاث المستقبلية على تحسين المتانة في سيناريوهات البيانات المنخفضة واستكشاف الجوانب النظرية للتقارب وديناميات التحسين، بالإضافة إلى الامتدادات المحتملة للتدريب المسبق غير المتصل ونقل المهام لتعزيز كفاءة البيانات.

مقدمة

في مقدمة هذه الورقة البحثية، يتناول المؤلفون التحديات المرتبطة بمهام التحسين في العالم الحقيقي التي تعتمد على المحاكاة، والبرامج القديمة، أو التجارب الفيزيائية، والتي غالبًا ما تكون غير تحليلية، وصاخبة، وخالية من التدرجات. يبرزون فعالية الخوارزميات التطورية (EAs) في معالجة مثل هذه المشكلات بسبب قدراتها على البحث العالمي وعدم حساسيتها للتفاضلية. ومع ذلك، تكافح EAs التقليدية لالتقاط المعلومات الهيكلية وإعادة استخدام البيانات عبر الأجيال. لتحسين الكفاءة، انتقلت الأبحاث اللاحقة بـ EAs إلى نموذج قائم على التوزيع، كما يتضح من استراتيجيات التطور (ES) وخوارزميات تقدير التوزيع (EDAs)، التي تستخدم النمذجة الإحصائية لتحديث السكان.

يقترح المؤلفون إطار عمل جديد يسمى تحسين الجيل التطوري (EvoGO)، والذي يهدف إلى تعزيز التحسين التطوري من خلال التعلم التوليدي. يقدم EvoGO بنية توليد مزدوجة تستبدل الاستدلالات التقليدية بنموذج تعاوني أمامي وعكسي لتوليد حلول مرشحة عالية الجودة. بالإضافة إلى ذلك، يستخدم دالة خسارة مصممة خصيصًا توجه العملية التوليدية نحو المناطق الواعدة من فضاء البحث واستراتيجية تركيب عينات قائمة على التعلم لاستقرار تدريب النموذج تحت ميزانيات تقييم محدودة. توضح الورقة هيكل الأقسام التالية، التي ستراجع الأساليب الحالية للتحسين، وتفصل النموذج والاستراتيجيات المقترحة، وتقدم تقييمات تجريبية.

طرق

تقيم إعدادات التجربة في هذه الدراسة أداء EvoGO عبر ثلاث فئات من تحديات التحسين: الوظائف العددية ذات الأبعاد المنخفضة والعالية، ومهام التحكم الكلاسيكية، ومهام التحكم الروبوتية. تشمل مهام التحكم الكلاسيكية الدفع والمركبة، بينما تم تكييف الهبوط، والمشاة، والنملة من مكتبة Gym. تم تعيين الحد الأقصى لعدد تقييمات اللياقة (FEs) إلى 1,000 للمهام ذات الأبعاد المنخفضة و10,000 للمهام ذات الأبعاد العالية. يتم مقارنة EvoGO مع طرق أساسية مختلفة، بما في ذلك خوارزميات التحسين البايزية (مثل Hyperband، TuRBO)، واستراتيجيات التطور (CMA-ES)، وطرق الاستدلال الكلاسيكية (Nelder Mead)، عبر 20 تشغيلًا مستقلًا لضمان المتانة.

تستفيد تنفيذ EvoGO من JAX للحساب الفعال على وحدة معالجة رسومات واحدة، مع عبء موازنة أقل من 5% من إجمالي وقت التشغيل. يتم إنشاء مجموعة البيانات باستخدام أخذ عينات من الهيبركيوب اللاتيني وتتطور من خلال عملية خوارزمية محددة تتضمن خطوة فرز بناءً على قيم الوظائف. يستخدم نموذج العملية الغاوسية (GP) دالة التغاير Matérn، مع إجراء التحسين عبر دالة خسارة الاحتمالية الهامشية ومُحسِّن آدم على مدى 1,000 حقبة. تستكشف الدراسة أيضًا تأثير زيادة البيانات باستخدام مشفرات تلقائية متغيرة (VAEs)، مما يظهر أن الزيادة المعتدلة تعزز الأداء بشكل كبير، بينما يمكن أن تؤدي الزيادة المفرطة إلى تدهور النتائج. بالإضافة إلى ذلك، تم تأكيد متانة EvoGO ضد هياكل الشبكات العصبية المختلفة، حيث أسفرت التكوينات القياسية عن النتائج الأكثر فعالية.

مناقشة

تسلط قسم المناقشة في الورقة البحثية الضوء على الانتقال من الأساليب غير المعتمدة على البيانات إلى الأساليب المعتمدة على البيانات في الحوسبة التطورية (EC)، مع التأكيد على المزايا والقيود المختلفة للمنهجيات. تعتمد الخوارزميات التطورية غير المعتمدة على البيانات (EAs)، مثل الخوارزميات الجينية وتحسين سرب الجسيمات، على استدلالات ثابتة وضبط يدوي، مما قد يعيق قابليتها للتكيف وقابليتها للتوسع. في المقابل، تستفيد النماذج المعتمدة على البيانات، بما في ذلك تحسين بايزي (BO)، والخوارزميات التطورية المدعومة بالبدائل (SAEAs)، والخوارزميات التطورية القائمة على النمذجة التوليدية (GMEAs)، من المعلومات المتراكمة لتعزيز كفاءة التحسين. بينما يوفر BO أسسًا نظرية قوية، فإنه يكافح مع قابلية التوسع بسبب العبء الحسابي. تحسن SAEAs كفاءة العينة ولكن غالبًا ما تواجه تحيزات بسبب النمذجة المحلية. تظهر GMEAs، التي تركز على تعلم توزيعات الحلول الواعدة، إمكانات ولكنها لا تزال تعتمد على مكونات استدلالية وتفتقر إلى التكامل الفعال للنماذج التوليدية في عملية التحسين.

يجادل المؤلفون بضرورة وجود إطار عمل موحد للتعلم من البداية إلى النهاية يدمج تعلم التمثيل، وتوليد الحلول، وتقييم اللياقة. يجب أن يقضي هذا الإطار على الاعتماد على الآليات المصنوعة يدويًا، ويوافق بين أهداف التعلم وأهداف التحسين العالمية، ويعظم كفاءة البيانات وقابلية التكيف. من خلال معالجة هذه التحديات، تهدف الأبحاث إلى تقدم مجال الحوسبة التطورية، مما يسهل حلول تحسين أكثر قوة وقابلية للتوسع.

القيود

يسلط القسم الخاص بقيود خوارزمية EvoGO الضوء على عدة قيود حرجة تؤثر على فعاليتها في سيناريوهات التحسين عبر الإنترنت. تتمثل إحدى القيود الرئيسية في اعتماد الخوارزمية على توفر تدفقات بيانات عبر الإنترنت كافية؛ في البيئات التي تفتقر إلى البيانات، يمكن أن يؤدي هذا الاعتماد إلى أداء دون المستوى بسبب نقص المعلومات للتنبؤات الدقيقة والتعديلات الديناميكية. علاوة على ذلك، لا تستفيد التنفيذ الحالي من البيانات غير المتصلة، والتي يمكن أن تعزز دقة التدريب والتنبؤ، خاصة في التقاط الأنماط المعقدة والطويلة الأمد التي تتطلب بيانات تاريخية واسعة.

بالإضافة إلى ذلك، قد يواجه EvoGO صعوبة في المناظر الطبيعية للتحسين غير المنتظمة للغاية التي تتميز بأنماط غير متوقعة، حيث قد لا تكون طريقة إعداد البيانات الخاصة به كافية للتنقل الفعال. على الرغم من هذه القيود، تشير الأدلة التجريبية إلى أن EvoGO يؤدي بشكل كافٍ في معايير التحكم الكلاسيكية، مما يشير إلى أنه مناسب تمامًا للمناظر الطبيعية للتحسين الواقعي التي تظهر انتظامًا كافيًا. يجب أن تعالج الأعمال المستقبلية هذه القيود لتوسيع تطبيق الخوارزمية وتحسين متانتها عبر تحديات التحسين المتنوعة.

Journal: IEEE Transactions on Evolutionary Computation
DOI: https://doi.org/10.1109/tevc.2026.3664432
Publication Date: 2026-01-01
Author(s): Zhenyun Du et al.
Primary Topic: Advanced Multi-Objective Optimization Algorithms

Overview

The section presents Evolutionary Generative Optimization (EvoGO), a novel data-driven framework aimed at enhancing evolutionary algorithms (EAs) by eliminating reliance on handcrafted operators. EvoGO operates through three key stages: data preparation, model training, and population generation. In the data preparation stage, a pairwise dataset is constructed to enhance training diversity without incurring additional evaluation costs. The model training phase employs a tailored generative model that transforms inferior solutions into superior ones, while the population generation stage utilizes a scalable generative mechanism to replace traditional reproduction operators. Extensive experiments across various benchmarks indicate that EvoGO achieves convergence within just 10 generations, significantly outperforming traditional EAs, Bayesian optimization, and reinforcement learning methods.

In conclusion, EvoGO addresses critical limitations of existing generative model-based EAs, such as the dependence on handcrafted heuristics and misalignment between generative models and evolutionary objectives. By integrating a paired generative architecture and employing a composite loss function that includes reconstruction and directional guidance, EvoGO facilitates robust end-to-end learning for optimization. The framework’s performance is further enhanced by GPU-parallelized inference, achieving up to a 134× speedup on complex tasks. Future research will focus on improving robustness in low data scenarios and exploring theoretical aspects of convergence and optimization dynamics, as well as potential extensions for offline pre-training and cross-task transfer to boost data efficiency.

Introduction

In the introduction of this research paper, the authors address the challenges associated with real-world optimization tasks that rely on simulations, legacy software, or physical experiments, which are often nonanalytical, noisy, and gradient-free. They highlight the effectiveness of Evolutionary Algorithms (EAs) in tackling such problems due to their global search capabilities and insensitivity to differentiability. However, traditional EAs struggle to capture structural information and reuse data across generations. To improve efficiency, subsequent research has transitioned EAs to a distribution-based paradigm, exemplified by Evolution Strategies (ES) and Estimation of Distribution Algorithms (EDAs), which utilize statistical modeling for population updates.

The authors propose a novel framework called Evolutionary Generative Optimization (EvoGO), which aims to enhance evolutionary optimization through generative learning. EvoGO introduces a paired generative architecture that replaces traditional heuristics with a collaborative forward and inverse model to generate high-quality candidate solutions. Additionally, it employs a tailored loss function that guides the generative process toward promising areas of the search space and a learning-based sample synthesis strategy to stabilize model training under limited evaluation budgets. The paper outlines the structure of the subsequent sections, which will review existing optimization approaches, detail the proposed model and strategies, and present empirical evaluations.

Methods

The experimental setup of this study evaluates the performance of EvoGO across three categories of optimization challenges: low- and high-dimensional numerical functions, classical control tasks, and robotic control tasks. The classical control tasks include Pushing and Rover, while Landing, Walker, and Ant are adapted from the Gym library. The maximum number of fitness evaluations (FEs) is set to 1,000 for low-dimensional tasks and 10,000 for high-dimensional tasks. EvoGO is compared against various baseline methods, including Bayesian optimization algorithms (e.g., Hyperband, TuRBO), evolutionary strategies (CMA-ES), and classical heuristic methods (Nelder Mead), across 20 independent runs to ensure robustness.

EvoGO’s implementation leverages JAX for efficient computation on a single GPU, with a parallelization overhead of less than 5% of total runtime. The dataset is generated using Latin Hypercube Sampling and evolves through a defined algorithmic process that incorporates a sorting step based on function values. The Gaussian Process (GP) model utilizes the Matérn covariance function, with optimization conducted via Marginal Log Likelihood loss and the Adam optimizer over 1,000 epochs. The study also explores the impact of data augmentation using Variational Autoencoders (VAEs), demonstrating that moderate augmentation significantly enhances performance, while excessive augmentation can degrade results. Additionally, the robustness of EvoGO to different neural network architectures is confirmed, with the standard configuration yielding the most effective outcomes.

Discussion

The discussion section of the research paper highlights the transition from non-data-driven to data-driven approaches in evolutionary computation (EC), emphasizing the advantages and limitations of various methodologies. Non-data-driven evolutionary algorithms (EAs), such as genetic algorithms and particle swarm optimization, rely on fixed heuristics and manual tuning, which can hinder their adaptability and scalability. In contrast, data-driven paradigms, including Bayesian Optimization (BO), Surrogate-Assisted Evolutionary Algorithms (SAEAs), and Generative Modeling-based Evolutionary Algorithms (GMEAs), leverage accumulated information to enhance optimization efficiency. While BO offers strong theoretical foundations, it struggles with scalability due to computational overhead. SAEAs improve sample efficiency but often face biases due to local modeling. GMEAs, which focus on learning distributions of promising solutions, show potential but still depend on heuristic components and lack effective integration of generative models into the optimization process.

The authors argue for the necessity of a unified, end-to-end learning framework that integrates representation learning, solution generation, and fitness evaluation. This framework should eliminate reliance on handcrafted mechanisms, align learning objectives with global optimization goals, and maximize data efficiency and adaptability. By addressing these challenges, the research aims to advance the field of evolutionary computation, facilitating more robust and scalable optimization solutions.

Limitations

The section on limitations of the EvoGO algorithm highlights several critical constraints that impact its effectiveness in online optimization scenarios. A primary limitation is the algorithm’s dependence on the availability of sufficient online data streams; in data-scarce environments, this reliance can lead to suboptimal performance due to inadequate information for accurate predictions and dynamic adjustments. Furthermore, the current implementation does not leverage offline data, which could enhance training and predictive accuracy, particularly in capturing complex, long-term patterns that require extensive historical data.

Additionally, EvoGO may struggle in highly irregular optimization landscapes characterized by unpredictable patterns, as its data preparation approach may not suffice for effective navigation. Despite these limitations, empirical evidence indicates that EvoGO performs adequately in classic control benchmarks, suggesting that it is well-suited for typical real-world optimization landscapes that exhibit sufficient regularity. Future work should address these limitations to broaden the algorithm’s applicability and improve its robustness across diverse optimization challenges.