PA-MPPI: التحكم في مسار التكامل التنبؤي المدرك للإدراك لملاحة الطائرات الرباعية في بيئات غير معروفة PA-MPPI: Perception-Aware Model Predictive Path Integral Control for Quadrotor Navigation in Unknown Environments

المجلة: IEEE Robotics and Automation Letters، المجلد: 11، العدد: 3
DOI: https://doi.org/10.1109/lra.2026.3662653
تاريخ النشر: 2026-02-09
المؤلف: Zhenyun Du وآخرون
الموضوع الرئيسي: خوارزميات تخطيط المسار الروبوتي

نظرة عامة

تتناول الأبحاث التحديات المتعلقة بملاحة الطائرات الرباعية في بيئات غير معروفة، لا سيما للتطبيقات مثل مهام البحث والإنقاذ. تشمل التحديات الرئيسية تخطيط المسار في المساحات غير المحدبة، والالتزام بديناميات الطائرات الرباعية، واستكشاف المناطق غير المرسومة. لقد أظهر أسلوب نموذج التنبؤ بمسار التكامل (MPPI) وعدًا في معالجة التحديين الأولين من خلال تحسين قائم على العينة، مما يدير بشكل فعال المساحات غير المحدبة ويقوم بتحسين التكاليف الخاصة بالطائرات الرباعية مثل استهلاك الطاقة. ومع ذلك، تكمن قيود MPPI في التحكم في التتبع، مما يقيد تحسين المسار إلى منطقة ضيقة حول مسار مرجعي، مما يعيق استكشاف المناطق غير المعروفة.

للتغلب على هذه القيود، يقدم المؤلفون MPPI المدرك (PA-MPPI)، الذي يدمج تكلفة مدفوعة بالإدراك تتكيف مع المسارات بناءً على أهداف الإدراك في الوقت الحقيقي. يوجه هذا النهج مسار الطائرة الرباعية نحو المناطق التي يمكن أن تكشف عن مناطق غير معروفة، مما يعزز المساحة القابلة للتنقل المرسومة ويزيد من فرص العثور على طرق بديلة نحو الهدف. تشير النتائج التجريبية إلى أن PA-MPPI، الذي يعمل بسرعة 50 هرتز، يتطابق مع أداء مخططي الملاحة المتقدمين في البيئات المعقدة. علاوة على ذلك، يظهر PA-MPPI فائدته كسياسة عمل قوية لنماذج أساس الملاحة، التي غالبًا ما تولد أوضاع أهداف غير قابلة للوصول. ستهدف الأبحاث المستقبلية إلى توسيع قدرات PA-MPPI لآفاق ملاحة أطول واستكشاف قيود تخطيط المسار الخاصة به.

طرق

توضح قسم المنهجية مجموعة التحكم المتكاملة المستخدمة في الدراسة، كما هو موضح في الشكل 2. تعمل الخوارزمية المقترحة، المسماة PA-MPPI، عن طريق تلقي شبكة إشغال من وحدة الإدراك، والتي تستخدمها لتحسين تسلسلات التحكم لتنفيذ المسار بواسطة الطائرة الرباعية. تستند الخوارزمية إلى نموذج ديناميكي خاص بالطائرات الرباعية، كما هو مفصل في القسم IV-A.

يتم تقديم مزيد من الشرح حول صياغة MPPI في القسم IV-B، بينما تركز الأقسام IV-C و IV-D على مكونات الإدراك والتخطيط، بالإضافة إلى تعريف دالة التكلفة المستخدمة في عملية التحسين. يضمن هذا النهج المنظم أن الطائرة الرباعية يمكن أن تتنقل بفعالية في بيئتها من خلال دمج الإدراك مع استراتيجيات التحكم.

نقاش

يسلط قسم النقاش في الورقة الضوء على دمج أهداف الإدراك في استراتيجيات الملاحة، من خلال تطوير إطار عمل للتحكم التنبؤي المدرك (MPC). يعزز هذا النهج قدرة الروبوت على التنقل في البيئات المعقدة من خلال محاذاة وجهة نظره مع الميزات ذات الصلة، مما يزيد من الرؤية ويضمن السلامة أثناء المناورات. يؤكد المؤلفون على أهمية اعتبار المساحات غير الملاحظة كأماكن محتملة الخطورة، مما يؤثر على عملية التخطيط. يتم مقارنة المخطط المتقدم، SUPER، بالخوارزمية المقترحة للتحكم التنبؤي المدرك (PA-MPPI)، التي تدمج تكاليف الإدراك لتحسين الملاحة دون الاعتماد على المراجع الخارجية.

تناقش القسم أيضًا الخوارزميات التي تدعم التحكم التنبؤي، مميزة بين الطرق المعتمدة على المشتقات والطرق المعتمدة على العينة مثل MPPI. تعتبر الأخيرة مفيدة بشكل خاص للتعامل مع الأهداف غير المحدبة وغير السلسة، مما يجعلها مناسبة للبيئات الديناميكية. تم تصميم خوارزمية PA-MPPI للعمل بفعالية في الإعدادات المزدحمة من خلال الاستفادة من صور العمق للتخطيط في الوقت الحقيقي وتحسين المسار. تختتم المناقشة بتأمل في قيود التنفيذ الحالي، مثل الحاجة إلى ملاحظات أولية جيدة والقيود المفروضة من الموارد الحاسوبية، مما يقترح طرقًا للبحث المستقبلي، بما في ذلك الاستخدام المحتمل لـ LIDAR لتحسين قدرات الملاحة.

Journal: IEEE Robotics and Automation Letters, Volume: 11, Issue: 3
DOI: https://doi.org/10.1109/lra.2026.3662653
Publication Date: 2026-02-09
Author(s): Zhenyun Du et al.
Primary Topic: Robotic Path Planning Algorithms

Overview

The research addresses the challenges of quadrotor navigation in unknown environments, particularly for applications like search-and-rescue missions. Key challenges include path-planning in non-convex spaces, adhering to quadrotor dynamics, and exploring uncharted areas. The Model Predictive Path Integral (MPPI) method has shown promise in tackling the first two challenges through sampling-based optimization, effectively managing non-convex spaces and optimizing quadrotor-specific costs such as energy consumption. However, MPPI’s limitation lies in its tracking control, which restricts trajectory optimization to a narrow vicinity around a reference path, hindering exploration of unknown regions.

To overcome these limitations, the authors introduce Perception-Aware MPPI (PA-MPPI), which incorporates a perception-driven cost that adapts trajectories based on real-time perception objectives. This approach biases the quadrotor’s path towards areas that can reveal unknown regions, thereby enhancing the mapped traversable space and increasing the chances of finding alternative routes to the goal. Experimental results indicate that PA-MPPI, operating at 50 Hz, matches the performance of state-of-the-art navigation planners in complex environments. Furthermore, PA-MPPI demonstrates its utility as a robust action policy for navigation foundation models, which often generate unreachable goal poses. Future research will aim to extend PA-MPPI’s capabilities for longer navigation horizons and explore its path-planning limitations.

Methods

The methodology section outlines the integrated control stack utilized in the study, as depicted in Figure 2. The proposed algorithm, termed PA-MPPI, operates by receiving an occupancy grid from the perception module, which it uses to optimize control sequences for trajectory execution by a quadrotor. The algorithm is grounded in a dynamics model specific to quadrotors, detailed in Section IV-A.

Further elaboration on the MPPI formulation is provided in Section IV-B, while Sections IV-C and IV-D focus on the perception and mapping components, as well as the definition of the cost function employed in the optimization process. This structured approach ensures that the quadrotor can effectively navigate its environment by integrating perception with control strategies.

Discussion

The discussion section of the paper highlights the integration of perception objectives into navigation strategies, specifically through the development of a perception-aware Model Predictive Control (MPC) framework. This approach enhances the robot’s ability to navigate complex environments by aligning its viewpoint with relevant features, thereby maximizing visibility and ensuring safety during maneuvers. The authors emphasize the importance of treating unobserved spaces as potentially hazardous, which informs the planning process. The state-of-the-art planner, SUPER, is compared to the proposed perception-aware Model Predictive Path Integral (PA-MPPI) algorithm, which incorporates perception costs to optimize navigation without relying on external references.

The section also discusses the algorithms underpinning Model Predictive Control, contrasting derivative-based methods with sampling-based approaches like MPPI. The latter is particularly advantageous for handling nonconvex and nonsmooth objectives, making it suitable for dynamic environments. The PA-MPPI algorithm is designed to operate effectively in cluttered settings by leveraging depth images for real-time mapping and trajectory optimization. The discussion concludes with a reflection on the limitations of the current implementation, such as the need for good initial observations and the constraints imposed by computational resources, suggesting avenues for future research, including the potential use of LIDAR for improved navigation capabilities.