إزالة الضباب من صور الاستشعار عن بُعد باستخدام شبكات الخصومة التوليدية المعتمدة على المويجات Remote sensing image dehazing using a wavelet-based generative adversarial networks

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-87240-z
PMID: https://pubmed.ncbi.nlm.nih.gov/39880836
تاريخ النشر: 2025-01-29
المؤلف: Guangda Chen وآخرون
الموضوع الرئيسي: تقنيات تحسين الصور

نظرة عامة

تقدم البحث طريقة جديدة لإزالة الضباب من صور الاستشعار عن بعد باستخدام الشبكات التنافسية التوليدية (GANs) للتخفيف من الآثار السلبية للضباب الجوي على جودة الصورة. تتكون البنية المقترحة من شبكة مولدة تعزز الصور الضبابية وشبكة مميزة تميز بين الصور التي تم إزالة الضباب عنها والصور الواضحة. تشمل المكونات الرئيسية للمولد كتل متبقية كثيفة لاستخراج الميزات الأساسية، وكتل تحويل المويجات لالتقاط الميزات ذات الترددات العالية والمنخفضة، وكتل انتباه عالمية ومحلية لإعطاء الأولوية للميزات المهمة مع تقليل التكرار. تستخدم عملية زيادة الدقة PixelShuffle لضمان تحسين تفاصيل الصورة.

تتضمن المميز المحسن وحدة ضوضاء لتعزيز المتانة، ودالة خسارة جديدة تجمع بين مقاييس الخسارة التقليدية مع خسائر اللون ومؤشر التشابه الهيكلي (SSIM) لتعزيز دقة اللون ووضوح الصورة. تحقق الطريقة أداءً متفوقًا، كما يتضح من أعلى نسبة ذروة للإشارة إلى الضوضاء (PSNR) ودرجات SSIM عبر مجموعات بيانات مختلفة، بما في ذلك StateHaze1k وRICEx. تؤكد التقييمات الذاتية هذه النتائج، كاشفة عن وضوح معزز، ووفاء لوني، والحفاظ على التفاصيل في الصور التي تم إزالة الضباب عنها. تسلط دراسات الإزالة الضوء على المساهمات الكبيرة لكل مكون من مكونات الشبكة، وخاصة الكتلة المتبقية الكثيفة، مما يؤكد فعالية الطريقة في معالجة التحديات الشائعة في إزالة الضباب من صور الاستشعار عن بعد. بشكل عام، تمثل هذه الطريقة المعتمدة على المويجات حلاً قويًا لتحسين الجودة البصرية وقابلية الاستخدام لصور الاستشعار عن بعد.

طرق

في هذا القسم، يقدم المؤلفون منهجيتهم لتقييم أداء إزالة الضباب لنموذجهم على مجموعات بيانات StateHaze1k وRICEx. يقدمون تحليلًا مقارنًا لمختلف التكوينات، بما في ذلك عدم وجود كتلة متبقية كثيفة (no_DRB)، وعدم وجود كتلة تحويل مويجات (no_WTB)، وعدم وجود كتلة طبقة انتباه عالمية (no_GALB)، وعدم وجود توسيع للخسارة (no_LS)، جنبًا إلى جنب مع الطريقة الكاملة. تشير النتائج، الملخصة في الجدول 1، إلى أن الطريقة الكاملة تحقق أعلى نسبة ذروة للإشارة إلى الضوضاء (PSNR) ومؤشر التشابه الهيكلي (SSIM) عبر جميع ظروف الضباب، مع قيم تبلغ 28.204 PSNR و0.927 SSIM للضباب الرقيق، و35.212 PSNR و0.982 SSIM لـ RICEx.

على الرغم من هذه النتائج الواعدة، يعترف المؤلفون بأن زيادة تعقيد النموذج تؤدي إلى أوقات معالجة أطول، مما يدفع الجهود المستقبلية لتقليل تعقيد الوحدات المتبقية الكثيفة. علاوة على ذلك، يلاحظون أن فعالية النموذج محدودة على صور الاستشعار عن بعد التقليدية، حيث يواجه صعوبة في السيناريوهات المعقدة التي تتضمن تغطية سحابية، أو ثلوج، أو أسطح عاكسة، والتي يمكن أن يتم تصنيفها بشكل خاطئ على أنها ضباب. لتعزيز قابلية تطبيق النموذج في مثل هذه البيئات الصعبة، يخطط المؤلفون للتحقيق في تحسينات في تقنيات إزالة الضباب المصممة خصيصًا لصور الاستشعار عن بعد المعقدة.

نقاش

في هذا القسم، يناقش المؤلفون بنية ومكونات شبكات GANs المعتمدة على المويجات المقترحة لإزالة الضباب من صور الاستشعار عن بعد. تم هيكلة الشبكة التوليدية باستخدام كتل متبقية كثيفة، وكتل تحويل مويجات، وآليات انتباه لتعزيز استخراج الميزات والتخفيف من آثار الضباب. تستخدم بنية الترميز-فك الترميز سلسلة من الوحدات الهجينة التي تستفيد من الطبقات التلافيفية، وكتل الانتباه، وعمليات PixelShuffle لاستخراج الميزات بشكل فعال واستعادة الدقة. تعمل الكتل المتبقية الكثيفة على تحسين تدفق المعلومات والتقاط القوام المعقد، بينما تقوم كتل تحويل المويجات بتفكيك الصور إلى مكونات ترددية مختلفة، مما يسهل تمثيل الميزات بشكل أفضل.

تكمل الشبكة التنافسية الشبكة التوليدية من خلال التمييز بين الصور التي تم إزالة الضباب عنها والصور الأصلية، مما يعزز الأداء العام لعملية إزالة الضباب. تم تقديم دالة خسارة محسنة، تتضمن خسائر تنافسية، وإدراكية، ومساحة لون LAB، ومؤشر التشابه الهيكلي (SSIM) لتقييم جودة الصور المولدة بشكل أفضل. يقوم المؤلفون بالتحقق من نهجهم باستخدام مجموعات بيانات StateHaze1K وRICEx، مما يظهر أداءً متفوقًا من حيث نسبة ذروة الإشارة إلى الضوضاء (PSNR) ومقاييس SSIM مقارنة بالطرق الحالية. تؤكد دراسات الإزالة بشكل أكبر على أهمية كل وحدة مقترحة، مما يبرز الكتلة المتبقية الكثيفة كأكثر المكونات تأثيرًا في تعزيز أداء إزالة الضباب.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-87240-z
PMID: https://pubmed.ncbi.nlm.nih.gov/39880836
Publication Date: 2025-01-29
Author(s): Guangda Chen et al.
Primary Topic: Image Enhancement Techniques

Overview

The research presents a novel dehazing method for remote sensing images utilizing generative adversarial networks (GANs) to mitigate the adverse effects of atmospheric haze on image quality. The proposed architecture comprises a generator network that enhances hazy images and a discriminator network that differentiates between dehazed and clear images. Key components of the generator include dense residual blocks for primary feature extraction, wavelet transform blocks for capturing both high and low-frequency features, and global and local attention blocks to prioritize significant features while minimizing redundancy. The upsampling process employs PixelShuffle to ensure detailed image refinement.

An improved discriminator incorporates a noise module to bolster robustness, and a new loss function combines traditional loss metrics with color and Structural Similarity Index (SSIM) losses to enhance color accuracy and visual fidelity. The method achieves superior performance, as evidenced by the highest Peak Signal-to-Noise Ratio (PSNR) and SSIM scores across various datasets, including StateHaze1k and RICEx. Subjective evaluations corroborate these findings, revealing enhanced clarity, color fidelity, and detail preservation in the dehazed images. Ablation studies highlight the significant contributions of each network component, particularly the dense residual block, affirming the method’s effectiveness in addressing common challenges in remote sensing image dehazing. Overall, this wavelet-based GAN approach represents a robust solution for improving the visual quality and usability of remote sensing images.

Methods

In this section, the authors present their methodology for evaluating the dehazing performance of their model on the StateHaze1k and RICEx datasets. They provide a comparative analysis of various configurations, including no Dense Residual Block (no_DRB), no Wavelet Transform Block (no_WTB), no Global Attention Layer Block (no_GALB), and no Loss Scaling (no_LS), alongside the complete method. The results, summarized in Table 1, indicate that the complete method achieves the highest Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index (SSIM) across all haze conditions, with values of 28.204 PSNR and 0.927 SSIM for thin haze, and 35.212 PSNR and 0.982 SSIM for RICEx.

Despite these promising results, the authors acknowledge that the model’s increased complexity leads to longer processing times, prompting future efforts to reduce the complexity of the dense residual modules. Furthermore, they note that the model’s effectiveness is limited to conventional remote sensing images, as it struggles with complex scenarios involving cloud cover, snow, or reflective surfaces, which can be misclassified as haze. To enhance the model’s applicability in such challenging environments, the authors plan to investigate improvements in dehazing techniques tailored for complex remote sensing images.

Discussion

In this section, the authors discuss the architecture and components of their proposed Wavelet-based Generative Adversarial Networks (GANs) for remote sensing image dehazing. The generative network is structured with dense residual blocks, wavelet transform blocks, and attention mechanisms to enhance feature extraction and mitigate haze effects. The encoder-decoder architecture employs a series of hybrid modules that utilize convolutional layers, attention blocks, and PixelShuffle operations for effective feature extraction and resolution restoration. The dense residual blocks improve information flow and capture complex textures, while the wavelet transform blocks decompose images into various frequency components, facilitating better feature representation.

The adversarial network complements the generative network by distinguishing between dehazed and original images, thereby enhancing the overall performance of the dehazing process. An improved loss function is introduced, incorporating adversarial, perceptual, LAB color space, and Structural Similarity Index (SSIM) losses to better evaluate the quality of generated images. The authors validate their approach using the StateHaze1K and RICEx datasets, demonstrating superior performance in terms of Peak Signal-to-Noise Ratio (PSNR) and SSIM metrics compared to existing methods. Ablation studies further confirm the significance of each proposed module, highlighting the dense residual block as the most impactful component in enhancing dehazing performance.