نهج ستيغانوغرافي متعدد الطبقات مدفوع بالتعلم العميق لتعزيز أمان البيانات A deep learning-driven multi-layered steganographic approach for enhanced data security

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-89189-5
PMID: https://pubmed.ncbi.nlm.nih.gov/39922893
تاريخ النشر: 2025-02-08
المؤلف: Yousef Sanjalawe وآخرون
الموضوع الرئيسي: تقنيات التشفير المتقدمة وإضافة العلامات المائية

نظرة عامة

تقدم ورقة البحث إطار عمل جديد متعدد الطبقات للتخفي مصمم لتعزيز سلامة البيانات، والأصالة، والسرية في سياق زيادة التهديدات الأمنية. تواجه الطرق التقليدية للتخفي قيودًا مثل سعة الحمولة المحدودة والضعف أمام الكشف. يدمج الإطار المقترح ترميز هوفمان، وإدخال أقل بت أهمية (LSB)، ومشفر-فك تشفير قائم على التعلم العميق، مما يحسن بشكل كبير من عدم الكشف، والصلابة، والأمان. تظهر التقييمات التجريبية باستخدام مجموعات بيانات مرجعية مثل Tiny ImageNet وCOCO وCelebA فعالية الإطار، حيث تحقق مؤشر تشابه هيكلي (SSIM) باستمرار فوق 99% ودقة استعادة نصية تبلغ 100% تحت الظروف القياسية، بينما تظهر أيضًا مقاومة ضد الهجمات الشائعة.

في الختام، لا يعالج الإطار التحديات المعاصرة في إخفاء البيانات بشكل آمن فحسب، بل يظهر أيضًا وعدًا للتطبيقات في الاتصالات الآمنة وإدارة الحقوق الرقمية. ستركز الأعمال المستقبلية على تحسين كفاءة الإطار وقابليته للتكيف من خلال الاستفادة من النماذج المدربة مسبقًا، واستكشاف تقنيات التعلم الميتا، وتنقيح بنية المشفر-فك التشفير. تهدف هذه التحسينات إلى تحسين الأداء عبر مقاييس مختلفة، وتقليل الحمل الحسابي، وضمان استعادة البيانات بشكل قوي حتى في الظروف الصعبة. من المتوقع أن يوسع التطوير المستمر لهذا الإطار من قابليته للتطبيق ويؤثر بشكل كبير على أمان البيانات عبر مجالات متنوعة.

الطرق

تجمع المنهجية المقدمة في هذا البحث بين ترميز هوفمان وتخفي الصور، معززة بإطار عمل مشفر-فك تشفير قائم على التعلم العميق (DL). يقوم المشفر بمعالجة كل من الصور المغطاة والسرية من خلال شبكة تخفي قائمة على DL، والتي تستخرج الميزات من الصورة السرية وتدمجها في الصورة المغطاة مع الحفاظ على عدم الكشف البصري. تقلل هذه الطريقة من الآثار القابلة للكشف وتزيد من الصلابة ضد تحليل التخفي. يتم هيكلة المنهجية حول خوارزميتين: خوارزمية الترميز، التي تحول البيانات النصية إلى تنسيق مشفر بهوفمان وتدمجها مع تخفي أقل بت أهمية (LSB)، وخوارزمية فك التشفير، التي تستخرج وتعيد بناء البيانات المخفية من صورة الحاوية.

تستخدم التحقق التجريبي مجموعة بيانات TinyImageNet، التي تتكون من 200 فئة كائن مع صور تم تغيير حجمها إلى 64×64×3 بكسل. يركز تدريب نموذج DL على تحقيق توازن بين دقة الصورة المغطاة وسلامة البيانات السرية من خلال دالة خسارة تُسمى “steganography_loss”، والتي تتضمن متوسط الخطأ التربيعي (MSE) للصورة المغطاة وMSE للبيانات السرية. تظهر النتائج أن الطريقة المقترحة تحقق أداءً متفوقًا مقارنة بأساليب LSB التقليدية والحديثة القائمة على التعلم العميق عبر مقاييس مختلفة، بما في ذلك انخفاض كبير في MSE (4.30%)، ونسبة إشارة إلى ضوضاء ذروة (PSNR) أعلى تبلغ 61.80 ديسيبل، ومؤشر تشابه هيكلي (SSIM) قريب من الكمال يبلغ 99.99%. بالإضافة إلى ذلك، تظهر الطريقة سعة حمولة عالية تبلغ 2.6 بت لكل بكسل (bpp) ومعدل كشف منخفض يبلغ 59.8%، مما يشير إلى مقاومة محسنة ضد تحليل التخفي. بشكل عام، يوازن النهج متعدد الطبقات بشكل فعال بين عدم الكشف، ودقة استعادة البيانات، والأمان، مما يجعله مناسبًا للتطبيقات التي تتطلب سلامة بيانات عالية.

النتائج

تتناول قسم النتائج فعالية نهج جديد يدمج ترميز هوفمان مع تخفي أقل بت أهمية (LSB) لإنشاء صورة سرية، والتي يتم تضمينها بعد ذلك داخل صورة مغطاة باستخدام نموذج تعلم عميق (DL). استخدمت التجارب مجموعة بيانات Tiny ImageNet، وهي مجموعة فرعية من ImageNet، والتي تتكون من 200 فئة كائن مع 500 صورة تدريب، و50 صورة تحقق، و50 صورة اختبار، جميعها تم تغيير حجمها إلى أبعاد 64 × 64 بكسل. تم اختيار هذه المجموعة نظرًا لتوازنها بين الكفاءة الحسابية والتعقيد، مما يجعلها مناسبة لتقييم صلابة وقدرات تعميم طريقة التخفي متعددة الطبقات المقترحة.

تبرر اختيار Tiny ImageNet بقدرتها على توفير مجموعة متنوعة من الصور، من الكائنات البسيطة إلى القوام المعقدة، مما يسمح بإجراء اختبارات شاملة تحت ظروف بصرية متغيرة. يتماشى حجم الصورة الأصغر مع الهدف المتمثل في تقليل التشويه البصري في الصورة المغطاة، مما يعزز من عدم وضوح البيانات المدمجة. تشير النتائج إلى أن النتائج التي تم الحصول عليها من Tiny ImageNet قد تتعمم بشكل جيد على مجموعات بيانات الصور الطبيعية الأخرى المستخدمة عادة في أبحاث التخفي، مثل CIFAR-10 وCelebA. ومع ذلك، بالنسبة للتطبيقات في المجالات المتخصصة مثل التصوير الطبي أو صور الأقمار الصناعية، سيكون من الضروري إجراء مزيد من الاختبارات على مجموعات بيانات محددة للتحقق من قابلية تطبيق الطريقة.

المناقشة

يتناول قسم المناقشة في ورقة البحث تطور ومنهجيات التخفي، مع التركيز على طبيعته المزدوجة إلى جانب تحليل التخفي. التخفي، المشتق من مصطلحات يونانية تعني “الكتابة المغطاة”، هو ممارسة إخفاء المعلومات داخل بيانات أخرى، مما يضمن أن المستلمين المقصودين فقط يمكنهم اكتشاف الرسالة المخفية. تصنف الورقة تقنيات التخفي إلى أشكال رقمية ولغوية، مع تقسيم التخفي الرقمي إلى طرق تقنية وأخرى قائمة على الأساليب. تشمل التقنيات البارزة طريقة أقل بت أهمية (LSB)، واختلاف قيمة البكسل (PVD)، وطرق متقدمة تستخدم التعلم الآلي (ML) وإطارات التعلم العميق (DL)، مثل الشبكات التنافسية التوليدية (GANs) والشبكات العصبية الالتفافية (CNNs). تعزز هذه التقنيات الحديثة من صلابة وأمان البيانات المخفية ضد الكشف من قبل تحليل التخفي.

كما يسلط القسم الضوء على التطورات الأخيرة في التخفي، بما في ذلك دمج ترميز هوفمان لضغط البيانات وإخفائها، مما يحسن من كفاءة تضمين الرسائل المخفية. يجمع النهج متعدد الطبقات المقترح بين ترميز هوفمان مع إدخال LSB وإطار عمل مشفر-فك تشفير قائم على التعلم العميق، مما يعزز بشكل كبير من عدم الكشف، والصلابة، والسعة للنظام. يعالج هذا التصميم ذو الطبقتين قيود الطرق التقليدية، مما يوفر وسيلة أكثر أمانًا وكفاءة لإخفاء البيانات. تؤكد التجارب التي أجريت على فعالية التقنية المقترحة، مما يظهر إمكانياتها في الاتصالات الآمنة وإدارة الحقوق الرقمية في تطبيقات متنوعة. بشكل عام، تؤكد التطورات في التخفي على مستقبل واعد للبحث في هذا المجال، مدفوعًا بالتطور المستمر للشبكات العصبية والتقنيات التكيفية.

القيود

يستعرض قسم القيود القيود المرتبطة بالتقنيات المناقشة في الورقة. بينما تمتلك كل تقنية ميزات رئيسية ومزايا مميزة، إلا أنها تواجه أيضًا قيودًا كبيرة قد تؤثر على قابليتها للتطبيق وفعاليتها. قد تشمل هذه القيود عوامل مثل التعقيد الحسابي، والحساسية للظروف الأولية، أو القيود في قابلية التوسع، مما قد يعيق أدائها في السيناريوهات الواقعية.

علاوة على ذلك، يبرز القسم الحاجة إلى النظر بعناية في هذه القيود عند اختيار تقنية لتطبيقات محددة. فهم التوازن بين المزايا والقيود أمر حاسم للباحثين والممارسين لاتخاذ قرارات مستنيرة تتماشى مع أهدافهم وسياق عملهم.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-89189-5
PMID: https://pubmed.ncbi.nlm.nih.gov/39922893
Publication Date: 2025-02-08
Author(s): Yousef Sanjalawe et al.
Primary Topic: Advanced Steganography and Watermarking Techniques

Overview

The research paper presents a novel multi-layered steganographic framework designed to enhance data integrity, authenticity, and confidentiality in the context of increasing security threats. Traditional steganographic methods face limitations such as restricted payload capacity and vulnerability to detection. The proposed framework integrates Huffman coding, Least Significant Bit (LSB) embedding, and a deep learning-based encoder-decoder, significantly improving imperceptibility, robustness, and security. Experimental evaluations using benchmark datasets like Tiny ImageNet, COCO, and CelebA demonstrate the framework’s effectiveness, achieving a Structural Similarity Index Metric (SSIM) consistently above 99% and a text recovery accuracy of 100% under standard conditions, while also exhibiting resilience against common attacks.

In conclusion, the framework not only addresses contemporary challenges in secure data hiding but also shows promise for applications in secure communication and digital rights management. Future work will focus on optimizing the framework’s efficiency and adaptability by leveraging pre-trained models, exploring meta-learning techniques, and refining the encoder-decoder architecture. These enhancements aim to improve performance across various metrics, reduce computational overhead, and ensure robust data recovery even in adverse conditions. The ongoing development of this framework is expected to broaden its applicability and significantly impact data security across diverse fields.

Methods

The methodology presented in this research combines Huffman coding and image steganography, enhanced by a deep learning (DL)-based encoder-decoder framework. The encoder processes both cover and secret images through a DL hiding network, which extracts features from the secret image and embeds them into the cover image while maintaining visual imperceptibility. This approach minimizes detectable artifacts and increases robustness against steganalysis. The methodology is structured around two algorithms: the Encode Algorithm, which converts textual data into Huffman-coded format and integrates it with least significant bit (LSB) steganography, and the Decode Algorithm, which extracts and reconstructs the hidden data from the container image.

Experimental validation employs the TinyImageNet dataset, consisting of 200 object classes with images resized to 64×64×3 pixels. The training of the DL model focuses on balancing cover image fidelity and secret data integrity through a loss function termed “steganography_loss,” which incorporates cover mean squared error (MSE) and secret MSE. The results demonstrate that the proposed method achieves superior performance compared to traditional LSB and modern deep learning approaches across various metrics, including a significantly lower MSE (4.30%), a higher peak signal-to-noise ratio (PSNR) of 61.80 dB, and a near-perfect structural similarity index (SSIM) of 99.99%. Additionally, the method exhibits a high payload capacity of 2.6 bits per pixel (bpp) and a low detection rate of 59.8%, indicating enhanced resistance to steganalysis. Overall, the multi-layered approach effectively balances imperceptibility, data recovery accuracy, and security, making it suitable for applications requiring high data integrity.

Results

The results section details the effectiveness of a novel approach that integrates Huffman coding with Least Significant Bit (LSB) steganography to create a secret image, which is subsequently embedded within a cover image using a deep learning (DL) model. The experiments utilized the Tiny ImageNet dataset, a subset of ImageNet, which comprises 200 object classes with 500 training images, 50 validation images, and 50 test images, all resized to dimensions of 64 × 64 pixels. This dataset was chosen for its balance between computational efficiency and complexity, making it suitable for evaluating the robustness and generalization capabilities of the proposed multi-layered steganographic method.

The selection of Tiny ImageNet is justified by its ability to provide a diverse range of images, from simple objects to complex textures, which allows for comprehensive testing under varying visual conditions. The smaller image size aligns with the goal of minimizing visual distortion in the cover image, thereby enhancing the invisibility of the embedded data. The findings suggest that the results obtained from Tiny ImageNet may generalize well to other natural image datasets commonly used in steganography research, such as CIFAR-10 and CelebA. However, for applications in specialized domains like medical imaging or satellite imagery, further testing on domain-specific datasets would be required to validate the method’s applicability.

Discussion

The discussion section of the research paper delves into the evolution and methodologies of steganography, emphasizing its dual nature alongside steganalysis. Steganography, derived from Greek terms meaning “covered writing,” is the practice of concealing information within other data, ensuring that only intended recipients can detect the hidden message. The paper categorizes steganographic techniques into digital and linguistic forms, with digital steganography further divided into technical and method-based approaches. Notable techniques include the Least Significant Bit (LSB) method, Pixel-Value Differencing (PVD), and advanced methods utilizing machine learning (ML) and deep learning (DL) frameworks, such as Generative Adversarial Networks (GANs) and Convolutional Neural Networks (CNNs). These modern techniques enhance the robustness and security of hidden data against detection by steganalysis.

The section also highlights recent advancements in steganography, including the integration of Huffman encoding for data compression and obfuscation, which improves the efficiency of embedding hidden messages. The proposed multi-layered approach combines Huffman coding with LSB embedding and a deep learning encoder-decoder framework, significantly enhancing the system’s imperceptibility, robustness, and capacity. This dual-layer design addresses the limitations of traditional methods, providing a more secure and efficient means of data concealment. The experiments conducted validate the proposed technique’s effectiveness, showcasing its potential for secure communication and digital rights management in various applications. Overall, the advancements in steganography underscore a promising future for research in this field, driven by the continuous evolution of neural networks and adaptive techniques.

Limitations

The section on limitations outlines the constraints associated with the techniques discussed in the paper. While each technique possesses distinct key features and advantages, they also face significant limitations that may impact their applicability and effectiveness. These limitations could include factors such as computational complexity, sensitivity to initial conditions, or constraints in scalability, which may hinder their performance in real-world scenarios.

Furthermore, the section emphasizes the need for careful consideration of these limitations when selecting a technique for specific applications. Understanding the trade-offs between advantages and limitations is crucial for researchers and practitioners to make informed decisions that align with their objectives and the context of their work.