خوارزمية محسنة لاختيار الميزات ذات الطبقتين المتآزرتين مع مصنفين من نوعين للكشف الفعال عن التسلل في بيئة إنترنت الأشياء An improved synergistic dual-layer feature selection algorithm with two type classifier for efficient intrusion detection in IoT environment

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-91663-z
PMID: https://pubmed.ncbi.nlm.nih.gov/40055408
تاريخ النشر: 2025-03-07
المؤلف: G Logeswari وآخرون
الموضوع الرئيسي: أمن الشبكات وكشف التسلل

نظرة عامة

استجابةً للتعقيد المتزايد للتهديدات السيبرانية، يقدم هذا البحث نظام كشف التسلل (IDS) مصمم لشبكات إنترنت الأشياء (IoT)، التي تواجه تحديات أمنية متميزة لا تعالجها الطرق التقليدية بشكل كافٍ. يستخدم نظام كشف التسلل المقترح نهجًا متعدد الجوانب يتكون من ثلاثة أنظمة فرعية حاسمة: معالجة البيانات المسبقة، اختيار الميزات، والتصنيف. يعزز نظام معالجة البيانات المسبقة جودة البيانات من خلال تقنيات مثل التعامل مع القيم المفقودة، وإزالة التكرارات، وتطبيع الميزات عبر مقياس الحد الأدنى والحد الأقصى. يستخدم نظام اختيار الميزات خوارزمية اختيار الميزات الثنائية التآزرية (SDFC)، التي تدمج الأساليب الإحصائية وتقنيات النمذجة المتقدمة مثل آلة الدعم الناقل (SVM) مع الإزالة التكرارية للميزات (RFE) وتحسين سرب الجسيمات (PSO) لتحديد الميزات ذات الصلة. للتصنيف، يستخدم مصنف ذو مرحلتين باستخدام LightGBM وXGBoost لتصنيف حركة مرور الشبكة كعادية أو خبيثة. تشير النتائج التجريبية على مجموعة بيانات TON-IoT إلى أن نظام كشف التسلل المقترح يحقق مقاييس أداء متفوقة، بما في ذلك الدقة (0.9370)، والموثوقية (0.9365)، والاسترجاع (0.9378)، ودرجة F1 (0.9371)، مقارنةً بالطرق الموجودة.

تؤكد الاستنتاجات على النهج الشامل لنظام كشف التسلل في تعزيز أمان إنترنت الأشياء من خلال التعامل الفعال مع البيانات، واختيار الميزات المتقدم، والتصنيف القوي. تشمل اتجاهات البحث المستقبلية استكشاف هندسة الميزات المتقدمة، وتنفيذ التعلم التكيفي لتحديث النماذج، ودمج تدابير أمنية إضافية، وتحسين القابلية للتوسع والمعالجة في الوقت الحقيقي، والتحقق من النظام باستخدام بيانات إنترنت الأشياء من العالم الحقيقي. كما يُقترح تحسين واجهات المستخدم وآليات الاستجابة التلقائية لتحسين الكفاءة وسهولة الاستخدام، مما يؤدي في النهاية إلى أنظمة كشف ومنع التسلل أكثر مرونة لشبكات إنترنت الأشياء.

النتائج

استخدمت التجارب التي أجريت في هذه الدراسة بايثون (الإصدار 3.x) والعديد من المكتبات الرئيسية، بما في ذلك TensorFlow (الإصدار 2.x)، Keras (الإصدار 2.x)، Scikit-learn (الإصدار 0.24+)، Pandas (الإصدار 1.x)، NumPy (الإصدار 1.x)، Matplotlib، وSeaborn. تم اختيار هذه المكتبات بناءً على قدراتها القوية في دعم مهام التعلم العميق، ومعالجة البيانات، والتصور.

تظهر النتائج التي تم الحصول عليها من التجارب فعالية المنهجية المقترحة في تحقيق النتائج المرجوة. تشير التحليلات التفصيلية لمقاييس الأداء إلى تحسينات كبيرة مقارنة بالنماذج الأساسية، مما يبرز إمكانيات التقنيات المنفذة في تقدم مجال الدراسة. يتم تقديم مناقشات إضافية حول تداعيات هذه النتائج في الأقسام التالية.

المناقشة

تسلط قسم المناقشة في ورقة البحث الضوء على نقاط الضعف في شبكات إنترنت الأشياء تجاه التهديدات السيبرانية، مما يبرز ضرورة وجود حلول أمنية متقدمة بسبب طبيعتها المعقدة والمترابطة. ركزت الدراسات الحديثة على تعزيز أنظمة كشف التسلل (IDS) من خلال تقنيات التعلم العميق والتعلم الآلي. على سبيل المثال، استكشف Tsimenidis وآخرون نماذج التعلم العميق، مشيرين إلى إمكانياتها في تحسين دقة الكشف مع معالجة قيود الطرق التقليدية. أظهر عمل مغربي في أتمتة عمليات كشف التسلل تحسينات كبيرة في معدلات الكشف وتقليل الإيجابيات الكاذبة. قدم غوبتا وآخرون نهجًا هجينًا يجمع بين تقنيات التحسين والتعلم العميق، والذي أظهر أيضًا أداءً محسّنًا في الكشف عن التسللات المعقدة.

تحدد الدراسة التحديات المستمرة، مثل التكاليف الحاسوبية العالية، ومشكلات القابلية للتوسع، والحاجة إلى مجموعات بيانات موسومة كبيرة، والتي تعيق النشر العملي لهذه الحلول المتقدمة لنظام كشف التسلل. يستخدم النظام المقترح في الورقة مجموعة بيانات TON-IoT، التي تتكون من أكثر من 22 مليون حالة، ويستخدم نهج تصنيف ذو مرحلتين باستخدام LightGBM وXGBoost لتعزيز دقة الكشف وكفاءته. يتضمن هذا النظام عملية اختيار ميزات قوية من خلال طريقة من طبقتين، مما يضمن استخدام الميزات ذات الصلة لتحسين أداء النموذج مع تقليل العبء الحاسوبي. بشكل عام، تؤكد النتائج على أهمية تطوير حلول أمنية قابلة للتوسع وفعالة مصممة لتحديات شبكات إنترنت الأشياء الفريدة.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-91663-z
PMID: https://pubmed.ncbi.nlm.nih.gov/40055408
Publication Date: 2025-03-07
Author(s): G Logeswari et al.
Primary Topic: Network Security and Intrusion Detection

Overview

In response to the growing sophistication of cyber threats, this research presents an Intrusion Detection System (IDS) tailored for Internet of Things (IoT) networks, which face distinct security challenges inadequately addressed by traditional methods. The proposed IDS employs a multi-faceted approach comprising three critical subsystems: data pre-processing, feature selection, and classification. The data pre-processing subsystem enhances data quality through techniques such as handling missing values, removing duplicates, and normalizing features via min-max scaling. The feature selection subsystem utilizes the Synergistic Dual-Layer Feature Selection (SDFC) algorithm, integrating statistical methods and advanced model-based techniques like Support Vector Machine (SVM) with Recursive Feature Elimination (RFE) and Particle Swarm Optimization (PSO) to identify relevant features. For classification, a two-stage classifier using LightGBM and XGBoost effectively categorizes network traffic as normal or malicious. Experimental results on the TON-IoT dataset indicate that the proposed IDS achieves superior performance metrics, including accuracy (0.9370), precision (0.9365), recall (0.9378), and F1 score (0.9371), compared to existing methods.

The conclusions emphasize the IDS’s comprehensive approach to enhancing IoT security through effective data handling, advanced feature selection, and robust classification. Future research directions include exploring advanced feature engineering, implementing adaptive learning for model updates, integrating additional security measures, optimizing for scalability and real-time processing, and validating the system with real-world IoT data. Enhancements in user interfaces and automated response mechanisms are also suggested to improve efficiency and usability, ultimately leading to more resilient intrusion detection and prevention systems for IoT networks.

Results

The experiments conducted in this study utilized Python (version 3.x) and several key libraries, including TensorFlow (version 2.x), Keras (version 2.x), Scikit-learn (version 0.24+), Pandas (version 1.x), NumPy (version 1.x), Matplotlib, and Seaborn. The choice of these libraries was based on their robust capabilities in supporting deep learning tasks, data manipulation, and visualization.

The results obtained from the experiments demonstrate the effectiveness of the proposed methodology in achieving the desired outcomes. Detailed analyses of the performance metrics indicate significant improvements over baseline models, underscoring the potential of the implemented techniques in advancing the field of study. Further discussions on the implications of these findings are provided in the subsequent sections.

Discussion

The discussion section of the research paper highlights the vulnerabilities of IoT networks to cyber threats, emphasizing the necessity for advanced security solutions due to their complex and interconnected nature. Recent studies have focused on enhancing Intrusion Detection Systems (IDS) through deep learning and machine learning techniques. For instance, Tsimenidis et al. explored deep learning models, noting their potential to improve detection accuracy while addressing the limitations of traditional methods. Maghrabi’s work on automating intrusion detection processes demonstrated significant improvements in detection rates and reduced false positives. Gupta et al. introduced a hybrid approach combining optimization techniques with deep learning, which also showed enhanced performance in detecting sophisticated intrusions.

The research identifies ongoing challenges, such as high computational costs, scalability issues, and the need for large labeled datasets, which hinder the practical deployment of these advanced IDS solutions. The proposed system in the paper utilizes the TON-IoT dataset, comprising over 22 million instances, and employs a two-stage classification approach using LightGBM and XGBoost to enhance detection accuracy and efficiency. This system incorporates a robust feature selection process through a two-layer method, ensuring relevant features are utilized to improve model performance while minimizing computational overhead. Overall, the findings underscore the importance of developing scalable and efficient security solutions tailored for the unique challenges of IoT networks.