التحكم الديناميكي في الوصول القائم على الثقة الصفرية للحوسبة السحابية Zero-trust based dynamic access control for cloud computing

المجلة: Cybersecurity، المجلد: 8، العدد: 1
DOI: https://doi.org/10.1186/s42400-024-00320-x
تاريخ النشر: 2025-02-16
المؤلف: Ri Wang وآخرون
الموضوع الرئيسي: التحكم في الوصول والثقة

نظرة عامة

تتناول ورقة البحث التحديات التي تواجه آليات التحكم في الوصول التقليدية في سياق الحوسبة السحابية، خاصة في ضوء التهديدات السيبرانية المتزايدة. تنتقد الطرق الحالية للتحكم في الوصول الديناميكي التي تعتمد على القواعد الثابتة وتقترح نموذج التحكم في الوصول القائم على الثقة (TBAC) المستوحى من بنية الثقة الصفرية (ZTA). يتضمن نموذج TBAC منهجية لتقييم الثقة لتقييم موثوقية المستخدم بشكل مستمر ويقدم قواعد ديناميكية من خلال نظام DR-TBAC (TBAC مع قاعدة ديناميكية). يستخدم هذا النظام خوارزمية الشبكة العصبية العميقة (DQN) لتحديث عتبات الثقة بشكل تكيفي، مما يعزز سياسات التحكم في الوصول بناءً على سلوك المستخدم في الوقت الحقيقي.

في الختام، تم تصميم نظام DR-TBAC لتوفير رؤية مستمرة وتعزيز مرونة القواعد طوال دورة حياة التحكم في الوصول، مما يسمح بموقف أمني أكثر استجابة. من خلال الاستفادة من سجلات سلوك المستخدم في الوقت الحقيقي ودمجها في عملية التعلم المعزز، يقوم النظام بضبط قواعد التحكم في الوصول بشكل ديناميكي لإدارة الامتيازات بدقة أكبر. تظهر التقييمات التجريبية فعالية ودقة نظام DR-TBAC في تعزيز قيود سلوك المستخدم والأمان العام للسحابة. تشمل اتجاهات البحث المستقبلية استكشاف طرق التعلم المعزز العميق البديلة غير المتصلة بالإنترنت وتحسين استخدام الموارد لاتخاذ قرارات السياسة، بالإضافة إلى تطوير مخطط معالجة القواعد المصمم لمختلف منصات السحابة.

مقدمة

تناقش مقدمة ورقة البحث هذه تطور تكنولوجيا الحوسبة السحابية، مع التركيز على فوائدها في مشاركة وإدارة الموارد، بينما تسلط الضوء أيضًا على التحديات الأمنية الجديدة التي تقدمها. تعتبر أنظمة إدارة الهوية والوصول التقليدية (IAM)، مثل التحكم في الوصول القائم على الدور (RBAC) والتحكم في الوصول القائم على السمات (ABAC)، غير كافية للطبيعة الديناميكية والمتعددة المستأجرين لبيئات السحابة، مما يؤدي إلى زيادة المخاطر مثل خروقات البيانات والتهديدات الداخلية. تدعو الورقة إلى اعتماد بنية الثقة الصفرية (ZTA)، التي تنتقل من الأمان القائم على المحيط إلى نموذج يتطلب التحقق المستمر من الثقة لجميع طلبات الوصول، مما يعزز الأمان والقدرة على التكيف.

يقترح المؤلفون نموذج التحكم في الوصول القائم على الثقة (TBAC) الذي يتضمن تقييمات ثقة ديناميكية باستخدام نموذج الذاكرة القصيرة والطويلة الأمد (LSTM) لتقييم سلوك المستخدم في الوقت الحقيقي. يتم تنفيذ هذا النموذج في نظام التحكم في الوصول القائم على الثقة الديناميكي المعزز (DR-TBAC)، الذي يستخدم تقنيات التعلم الآلي، وخاصة التعلم المعزز العميق غير المتصل بالإنترنت، لتحسين قرارات التحكم في الوصول بناءً على تفاعلات المستخدم والتغيرات البيئية. توضح الورقة مساهمات نموذج TBAC ونظام DR-TBAC، بما في ذلك تتبع سلوك المستخدم في الوقت الحقيقي، وضبط ديناميكي لامتيازات الوصول، وتحليل مقارن مع النماذج الأساسية لإظهار فعاليتها. ستتناول الأقسام التالية من الورقة الأعمال ذات الصلة، وتصميم النموذج، وتنفيذ النظام، والنتائج التجريبية.

النتائج

تم إجراء نتائج التجارب التقييمية لنظام تحليل سلوك المرور القائم على القواعد الديناميكية والتحكم (DR-TBAC) المقترح على خادم مزود بمعالج Intel Xeon v2 بسرعة 2.0GHz، يعمل بنظام CentOS Linux 8، مع 32GB من الذاكرة وGTX 1080Ti GPU. تم استخدام PyTorch 1.9.0 وPython 3.6 في التدريب. ركز قسم “التقييم غير المتصل” على تحديد أفضل المعلمات الفائقة لنماذج الذاكرة القصيرة والطويلة الأمد (LSTM) والشبكة العصبية العميقة (DQN). في المقابل، سلط قسم “التقييم المتصل” الضوء على تحسينات الأداء لنظام DR-TBAC من خلال تنفيذ قواعد ديناميكية مصممة لتناسب سيناريوهات المستخدم في العالم الحقيقي، facilitated by OpenStack (الإصدار Victoria) مع عقدة تحكم واحدة وعقدة حساب واحدة.

علاوة على ذلك، أظهر نظام DR-TBAC دقة أمان متفوقة أثناء النشر عبر الإنترنت مقارنة بمختلف الأساليب الأساسية. تم توثيق النتائج الإحصائية التفصيلية بشأن قيم ردود الفعل السلبية المختلفة لأربعة مستخدمين، كما تم مناقشته في قسم “التقييم المتصل”، في الجداول 5 و6 لألف دورة الأولى من مرحلة التدريب عبر الإنترنت.

المناقشة

في قسم المناقشة من ورقة البحث، يقوم المؤلفون بتقييم نقدي لنماذج التحكم في الوصول القائمة على الثقة الموجودة ضمن إطار عمل بنية الثقة الصفرية (ZTA). يبرزون أن النماذج السابقة، مثل تلك التي اقترحها رiad وآخرون (2017) وياو وآخرون (2020)، تعتمد على سمات الثقة المحددة مسبقًا التي يتم تقييمها من خلال طرق ذاتية، مما يؤدي إلى عدم الاتساق والتحيزات المحتملة في تقييمات الثقة. بينما بدأت الأساليب الأكثر حداثة في دمج سلوك المستخدم والعوامل البيئية، فإنها غالبًا ما تقلل هذه الديناميكيات المعقدة إلى قيم عددية، مما يعرض المعلومات السياقية الحيوية للخطر. يجادل المؤلفون بأن تقييم الثقة الأكثر فعالية يجب أن يستفيد من تحليل البيانات في الوقت الحقيقي والتعلم المستمر، كما يتضح من نظام التحكم في الوصول القائم على الثقة الديناميكي في الوقت الحقيقي (DR-TBAC) الذي يقترحونه، والذي يدمج عوامل متعددة الأبعاد ويتكيف مع سياسات التفويض بشكل ديناميكي.

كما يؤكد المؤلفون على أهمية تحسين سياسات التفويض بالتزامن مع تقييمات الثقة. ينتقدون النماذج الحالية لطبيعتها الثابتة والخشنة، مما يحد من قدرتها على التكيف في البيئات الديناميكية. يهدف نظام DR-TBAC إلى معالجة هذه القيود من خلال استخدام تقنيات التعلم المعزز (RL) لتحديث عتبات الثقة باستمرار وتحسين قرارات التحكم في الوصول بناءً على سلوك المستخدم في الوقت الحقيقي والتغيرات البيئية. لا يعزز هذا النهج الأمان فحسب، بل يتماشى أيضًا مع مبادئ الحد الأدنى من الامتياز والتفويض الديناميكي، مما يسهم في إطار عمل أكثر قوة واستجابة للتحكم في الوصول في بيئات السحابة.

Journal: Cybersecurity, Volume: 8, Issue: 1
DOI: https://doi.org/10.1186/s42400-024-00320-x
Publication Date: 2025-02-16
Author(s): Ri Wang et al.
Primary Topic: Access Control and Trust

Overview

The research paper addresses the challenges of traditional access control mechanisms in the context of cloud computing, particularly in light of increasing cyber threats. It critiques existing dynamic access control methods that rely on static rules and propose a Trust-based Access Control (TBAC) model inspired by zero-trust architecture (ZTA). The TBAC model incorporates a trust assessment methodology to continuously evaluate user trustworthiness and introduces dynamic rules through the DR-TBAC (TBAC with Dynamic Rule) system. This system utilizes the Deep Q-Network (DQN) algorithm to adaptively update trust thresholds, thereby enhancing access control policies based on real-time user behavior.

In conclusion, the DR-TBAC system is designed to provide continuous visibility and flexible rule enhancement throughout the access control lifecycle, allowing for a more responsive security posture. By leveraging real-time user behavior logs and integrating them into the reinforcement learning process, the system dynamically adjusts access control rules for more precise privilege management. Experimental evaluations demonstrate the effectiveness and accuracy of the DR-TBAC system in strengthening user behavior constraints and overall cloud security. Future research directions include exploring alternative offline deep reinforcement learning methods and optimizing resource usage for policy decision-making, as well as developing a rule processing scheme tailored to various cloud platforms.

Introduction

The introduction of this research paper discusses the evolution of cloud computing technology, emphasizing its benefits in resource sharing and management while also highlighting the new security challenges it presents. Traditional identity and access management (IAM) systems, such as role-based access control (RBAC) and attribute-based access control (ABAC), are deemed inadequate for the dynamic and multi-tenant nature of cloud environments, leading to increased risks such as data breaches and insider threats. The paper advocates for a Zero Trust Architecture (ZTA), which shifts from perimeter-based security to a model that requires continuous verification of trust for all access requests, thereby enhancing security and adaptability.

The authors propose a Trust-Based Access Control (TBAC) model that incorporates dynamic trust assessments using a long short-term memory (LSTM) model to evaluate user behavior in real-time. This model is implemented in the Dynamic Reinforcement Trust-Based Access Control (DR-TBAC) system, which utilizes machine learning techniques, particularly deep offline reinforcement learning, to optimize access control decisions based on user interactions and environmental changes. The paper outlines the contributions of the TBAC model and DR-TBAC system, including real-time tracking of user behavior, dynamic adjustment of access privileges, and a comparative analysis with baseline models to demonstrate its effectiveness. The subsequent sections of the paper will elaborate on related work, model design, system implementation, and experimental results.

Results

The results of the evaluation experiments for the proposed Dynamic Rule-based Traffic Behavior Analysis and Control (DR-TBAC) system were conducted on a server equipped with a 2.0GHz Intel Xeon v2 processor, running CentOS Linux 8, with 32GB of memory and a GTX 1080Ti GPU. The training utilized PyTorch 1.9.0 and Python 3.6. The “Offline evaluation” section focused on determining the optimal hyperparameters for the Long Short-Term Memory (LSTM) and Deep Q-Network (DQN) models. In contrast, the “Online evaluation” section highlighted the performance improvements of the DR-TBAC system through the implementation of dynamic rules tailored to real-world user scenarios, facilitated by OpenStack (version Victoria) with one control node and one compute node.

Furthermore, the DR-TBAC system exhibited superior security accuracy during online deployment when compared to various baseline approaches. Detailed statistical results regarding different negative reward feedback values for four users, as discussed in the “Online evaluation” section, are documented in Tables 5 and 6 for the first 1000 epochs of the online training phase.

Discussion

In the discussion section of the research paper, the authors critically evaluate existing trust-based access control models within the framework of Zero Trust Architecture (ZTA). They highlight that previous models, such as those proposed by Riad et al. (2017) and Yao et al. (2020), rely on predefined trust attributes assessed through subjective methods, leading to inconsistencies and potential biases in trust evaluations. While more recent approaches have begun to incorporate user behavior and environmental factors, they often reduce these complex dynamics to numerical values, risking the loss of critical contextual information. The authors argue that a more effective trust assessment should leverage real-time data analysis and continuous learning, as demonstrated by their proposed Dynamic Real-Time Trust-Based Access Control (DR-TBAC) system, which integrates multi-dimensional factors and adapts authorization policies dynamically.

The authors also emphasize the importance of refining authorization policies in conjunction with trust assessments. They critique existing models for their static and coarse-grained nature, which limits their adaptability in dynamic environments. The DR-TBAC system aims to address these limitations by employing reinforcement learning (RL) techniques to continuously update trust thresholds and optimize access control decisions based on real-time user behavior and environmental changes. This approach not only enhances security but also aligns with the principles of least privilege and dynamic authorization, ultimately contributing to a more robust and responsive access control framework in cloud environments.