QSegRNN: شبكة عصبية متكررة كمية لتوقع السلاسل الزمنية QSegRNN: quantum segment recurrent neural network for time series forecasting

المجلة: EPJ Quantum Technology، المجلد: 12، العدد: 1
DOI: https://doi.org/10.1140/epjqt/s40507-025-00333-6
تاريخ النشر: 2025-03-03
المؤلف: K.M. Moon وآخرون
الموضوع الرئيسي: طرق التنبؤ بسوق الأسهم

نظرة عامة

تناقش هذه الفقرة التحديات المرتبطة بإدارة درجة حرارة محولات الكهرباء في مراكز البيانات، وخاصة تلك المستخدمة في أبحاث الذكاء الاصطناعي (AI). إن توليد الحرارة الكبير من هذه المحولات يتطلب إدارة فعالة لدرجة الحرارة، مما يحفز البحث المكثف في نماذج التنبؤ المعتمدة على الذكاء الاصطناعي. ومع ذلك، فإن الحجم المتزايد والمتطلبات الحاسوبية لهذه النماذج غالبًا ما تتجاوز الموارد الحاسوبية المتاحة، مما يعقد تشغيلها. لمعالجة هذه المشكلة، يقترح المؤلفون الشبكة العصبية المتكررة الكمية (QSegRNN)، وهي نموذج لتنبؤ السلاسل الزمنية يستخدم الحوسبة الكمية لتحقيق أداء مشابه للنماذج التقليدية مع الحاجة إلى عدد أقل من المعلمات.

تستند QSegRNN إلى الشبكة العصبية المتكررة التقليدية (SegRNN) ولكنها تتضمن خلية كمية بدلاً من خلية تقليدية، مما يسمح بتصميم أكثر كفاءة. يستخدم النموذج دائرة تلافيفية كمية مع تضمين السعة لتقليل فقدان المعلومات، مصممة للأجهزة الكمية المتوسطة الضوضاء (NISQ). تظهر النتائج التجريبية أن QSegRNN تتفوق على SegRNN ونماذج التنبؤ الأخرى على الرغم من احتوائها على 85% فقط من المعلمات، مما يشير إلى إمكانية الحوسبة الكمية لتعزيز كفاءة نماذج الذكاء الاصطناعي. يبرز المؤلفون قابلية تطبيق النموذج في التحكم في درجة الحرارة للمحولات وغيرها من مهام التنبؤ بالسلاسل الزمنية، مع الاعتراف أيضًا بالتحديات المتعلقة بتعقيد النموذج وقيود الأجهزة الكمية الحالية. ستركز الأبحاث المستقبلية على تقليل التعقيد، وتحسين قابلية التوسع، وتكييف النموذج لمختلف المنصات الكمية لتعزيز نشره العملي.

مقدمة

تسلط مقدمة ورقة البحث الضوء على الطلب المتزايد على الموارد الحاسوبية المدفوعة بالتقدم في الذكاء الاصطناعي (AI) والتطبيقات المعتمدة على البيانات، وخاصة من خلال خدمات الحوسبة السحابية. وقد أدى هذا الارتفاع إلى انتشار مراكز البيانات، التي تشتهر باستهلاكها العالي للطاقة. تشير الورقة إلى أن استخدام الكهرباء العالمي من قبل مراكز البيانات من المتوقع أن يصل إلى مستويات حرجة، مما يشكل تحديات بيئية كبيرة إذا لم يتم التعامل معها. علاوة على ذلك، تعتبر مراكز البيانات مساهمًا كبيرًا في انبعاثات الكربون، مما يبرز الحاجة الملحة لممارسات الطاقة المستدامة واستراتيجيات تحسين الطاقة.

تزيد متطلبات التخزين المتزايدة المرتبطة بعصر البيانات الضخمة من الطلبات الطاقوية لمراكز البيانات، مما يؤدي إلى زيادة استهلاك الطاقة. يتم التأكيد على الاعتماد على المحولات لتوفير الكهرباء، حيث يمكن أن يؤدي الاستخدام المفرط للكهرباء إلى رفع درجات حرارة المحولات، مما قد يؤدي إلى أعطال أو مخاطر حريق. وبالتالي، تظهر الإدارة الفعالة لاستهلاك الكهرباء ودرجات حرارة المحولات كقضية حاسمة لسلامة التشغيل واستدامة مراكز البيانات.

الطرق

في هذا القسم، يقدم المؤلفون المنهجية لنموذجهم المقترح للشبكة العصبية المتكررة الكمية (QSegRNN)، وهو نموذج هجين يستفيد من التعلم الآلي الكمي (QML) لتحقيق أداء مشابه لنماذج التعلم العميق التقليدية مع استخدام عدد أقل من المعلمات. يتم هيكلة QSegRNN في مرحلتين رئيسيتين: الترميز وفك الترميز. خلال مرحلة الترميز، يتم أولاً تقسيم البيانات، تليها عملية الترميز التكرارية لإعداد البيانات لمرحلة فك الترميز. تتضمن مرحلة فك الترميز فك الترميز التكراري وتنبؤ البيانات المشفرة، مع تطبيق تطبيع الحالة لمعالجة مشاكل تحول التوزيع.

تعتبر الابتكار الرئيسي في QSegRNN هو استبدال خلية الشبكة العصبية المتكررة (RNN) في نموذج SegRNN بنموذج QML قائم على وحدة متكررة بوابة هجين (HQGRU)، المشار إليها باسم HQRNN. يسمح هذا التعديل لـ QSegRNN بالحفاظ على الأداء مع تقليل حجم النموذج، مما يحقق دقة تنبؤ محسنة على الرغم من احتوائه على 0.3 مليون معلمة أقل. علاوة على ذلك، يتم تسليط الضوء على استخدام تضمين السعة في الدائرة الكمية المتغيرة (VQC) لـ HQGRU في كلا المرحلتين كوسيلة لتقليل فقدان المعلومات، مما يعزز أداء النموذج مقارنة بتضمين الزاوية. بشكل عام، تظهر QSegRNN نهجًا واعدًا لدمج التقنيات الكمية في التعلم الآلي.

المناقشة

في هذا القسم، يناقش المؤلفون التحديات والمساهمات لنموذجهم الهجين للتنبؤ بالسلاسل الزمنية الكمية-التقليدية، الشبكة العصبية المتكررة الكمية (QSegRNN). يبرزون أنه بينما استخدمت الدراسات السابقة الذكاء الاصطناعي للتنبؤ بدرجات حرارة المحولات، فإن التعقيد المتزايد لهذه النماذج غالبًا ما يؤدي إلى زيادة الطلب على الموارد، مما يجعل من الصعب تدريبها ونشرها بشكل فعال. للتخفيف من هذه القضايا، تم تصميم QSegRNN للعمل مع عدد أقل من المعلمات مقارنة بالشبكات العصبية المتكررة التقليدية (RNNs)، وخاصة SegRNN، مع تحقيق مقاييس أداء محسنة، مثل متوسط الخطأ التربيعي (MSE) البالغ 0.006 ومتوسط الخطأ المطلق (MAE) البالغ 0.004.

تدمج QSegRNN بنية هجينة كمية-تقليدية تستخدم طبقة تضمين السعة لترميز الميزات التقليدية إلى حالات كمية، مما يقلل من فقدان المعلومات ويعزز كفاءة النموذج. يؤكد المؤلفون أن هيكل نموذجهم يسمح له باكتساب مزيد من المعلومات من الطبقات السابقة مع الحفاظ على الجدوى الحاسوبية، مما يجعله مناسبًا للتطبيقات الواقعية في تنبؤ درجة حرارة المحولات الكهربائية. يختتمون بتوضيح تنظيم بقية الورقة، والتي تشمل الأعمال ذات الصلة، هيكل النموذج، نتائج المحاكاة، ومناقشات حول نتائجهم.

Journal: EPJ Quantum Technology, Volume: 12, Issue: 1
DOI: https://doi.org/10.1140/epjqt/s40507-025-00333-6
Publication Date: 2025-03-03
Author(s): K.M. Moon et al.
Primary Topic: Stock Market Forecasting Methods

Overview

The section discusses the challenges associated with managing the temperature of electricity transformers in data centers, particularly those used for artificial intelligence (AI) research. The significant heat generation from these transformers necessitates effective temperature management, prompting extensive research into AI-based forecasting models. However, the increasing size and computational demands of these models often exceed available computing resources, complicating their operation. To address this issue, the authors propose the quantum segment recurrent neural network (QSegRNN), a time series forecasting model that utilizes quantum computing to achieve comparable performance to classical models while requiring fewer parameters.

QSegRNN is inspired by the classical segment recurrent neural network (SegRNN) but incorporates a quantum cell instead of a classical one, allowing for a more efficient design. The model employs a quantum convolutional circuit with amplitude embedding to minimize information loss, tailored for noisy intermediate-scale quantum (NISQ) devices. Experimental results demonstrate that QSegRNN outperforms SegRNN and other forecasting models despite having only 85% of the parameters, indicating the potential of quantum computing to enhance AI model efficiency. The authors highlight the model’s applicability in temperature control for transformers and other time series forecasting tasks, while also acknowledging challenges related to model complexity and the limitations of current quantum hardware. Future research will focus on reducing complexity, improving scalability, and adapting the model for various quantum platforms to enhance its practical deployment.

Introduction

The introduction of the research paper highlights the escalating demand for computational resources driven by advancements in artificial intelligence (AI) and data-driven applications, particularly through cloud computing services. This surge has resulted in the proliferation of data centers, which are notorious for their high energy consumption. The paper notes that global electricity usage by data centers is projected to reach critical levels, posing significant environmental challenges if not addressed. Furthermore, data centers are substantial contributors to carbon emissions, emphasizing the urgent need for sustainable energy practices and power optimization strategies.

The increasing storage requirements associated with the big data era exacerbate the energy demands of data centers, leading to heightened power consumption. The reliance on transformers for electricity supply is underscored, as excessive electricity usage can elevate transformer temperatures, potentially resulting in malfunctions or fire hazards. Consequently, effective management of electricity consumption and transformer temperatures emerges as a crucial concern for the operational integrity and sustainability of data centers.

Methods

In this section, the authors present the methodology for their proposed quantum segment recurrent neural network (QSegRNN), a hybrid model that leverages quantum machine learning (QML) to achieve performance comparable to classical deep learning models while utilizing fewer parameters. The QSegRNN is structured into two main phases: encoding and decoding. During the encoding phase, the data is first segmented, followed by recursive encoding to prepare the data for the decoding phase. The decoding phase involves recursive decoding and prediction of the encoded data, with instance normalization applied to address distribution shift issues.

A key innovation in QSegRNN is the replacement of the recurrent neural network (RNN) cell in the SegRNN model with a hybrid quantum gated recurrent unit (HQGRU)-based QML model, referred to as HQRNN. This modification allows the QSegRNN to maintain performance while reducing the model size, achieving improved forecasting accuracy despite having 0.3 million fewer parameters. Furthermore, the use of amplitude embedding in the variational quantum circuit (VQC) of the HQGRU in both phases is highlighted as a means to minimize information loss, enhancing the model’s performance relative to angle embedding. Overall, the QSegRNN demonstrates a promising approach to integrating quantum techniques in machine learning.

Discussion

In this section, the authors discuss the challenges and contributions of their proposed hybrid quantum-classical time series forecasting model, the Quantum Segment Recurrent Neural Network (QSegRNN). They highlight that while previous studies have utilized artificial intelligence to predict transformer temperatures, the increasing complexity of these models often leads to higher resource demands, making them difficult to train and deploy effectively. To mitigate these issues, QSegRNN is designed to operate with fewer parameters than traditional recurrent neural networks (RNNs), specifically the SegRNN, while achieving improved performance metrics, such as a mean squared error (MSE) of 0.006 and a mean absolute error (MAE) of 0.004.

The QSegRNN incorporates a hybrid quantum-classical architecture that utilizes an amplitude embedding layer to encode classical features into quantum states, thereby reducing information loss and enhancing the model’s efficiency. The authors emphasize that their model’s structure allows it to inherit more information from previous layers while maintaining computational feasibility, making it suitable for real-world applications in electricity transformer temperature forecasting. They conclude by outlining the organization of the remainder of the paper, which includes related works, model structure, simulation results, and discussions on their findings.