تعزيز توقعات أسعار الأسهم باستخدام GANs وآليات الانتباه المعتمدة على المحولات Enhancing stock price prediction using GANs and transformer-based attention mechanisms

المجلة: Empirical Economics، المجلد: 68، العدد: 1
DOI: https://doi.org/10.1007/s00181-024-02644-6
تاريخ النشر: 2024-10-03
المؤلف: S. Li وآخرون
الموضوع الرئيسي: طرق التنبؤ بسوق الأسهم

نظرة عامة

تقدم هذه القسم نظرة عامة على أهمية توقع أسعار الأسهم في اتخاذ القرارات المالية، مع تسليط الضوء على قيود الطرق الإحصائية التقليدية في التقاط الديناميات المعقدة للسوق والتعامل مع مجموعات البيانات الكبيرة. لمعالجة هذه التحديات، يقترح المؤلفون نهجًا جديدًا يجمع بين الشبكات التنافسية التوليدية (GANs) وآليات الانتباه المعتمدة على المحولات. يهدف هذا التكامل إلى توليد بيانات أسعار الأسهم الاصطناعية مع دمج مشاعر السوق وتقلباته، مما يعزز تحديد المؤشرات الرئيسية للسوق التي تؤثر على أسعار الأسهم.

يستفيد النموذج المقترح من آليات الانتباه للتركيز على الميزات الحرجة داخل البيانات، مما يحسن من دقة وموثوقية التوقعات. بالإضافة إلى ذلك، يتناول المؤلفون القضايا الشائعة المرتبطة بالشبكات التنافسية التوليدية وآليات الانتباه، مثل توليد البيانات غير الواقعية والتكيف الزائد، من خلال تنفيذ تقنيات التنظيم واستخدام مصادر بيانات إضافية. سيتم تقييم فعالية هذا النهج من خلال التجارب على بيانات سوق الأسهم في العالم الحقيقي، مع توقع أن توفر النتائج رؤى قيمة للمستثمرين والمحللين الماليين، مما يؤدي في النهاية إلى إبلاغ استراتيجيات استثمار أفضل.

مقدمة

تؤكد مقدمة ورقة البحث على الدور الحاسم لتوقع أسعار الأسهم في المالية، مما يساعد المستثمرين والمحللين في اتخاذ قرارات مستنيرة بشأن معاملات الأسهم. تم استخدام الطرق الإحصائية التقليدية، مثل المتوسطات المتحركة، وتحليل الانحدار، وتوقع السلاسل الزمنية (مثل ARIMA والتنعيم الأسي)، تاريخيًا لهذا الغرض. ومع ذلك، غالبًا ما تكافح هذه الطرق مع الأنماط المعقدة وغير الخطية ومجموعات البيانات الكبيرة، مما يؤدي إلى قيود في دقتها التنبؤية. تقدم التطورات الأخيرة في التعلم الآلي والتعلم العميق، لا سيما من خلال استخدام الشبكات التنافسية التوليدية (GANs) وآليات الانتباه، بدائل واعدة يمكن أن تعزز من توقع أسعار الأسهم.

تحدد الورقة إمكانيات الشبكات التنافسية التوليدية في توليد بيانات سوق الأسهم الاصطناعية، مما يحسن من دقة التوقعات في السيناريوهات التي تعاني من نقص البيانات، بينما يمكن أن تساعد آليات الانتباه في تحديد المؤشرات الرئيسية للسوق التي تؤثر على أسعار الأسهم. أظهرت الدراسات السابقة فعالية هذه التقنيات، حيث حسنت الشبكات التنافسية التوليدية من أداء نماذج الذاكرة الطويلة القصيرة (LSTM) وحسنت آليات الانتباه من قابلية تفسير النموذج. ومع ذلك، تتطلب التحديات مثل انهيار الوضع في الشبكات التنافسية التوليدية والتكيف الزائد في نماذج الانتباه مزيدًا من البحث. تهدف الدراسة المقترحة إلى دمج الشبكات التنافسية التوليدية وآليات الانتباه المعتمدة على المحولات، جنبًا إلى جنب مع بيانات مشاعر السوق وتقلباته، لتطوير نموذج أكثر قوة لتوقع أسعار الأسهم. ستقوم الأبحاث بتقييم فعالية هذه النماذج مقابل بيانات السوق الحقيقية واستكشاف آثارها على المستثمرين والمحللين الماليين.

طرق

في هذا القسم، يوضح المؤلفون المنهجيات المستخدمة في توقع أسعار إغلاق أسهم شركة آبل باستخدام مجموعات بيانات التدريب والاختبار. أظهر النموذج كفاءة في تحديد الاتجاهات الصاعدة والهابطة في أسعار الأسهم خلال مرحلة التدريب، مما أدى إلى توقعات دقيقة للفترة التي تلت منتصف عام 2022. ومع ذلك، واجه صعوبة في حساب الزيادة الكبيرة في سعر سهم آبل، الذي ارتفع بأكثر من 42% بعد مارس 2023، وذلك بسبب المشاعر الإيجابية في السوق المحيطة بالتطورات في الذكاء الاصطناعي (AI).

لتقييم أداء النموذج، استخدم المؤلفون معيار الجذر التربيعي لمتوسط الخطأ (RMSE) القياسي كمعيار أكثر موثوقية من RMSE التقليدي، الذي يميل إلى إنتاج نتائج مبالغ فيها. تشير النتائج، المعروضة في الجدول 1، إلى أن قيم RMSE القياسية كانت 2.414 لمجموعة التدريب و3.331 لمجموعة الاختبار. تؤكد هذه النتائج على القدرات التنبؤية الفعالة للنموذج في توقع أسعار الأسهم، لا سيما في سياق المشهد بعد جائحة COVID-19.

مناقشة

في هذا البحث، استخدم المؤلفون مجموعة بيانات شاملة مأخوذة من Yahoo Finance وSeeking Alpha لتوقع أسعار أسهم شركة آبل (AAPL) على مدى ست سنوات (2017-2023). تتضمن مجموعة البيانات أسعار الأسهم التاريخية، وبيانات من شركات التكنولوجيا الكبرى (أمازون، مايكروسوفت، جوجل)، ومؤشرات الأسهم (ناسداك، نيويورك، RUSSELL2000)، ومؤشر VIX لتقلبات السوق، ومؤشرات اقتصادية مثل سعر الصرف بين اليوان الصيني (CNY) والدولار الأمريكي (USD) وأسعار الخزينة. يثري دمج مشاعر الأخبار من Seeking Alpha، التي تم تحليلها باستخدام نموذج FinBERT، مجموعة البيانات من خلال التقاط مشاعر السوق وتأثيرها المحتمل على أسعار الأسهم.

شمل معالجة البيانات تطبيع تغطية الأخبار واستخدام نهج المتوسط المرجح لأخذ في الاعتبار مقالات متعددة في نفس اليوم، مما يعزز تمثيل الأحداث السوقية. تم حساب المتوسطات المتحركة وتحويلات فورييه لتحديد الاتجاهات والأنماط الأساسية في أسعار الأسهم. يجمع هيكل النموذج بين مشفر تلقائي متغير (VAE) لإزالة الضوضاء واستخراج الميزات مع شبكة تنافسية توليدية (GAN) لتوقع أسعار الأسهم المستقبلية. يقوم مولد الشبكة التنافسية التوليدية بتوليد بيانات مستقبلية بناءً على الاتجاهات التاريخية، بينما يميز المميز بين البيانات الحقيقية والبيانات المولدة. تم استخدام تحسين بايزي لضبط المعلمات الفائقة لتحسين أداء النموذج، بهدف تحسين دقة توقعات أسعار الأسهم. تم تقييم أداء النموذج باستخدام الجذر التربيعي لمتوسط الخطأ (RMSE) لقياس دقة التوقعات.

القيود

تنبع قيود النموذج التنبؤي الم discussed في ورقة البحث هذه أساسًا من توفر البيانات، والتحيزات المحتملة، وعدم قدرة النموذج على حساب الشذوذات السوقية. أولاً، تقتصر فعالية النموذج على الوصول المحدود إلى بيانات مالية تاريخية عالية الجودة، لا سيما للأصول الرئيسية مثل S&P 500، والنفط الخام، والذهب. تؤثر هذه الندرة ليس فقط على دقة توقعات أسعار الأسهم ولكن قد تقدم أيضًا تحيزات، خاصة إذا كانت البيانات تمثل بشكل غير متناسب ظروف السوق أو فئات الأصول، لا سيما ضمن قطاع التكنولوجيا. يجب أن تهدف الأبحاث المستقبلية إلى دمج مجموعة أوسع من مجموعات البيانات واستخدام تقنيات مثل توليد البيانات الاصطناعية لمعالجة هذه التحيزات.

بالإضافة إلى ذلك، يفتقر النموذج حاليًا إلى آليات للتعامل بشكل كافٍ مع الشذوذات السوقية غير القابلة للتنبؤ والحركات الشديدة، التي يمكن أن تشوه التوقعات بشكل كبير. تعقد الشكوك الكامنة في النمذجة التنبؤية أيضًا من توافق أسعار الأسهم المتوقعة مع الظروف السوقية الفعلية، لا سيما خلال فترات التقلب. لتعزيز القوة، يجب أن تدمج النسخ المستقبلية تقنيات الكشف المتقدمة وآليات التعديل الديناميكي. علاوة على ذلك، لا تأخذ التحليل في الاعتبار تكاليف المعاملات والسيولة، والتي تعتبر حيوية لتقييم جدوى استراتيجيات التداول بناءً على الأسعار المتوقعة. إن معالجة هذه القيود من خلال دمج منهجيات التعلم التعزيزي وضبط المعلمات الفائقة أمر ضروري لتحسين قدرات النموذج التنبؤية ودعم اتخاذ قرارات استثمارية مستنيرة.

Journal: Empirical Economics, Volume: 68, Issue: 1
DOI: https://doi.org/10.1007/s00181-024-02644-6
Publication Date: 2024-10-03
Author(s): S. Li et al.
Primary Topic: Stock Market Forecasting Methods

Overview

The section provides an overview of the significance of stock price prediction in financial decision-making, highlighting the limitations of traditional statistical methods in capturing complex market dynamics and handling large datasets. To address these challenges, the authors propose a novel approach that combines generative adversarial networks (GANs) with transformer-based attention mechanisms. This integration aims to generate synthetic stock price data while incorporating market sentiment and volatility, thereby enhancing the identification of key market indicators that influence stock prices.

The proposed model leverages attention mechanisms to focus on critical features within the data, improving the accuracy and robustness of predictions. Additionally, the authors tackle common issues associated with GANs and attention mechanisms, such as unrealistic data generation and overfitting, by implementing regularization techniques and utilizing supplementary data sources. The effectiveness of this approach will be evaluated through experiments on real-world stock market data, with the results expected to provide valuable insights for investors and financial analysts, ultimately informing better investment strategies.

Introduction

The introduction of the research paper emphasizes the critical role of stock price prediction in finance, aiding investors and analysts in making informed decisions regarding stock transactions. Traditional statistical methods, such as moving averages, regression analysis, and time series forecasting (e.g., ARIMA and exponential smoothing), have historically been employed for this purpose. However, these methods often struggle with complex, nonlinear patterns and large datasets, leading to limitations in their predictive accuracy. Recent advancements in machine learning and deep learning, particularly through the use of generative adversarial networks (GANs) and attention mechanisms, present promising alternatives that could enhance stock price forecasting.

The paper outlines the potential of GANs to generate synthetic stock market data, thereby improving prediction accuracy in data-scarce scenarios, while attention mechanisms can help identify key market indicators influencing stock prices. Previous studies have demonstrated the effectiveness of these techniques, with GANs enhancing the performance of long short-term memory (LSTM) models and attention mechanisms improving model interpretability. Nonetheless, challenges such as mode collapse in GANs and overfitting in attention models necessitate further research. The proposed study aims to integrate GANs and transformer-based attention mechanisms, alongside market sentiment and volatility data, to develop a more robust stock price prediction model. The research will evaluate the efficacy of these models against real market data and explore their implications for investors and financial analysts.

Methods

In this section, the authors detail the methodologies employed in predicting Apple’s closing stock prices using both training and testing datasets. The model demonstrated proficiency in identifying upward and downward trends in stock prices during the training phase, leading to accurate predictions for the period following mid-2022. However, it struggled to account for the significant surge in Apple’s stock price, which increased by over 42% after March 2023, attributed to the positive market sentiment surrounding advancements in artificial intelligence (AI).

To evaluate the model’s performance, the authors utilized the standard Root Mean Square Error (RMSE) as a more reliable metric than conventional RMSE, which tends to produce inflated results. The findings, presented in Table 1, indicate that the standard RMSE values were 2.414 for the training set and 3.331 for the testing set. These results underscore the model’s effective predictive capabilities in stock price forecasting, particularly in the context of the post-COVID-19 pandemic landscape.

Discussion

In this research, the authors utilized a comprehensive dataset sourced from Yahoo Finance and Seeking Alpha to predict the stock prices of Apple Inc. (AAPL) over a six-year period (2017-2023). The dataset includes historical stock prices, data from major technology companies (Amazon, Microsoft, Google), stock indices (NASDAQ, NYSE, RUSSELL2000), the VIX index for market volatility, and economic indicators such as the exchange rate between the Chinese Yuan (CNY) and the US Dollar (USD) and Treasury rates. The integration of news sentiment from Seeking Alpha, analyzed using the FinBERT model, further enriches the dataset by capturing market sentiment and its potential impact on stock prices.

Data preprocessing involved normalizing news coverage and employing a weighted average approach to account for multiple articles on the same day, enhancing the representation of market events. Moving averages and Fourier transforms were calculated to identify trends and underlying patterns in stock prices. The model architecture combines a variational autoencoder (VAE) for denoising and feature extraction with a generative adversarial network (GAN) for predicting future stock prices. The GAN’s generator synthesizes future data based on historical trends, while the discriminator distinguishes between real and generated data. Hyperparameter tuning through Bayesian optimization was employed to refine model performance, ultimately aiming to improve the accuracy of stock price predictions. The evaluation of model performance was conducted using Root Mean Square Error (RMSE) to quantify prediction accuracy.

Limitations

The limitations of the predictive model discussed in this research paper primarily stem from data availability, potential biases, and the model’s inability to account for market anomalies. Firstly, the model’s effectiveness is constrained by the limited access to high-quality historical financial data, particularly for key assets such as the S&P 500, Crude Oil, and Gold. This scarcity not only affects the accuracy of stock price forecasts but may also introduce biases, especially if the data disproportionately represents certain market conditions or asset classes, notably within the technology sector. Future research should aim to incorporate a broader range of datasets and employ techniques like synthetic data generation to address these biases.

Additionally, the model currently lacks mechanisms to adequately handle unpredictable market anomalies and extreme movements, which can significantly distort predictions. The inherent uncertainties in predictive modeling further complicate the alignment of predicted stock prices with actual market conditions, particularly during periods of volatility. To enhance robustness, future iterations should integrate advanced detection techniques and dynamic adjustment mechanisms. Furthermore, the analysis does not consider transaction costs and liquidity, which are vital for assessing the feasibility of trading strategies based on predicted prices. Addressing these limitations through the integration of reinforcement learning methodologies and hyperparameter fine-tuning is essential for improving the model’s predictive capabilities and supporting informed investment decisions.