التنبؤ بسلاسل الزمن لمؤشرات الشبكات المالية المعتمد على نماذج اللغة الكبيرة Large language model-driven time-series forecasting of financial network indicators

المجلة: Frontiers in Artificial Intelligence، المجلد: 9
DOI: https://doi.org/10.3389/frai.2026.1722121
PMID: https://pubmed.ncbi.nlm.nih.gov/41696046
تاريخ النشر: 2026-01-29
المؤلف: Mini Han Wang وآخرون
الموضوع الرئيسي: طرق التنبؤ بسوق الأسهم

نظرة عامة

تستكشف هذه الورقة البحثية دور نماذج اللغة الكبيرة (LLMs) في تعزيز توقعات السلاسل الزمنية داخل الأسواق المالية، مع التركيز بشكل خاص على ديناميكيات شبكات معلومات الأسهم التي تشكلها الحيازات المتقاطعة المؤسسية. باستخدام لوحة حيازة الصناديق والأسهم ربع السنوية من 2016 إلى 2024، يقوم المؤلفون ببناء رسوم بيانية ثنائية الأطراف للصناديق والأسهم لاشتقاق مؤشرين رئيسيين للشبكة: مركزية الدرجة ($cen_d$)، التي تشير إلى التركيز على مستوى السوق، وكثافة المتبقية ($den$)، التي تعكس الشذوذ على مستوى الشركات. تقدم الدراسة إطار عمل معزز بتوقعات LLM يدمج السلاسل الزمنية العددية والإفصاحات النصية للصناديق، محققة تحسينات كبيرة في دقة التوقعات مقارنة بالطرق التقليدية مثل ARIMA وProphet.

تكشف النتائج أن النهج القائم على LLM لا يقلل فقط من متوسط الخطأ المطلق وجذر متوسط مربع الخطأ، بل يعزز أيضًا قدرة النموذج على اكتشاف الارتفاعات الشاذة في $den$. تظهر تحليل أوزان الانتباه أن النموذج يعطي الأولوية للفترات التاريخية التي تتميز بحركة مشتركة كبيرة للصناديق أو صدمات سياسية، مما يوفر رؤى قابلة للتفسير حول سلوك السوق. تمتد تداعيات هذا البحث إلى الممارسات المالية والتنظيم، مقدمة تحذيرات مبكرة من المخاطر النظامية وتساعد في تحديد الأسهم الحرجة للتدخلات المستهدفة. على الرغم من نقاط قوته، تعترف الدراسة بالقيود، بما في ذلك الاعتماد على البيانات ربع السنوية وكثافة الحوسبة لنماذج LLM، بينما تقترح اتجاهات بحثية مستقبلية لدمج بيانات ذات تردد أعلى وسياقات سوقية أوسع. بشكل عام، تؤسس هذه العمل منهجية جديدة تدمج مقاييس نظرية الرسوم البيانية مع قدرات LLM، مما يساهم في تقديم رؤى قيمة للمستثمرين والمنظمين في الحفاظ على استقرار السوق.

مقدمة

تناقش مقدمة هذه الورقة البحثية الديناميات المعقدة للأسواق المالية كنظم مترابطة حيث تتدفق المعلومات ورأس المال والمخاطر من خلال شبكات الاستثمارات المؤسسية. تسلط الضوء على أهمية الحيازات المتداخلة بين الصناديق، التي تخلق شبكات معلومات الأسهم التي يمكن أن تضخم صدمات السوق وتؤدي إلى سلوك القطيع، مما يزيد من المخاطر النظامية. تؤكد الدراسة على الحاجة إلى تحسين طرق التوقع لفهم وتوقع تطور هذه الشبكات، خاصة في ضوء التقدمات الأخيرة في نماذج اللغة الكبيرة (LLMs) التي يمكن أن تدمج بفعالية أنواع بيانات متنوعة وتلتقط الاعتماديات طويلة المدى.

يحدد المؤلفون عدة تحديات في توقع شبكات معلومات الأسهم، بما في ذلك تعقيد البيانات عالية الأبعاد التي تجمع بين المقاييس الكمية المنظمة والإشارات النوعية غير المنظمة، وعدم استقرار الشبكات السوقية بسبب التغيرات المفاجئة، وضرورة القابلية للتفسير في مخرجات النموذج. لمعالجة هذه القضايا، تقترح الدراسة إطار عمل جديد للتوقع يستفيد من LLMs لتحليل كل من السلاسل الزمنية العددية ومعلومات الصناديق النصية، مما يعزز الدقة التنبؤية ويقدم رؤى حول الآليات التي تحرك ديناميات السوق. تظهر النتائج التجريبية أن هذا النهج القائم على LLM يتفوق على طرق التوقع التقليدية، مقدماً نظام تحذير مبكر موثوق لعدم استقرار السوق والمخاطر النظامية، بينما يعزز أيضًا الشفافية وقوة التفسير للتوقعات المالية.

طرق

في هذا القسم، يقدم المؤلفون خط أنابيب جديد للتوقعات الزمنية المعززة بـ LLM يهدف إلى توقع تطور شبكات معلومات الأسهم والمؤشرات الهيكلية الرئيسية. يدمج الإطار تحليل الرسوم البيانية، والنمذجة الزمنية، وقدرات التفكير باللغة الطبيعية لنماذج اللغة الكبيرة (LLMs). يتكون من أربعة مكونات مترابطة: تمثيل البيانات وبناء المطالبات، التوقع القائم على LLM، المقارنة مع النماذج التقليدية ونماذج التعلم العميق، والتقييم الشامل. تؤكد البنية، الموضحة في مخطط تدفق طبقي، على توحيد مصادر البيانات المتنوعة، بما في ذلك المتغيرات الزمنية المالية، ومستندات إفصاح الصناديق، والتقارير الاقتصادية الكلية، ولقطات طوبولوجيا الشبكة، لالتقاط الديناميات الهيكلية والمعلوماتية للسوق.

تتميز جوهر خط الأنابيب بمشفر قائم على LLM يستخدم بنية المحول مع آليات الانتباه الذاتي متعددة الرؤوس. يقوم هذا المكون بتوليف المعلومات الزمنية والنصية والطوبولوجية لتحديد الاعتماديات عبر الأنماط بين الكيانات المالية. تنتج رأس التوقعات مخرجات تنبؤية، بما في ذلك توقعات لمركزية المستقبل ($cen_d$) ومؤشرات كثافة المتبقية ($den$)، بالإضافة إلى عناصر قابلة للتفسير مثل تصورات أوزان الانتباه ودرجات أهمية الميزات المعتمدة على SHAP. تُترجم هذه المخرجات إلى تطبيقات عملية ضمن طبقة دعم القرار، مما يسهل أنظمة تنبيه المخاطر واستراتيجيات تخصيص المحافظ. قام المؤلفون بإجراء مهمتين للتوقع: توقع مركزية السوق على مستوى الدرجة ربع سنة واحدة مقدماً واكتشاف الارتفاعات الشاذة في الكثافة المتبقية على مستوى الأسهم الفردية، باستخدام طريقة تقييم ذات أصل متداول لضمان اختبار صارم خارج العينة.

نتائج

تشير نتائج الدراسة إلى أن المتنبئ القائم على LLM تفوق بشكل كبير على المعايير التقليدية في توقع مركزية الربع التالي (cen_d) من 2016 إلى 2024، محققاً متوسط خطأ مطلق (MAE) قدره 0.012 ومتوسط جذر مربع الخطأ (RMSE) قدره 0.019. بالمقارنة، سجل أفضل منافس تقليدي، TFT، متوسط خطأ مطلق قدره 0.021 ومتوسط جذر مربع الخطأ قدره 0.033. بلغ متوسط نسبة الخطأ المطلق (MAPE) للـ LLM 4.8%، مما يقلل تقريبًا من خطأ أقرب منافس له، بينما وصلت دقته الاتجاهية إلى 87%. تمكن النموذج من تحديد نقاط الانعطاف الحرجة خلال فترات تقلب السوق، تحديدًا حول الربع الرابع من 2018 والربع الأول من 2022، حيث تأخرت النماذج التقليدية.

فيما يتعلق باكتشاف ارتفاعات الكثافة الشاذة على مستوى الأسهم الفردية، أظهر الـ LLM قدرات تحذير مبكر متفوقة، محققًا دقة قدرها 82%، واسترجاع قدره 79%، ودرجة F1 قدرها 0.80، متفوقًا على TFT وProphet وARIMA. سلطت دراسات الحالة الضوء على قدرة النموذج على توقع ارتفاعات الكثافة المتبقية في الأسهم التكنولوجية والمالية ذات القيمة السوقية الكبيرة ربع سنة واحدة مقدماً، مما يمكّن استراتيجيات محتملة للتخفيف من المخاطر مثل إعادة توازن المحافظ. تؤكد النتائج فعالية التوقع القائم على LLM في تعزيز كل من التوقعات النقطية واكتشاف الأحداث ضمن الشبكات المالية المعقدة، مما يشير إلى أن دمج النموذج لمؤشرات مشتقة من الرسوم البيانية والسرد النصي أمر حاسم لالتقاط الانقطاعات الهيكلية وأنماط العدوى غير الخطية. بشكل عام، لا يتجاوز الإطار المعزز بـ LLM المعايير الإحصائية الحالية فحسب، بل يوفر أيضًا رؤى قابلة للتنفيذ للمشاركين في السوق والمنظمين في إدارة المخاطر النظامية.

مناقشة

تسلط قسم المناقشة في الورقة البحثية الضوء على فعالية خط أنابيب التوقع المعزز بنموذج اللغة الكبيرة (LLM) في توقع التغيرات الهيكلية والمخاطر النظامية داخل الشبكات المالية. باستخدام لوحة حيازة الصناديق والأسهم ربع السنوية من 2016 إلى 2024، تبني الدراسة شبكة ثنائية الأطراف لتحليل علاقات الأسهم والصناديق، ثم تقوم بإسقاط ذلك على طبقة الأسهم لاشتقاق مقاييس رئيسية مثل مركزية الدرجة ($cen_d$) وكثافة المتبقية ($den$). يدمج هيكل LLM كل من البيانات العددية والنصية، مما يسمح بفهم دقيق لديناميات السوق. يتفوق النموذج على طرق التوقع التقليدية، محققًا متوسط خطأ مطلق (MAE) قدره 0.012 ودقة اتجاهية قدرها 87%، مما يدل على قوته في التقاط كل من الاتجاهات التدريجية والشذوذ المفاجئ.

تُبرز قابلية تفسير نموذج LLM من خلال تصورات الانتباه، التي تكشف عن تركيزه على الفترات الحرجة من التحولات السوقية والأسهم الرئيسية التي تعمل كقنوات لتدفق المعلومات. لا تعزز هذه القدرة الدقة التنبؤية فحسب، بل توفر أيضًا رؤى قابلة للتنفيذ لإدارة المخاطر. تشير النتائج إلى أن إطار عمل LLM يمكن أن يكون أداة قيمة للإشراف المالي، مما يمكّن من الكشف المبكر عن المخاطر النظامية وتحسين استراتيجيات المحافظ. بشكل عام، تُظهر الدراسة أن نماذج LLM يمكن أن تسد الفجوة بين التوقع الكمي والسرد النوعي للسوق، مقدمة نهجًا قابلاً للتوسع وقابلًا للتفسير لتحليل السلاسل الزمنية المالية.

قيود

تسلط قسم القيود الضوء على عدة مجالات حيوية لتحسين نتائج البحث. على الرغم من أن النموذج يظهر تعميمًا قويًا عبر آفاق زمنية متنوعة، إلا أن فعاليته قد تتناقص في مواجهة التحولات النظامية غير الممثلة في بيانات التدريب، مثل التغيرات السياسية غير المتوقعة أو الأحداث الجيوسياسية. علاوة على ذلك، بينما توفر تحليل القابلية للتفسير رؤى قيمة، إلا أنها تظل نوعية؛ يمكن أن تعزز الأبحاث المستقبلية هذا الجانب من خلال دمج مقاييس التفسير الكمية أو أطر الاستدلال السببي لتعزيز مصداقية التفسيرات المعتمدة على الانتباه.

بالإضافة إلى ذلك، يقترح المؤلفون أن توسيع الإطار ليشمل بيانات متعددة الأسواق أو عبر الحدود سيمكن من تقييم قابليته للتوسع وملاءمته في سيناريوهات المخاطر النظامية العالمية. بشكل عام، تشير النتائج إلى أن نماذج اللغة الكبيرة (LLMs) تمثل تقدمًا كبيرًا في توقع السلاسل الزمنية المالية، مقدمة مزيجًا من الدقة، والقابلية للتفسير، والرؤى القابلة للتنفيذ. من خلال دمج النمذجة التنبؤية مع آليات قابلة للتفسير وإشارات اقتصادية ذات صلة، يمهد الإطار المقترح الطريق لنموذج جديد في التحليلات المالية الذي ي prioritizes الأداء والفهم ضمن بيئات السوق المعقدة.

Journal: Frontiers in Artificial Intelligence, Volume: 9
DOI: https://doi.org/10.3389/frai.2026.1722121
PMID: https://pubmed.ncbi.nlm.nih.gov/41696046
Publication Date: 2026-01-29
Author(s): Mini Han Wang et al.
Primary Topic: Stock Market Forecasting Methods

Overview

This research paper investigates the role of large language models (LLMs) in enhancing time-series forecasting within financial markets, specifically focusing on the dynamics of stock information networks shaped by institutional cross-holdings. Utilizing a quarterly fund-stock holding panel from 2016 to 2024, the authors construct bipartite fund-stock graphs to derive two critical network indicators: degree centralization ($cen_d$), which indicates market-wide concentration, and residual density ($den$), reflecting firm-level anomalies. The study introduces an LLM-enhanced forecasting framework that integrates numerical time series and textual fund disclosures, achieving significant improvements in forecasting accuracy over traditional methods such as ARIMA and Prophet.

The findings reveal that the LLM-based approach not only reduces mean absolute error and root mean square error but also enhances the model’s ability to detect abnormal spikes in $den$. Attention-weight analysis shows that the model prioritizes historical periods marked by significant fund co-movement or policy shocks, thus providing interpretable insights into market behavior. The implications of this research extend to financial practice and regulation, offering early warnings of systemic risks and aiding in the identification of critical stocks for targeted interventions. Despite its strengths, the study acknowledges limitations, including reliance on quarterly data and the computational intensity of LLMs, while suggesting future research directions to incorporate higher-frequency data and broader market contexts. Overall, this work establishes a novel methodology that merges graph-theoretic metrics with LLM capabilities, contributing valuable insights for investors and regulators in maintaining market stability.

Introduction

The introduction of this research paper discusses the intricate dynamics of financial markets as interconnected systems where information, capital, and risk flow through networks of institutional investments. It highlights the significance of overlapping portfolios among funds, which create stock information networks that can amplify market shocks and lead to herding behavior, thereby increasing systemic risk. The study emphasizes the need for improved forecasting methods to understand and predict the evolution of these networks, particularly in light of recent advancements in large language models (LLMs) that can effectively integrate diverse data types and capture long-range dependencies.

The authors identify several challenges in forecasting stock information networks, including the complexity of high-dimensional data that combines structured quantitative metrics with unstructured qualitative signals, the non-stationarity of market networks due to abrupt changes, and the necessity for interpretability in model outputs. To address these issues, the study proposes a novel forecasting framework that leverages LLMs to analyze both numerical time series and textual fund information, enhancing predictive accuracy and providing insights into the mechanisms driving market dynamics. Empirical results demonstrate that this LLM-based approach outperforms traditional forecasting methods, offering a reliable early-warning system for market instability and systemic risk, while also enhancing the transparency and explanatory power of financial predictions.

Methods

In this section, the authors present a novel LLM-enhanced time-series forecasting pipeline aimed at predicting the evolution of stock information networks and key structural indicators. The framework integrates graph analytics, temporal modeling, and the natural language reasoning capabilities of large language models (LLMs). It consists of four interconnected components: data representation and prompt construction, LLM-based forecasting, benchmarking against classical and deep learning models, and end-to-end evaluation. The architecture, depicted in a layered flowchart, emphasizes the unification of heterogeneous data sources, including financial time-series variables, fund disclosure documents, macroeconomic reports, and network topology snapshots, to capture the market’s structural and informational dynamics.

The core of the pipeline features an LLM-based encoder utilizing a transformer architecture with multi-head self-attention mechanisms. This component synthesizes temporal, textual, and topological information to identify cross-modal dependencies among financial entities. The forecasting head produces predictive outputs, including forecasts of future centralization ($cen_d$) and residual density ($den$) indices, as well as interpretability artifacts such as attention-weight visualizations and SHAP-based feature importance scores. These outputs are translated into practical applications within a decision-support layer, facilitating risk-alert systems and portfolio allocation strategies. The authors conducted two forecasting tasks: predicting market-level degree centralization one quarter ahead and detecting abnormal residual-density spikes at the individual-stock level, employing a rolling-origin evaluation method to ensure rigorous out-of-sample testing.

Results

The results of the study indicate that the LLM forecaster significantly outperformed traditional benchmarks in next-quarter centralization (cen_d) forecasting from 2016 to 2024, achieving a Mean Absolute Error (MAE) of 0.012 and a Root Mean Square Error (RMSE) of 0.019. In comparison, the best classical competitor, TFT, recorded an MAE of 0.021 and an RMSE of 0.033. The LLM’s Mean Absolute Percentage Error (MAPE) averaged 4.8%, nearly halving the error of its closest rival, while its directional accuracy reached 87%. The model effectively identified critical inflection points during periods of market volatility, specifically around Q4 2018 and Q1 2022, where traditional models lagged.

In terms of abnormal density spike detection at the individual-stock level, the LLM demonstrated superior early-warning capabilities, achieving a precision of 82%, recall of 79%, and an F1-score of 0.80, outperforming TFT, Prophet, and ARIMA. Case studies highlighted the model’s ability to predict residual-density surges in large-cap technology and financial stocks one quarter in advance, enabling potential risk-mitigation strategies such as portfolio rebalancing. The findings underscore the effectiveness of LLM-based forecasting in enhancing both point predictions and event detection within complex financial networks, suggesting that the model’s integration of graph-derived indicators and textual narratives is crucial for capturing structural breaks and nonlinear contagion patterns. Overall, the LLM-enhanced framework not only surpasses existing statistical benchmarks but also provides actionable insights for market participants and regulators in managing systemic risk.

Discussion

The discussion section of the research paper highlights the effectiveness of a large language model (LLM)-enhanced forecasting pipeline in predicting structural changes and systemic risks within financial networks. Utilizing a quarterly fund-stock holding panel from 2016 to 2024, the study constructs a bipartite network to analyze stock and fund relationships, subsequently projecting this onto a stock layer to derive key metrics such as degree centralization ($cen_d$) and residual density ($den$). The LLM architecture integrates both numerical and textual data, allowing for a nuanced understanding of market dynamics. The model outperforms traditional forecasting methods, achieving a mean absolute error (MAE) of 0.012 and a directional accuracy of 87%, indicating its robustness in capturing both gradual trends and abrupt anomalies.

The interpretability of the LLM model is underscored through attention visualizations, which reveal its focus on critical periods of market shifts and key stocks that act as conduits for information diffusion. This capability not only enhances predictive accuracy but also provides actionable insights for risk management. The findings suggest that the LLM framework can serve as a valuable tool for financial supervision, enabling early detection of systemic risks and improving portfolio strategies. Overall, the research demonstrates that LLMs can effectively bridge the gap between quantitative forecasting and qualitative market narratives, offering a scalable and interpretable approach to financial time-series analysis.

Limitations

The section on limitations highlights several critical areas for improvement in the research findings. Although the model exhibits robust generalization across various temporal horizons, its efficacy may diminish in the face of regime shifts not represented in the training data, such as unexpected policy changes or geopolitical events. Furthermore, while the interpretability analysis provides valuable insights, it remains qualitative; future research could enhance this aspect by integrating quantitative explainability metrics or causal inference frameworks to bolster the credibility of attention-based interpretations.

Additionally, the authors suggest that extending the framework to encompass multi-market or cross-border datasets would further evaluate its scalability and relevance in global systemic-risk scenarios. Overall, the findings indicate that large language models (LLMs) signify a significant advancement in financial time-series forecasting, offering a combination of accuracy, interpretability, and actionable insights. By merging predictive modeling with explainable mechanisms and economically relevant signals, the proposed framework paves the way for a new paradigm in financial analytics that prioritizes both performance and comprehension within complex market environments.