أتمتة استراتيجية البحث باستخدام LLM في الاستثمار الكمي Automate Strategy Finding with LLM in Quant Investment

المجلة: Findings of the Association for Computational Linguistics: EMNLP 2025
DOI: https://doi.org/10.18653/v1/2025.findings-emnlp.1005
تاريخ النشر: 2025-01-01
المؤلف: Zhizhuo Kou وآخرون
الموضوع الرئيسي: طرق التنبؤ بسوق الأسهم

نظرة عامة

تقدم هذه البحث إطار عمل مبتكر من ثلاث مراحل يدمج نماذج اللغة الكبيرة (LLMs) ضمن نظام متعدد الوكلاء مدرك للمخاطر لأتمتة تطوير الاستراتيجيات في المالية الكمية. يعالج الإطار قيود نماذج التعلم العميق التقليدية من خلال استخدام نماذج LLMs المصممة بعناية لتوليد مرشحات ألفا القابلة للتنفيذ من بيانات مالية متنوعة. يتضمن عملية تقييم متعددة الوسائط تعتمد على الوكلاء لتصفية هذه العوامل بناءً على ظروف السوق وجودة التنبؤ مع ضمان توازن الفئات. بالإضافة إلى ذلك، يتم استخدام تحسين الوزن الديناميكي للتكيف مع بيئات السوق المتغيرة. تشير النتائج التجريبية إلى أن هذا النهج يحقق عائدًا تراكميًا قدره 53.17% على مؤشر SSE50 من يناير 2023 إلى يناير 2024، متفوقًا بشكل كبير على المعايير المعمول بها من حيث الأداء المعدل حسب المخاطر وحماية الجانب السلبي.

في الختام، يجمع الإطار المقترح بفعالية بين LLMs وهياكل الوكلاء المتعددة للتخفيف من عدم الاستقرار الذي يوجد عادة في استراتيجيات الاستثمار الكمية التقليدية. من خلال توليد عوامل ألفا متنوعة وبناء وكلاء تداول مدربين على المخاطر، يظهر النظام أداءً متفوقًا عبر كل من الأسواق الصينية والأمريكية. تؤكد النتائج أن هذا الإطار لا يتجاوز فقط طرق توليد ألفا المتطورة، بل يوفر أيضًا هيكلًا قابلًا للتوسع والتكيف لاستخراج الإشارات المالية، مما يعمل بشكل مستقل عبر أنظمة السوق المختلفة.

مقدمة

تناقش مقدمة هذه الورقة البحثية التأثير التحويلي لنماذج اللغة الكبيرة (LLMs) وأنظمة الوكلاء المتعددة على المالية الكمية. تبرز كيف تقدمت LLMs من مجرد دعم أدوات التحليل إلى أن تصبح مشاركين أساسيين في اتخاذ القرارات المالية، كما يتضح من قدرات BloombergGPT في تحليل مشاعر السوق وتوليد إجراءات التداول بناءً على المعلومات السياقية من الأخبار وتقارير الأرباح. في الوقت نفسه، تعزز أنظمة الوكلاء المتعددة تحسين المحفظة من خلال التفاعلات اللامركزية، مما يمكّن الوكلاء من تحقيق عوائد متنوعة جيدًا وأداء محسّن معدل حسب المخاطر، مما يعالج الديناميات السوقية المعقدة التي لا تستطيع نماذج الوكلاء الفردية التعامل معها.

تحدد الورقة ثلاث تحديات حاسمة في تعدين ألفا – صلابة الطرق التقليدية، قضايا تنوع البيانات والتكامل، والتكيف مع تقلبات السوق. لمعالجة هذه التحديات، يقترح المؤلفون إطار عمل مبتكر مدفوع بـ LLM يتكون من ثلاثة مكونات: تعدين ألفا المرن، الذي يصنف عوامل ألفا من الأدبيات المالية؛ تقييم السوق متعدد الوكلاء متعدد الوسائط، الذي يجري اختبارات خلفية عبر ظروف سوق متنوعة؛ وتحسين الاستراتيجية الديناميكي، الذي يعدل أوزان عوامل ألفا بناءً على ظروف السوق الحالية. يهدف هذا الإطار بين التخصصات إلى تعزيز تحديد وتحسين عوامل ألفا عبر فئات الأصول المختلفة، مما يمثل تقدمًا كبيرًا في تطوير استراتيجيات استثمار قوية وقابلة للتكيف. كما يؤكد المؤلفون على أهمية جعل شفرة المصدر الخاصة بهم متاحة للجمهور لدعم الأبحاث المستقبلية وإمكانية التكرار.

طرق

تستخدم منهجية بناء المحفظة في هذه الدراسة إطار عمل لإعادة البناء اليومية يعتمد على استراتيجية اختيار top-k/drop-n. تتضمن كل جلسة تداول تصنيف الأوراق المالية حسب قيم ألفا الخاصة بها، والتي تعمل كمؤشرات كمية للعوائد الزائدة المتوقعة. يتم اختيار أفضل $k$ أسهم للإدراج في المحفظة، مستهدفة تلك التي تحمل أقوى الإشارات لاستغلال عدم كفاءة السوق على المدى القصير. يتم تطبيق نظام توزيع متساوي للأوزان لتوزيع رأس المال بشكل موحد بين الأوراق المالية المختارة، مما يعزز التنويع ويعكس الفكرة القائلة بأن إشارات ألفا تشير بشكل أساسي إلى الاتجاهات الاتجاهية بدلاً من التنبؤات الدقيقة للعائد.

لتحسين تكاليف المعاملات وكفاءة التشغيل، تفرض المنهجية حدًا على دوران المحفظة، مما يسمح بتعديل أقصى قدر من $n$ أوراق مالية لكل يوم تداول. يهدف هذا القيد إلى الحفاظ على التوافق مع إشارات ألفا الحالية مع تقليل تكاليف الاحتكاك في التداول. في الإعداد التجريبي المفصل في القسم 4.4، يتم تعيين المعلمات إلى $k = 13$ و $n = 5$، بناءً على اختبارات خلفية شاملة. يحقق هذا التكوين توازنًا بين التنويع وتخفيف الإشارات، مع الحفاظ على معدل دوران يومي يبلغ حوالي 38%، والذي يعتبر فعالًا من حيث نمذجة تكاليف المعاملات والتوازن بين استخدام الإشارات وتكاليف التنفيذ.

مناقشة

ت outlines قسم المناقشة في الورقة البحثية إطار عمل شامل لتطوير استراتيجيات عوامل ألفا في المالية الكمية، مع معالجة ثلاثة تحديات رئيسية: توضيح عوامل ألفا، توليد ألفا ديناميكية، وتحسين استراتيجيات ألفا. يدمج الإطار نماذج اللغة الكبيرة (LLMs) وأنظمة الوكلاء المتعددة لتعزيز طرق التداول الكمية التقليدية. يتم تمثيل عوامل ألفا رياضيًا كـ $\alpha (t)_{ij}$ للسهم $i$ في الفئة $j$ في الوقت $t$، مع تقييم قوتها التنبؤية باستخدام معامل المعلومات (IC)، الذي يقيس العلاقة بين ألفا المتوقعة والعوائد المستقبلية الفعلية.

توضح الورقة تمثيل عوامل ألفا من خلال مشغلات العرض المتقاطع والسلاسل الزمنية، مما يسمح بالدمج المرن للإشارات المالية. يتم اقتراح نهج اختيار ديناميكي لتحديد ألفا ذات الصلة بناءً على تقييمات الثقة والمخاطر، مما يضمن التكيف مع ظروف السوق المتغيرة. يتم بناء استراتيجية الاستثمار النهائية كتركيب وزني من ألفا المختارة، مع تعديل الأوزان وفقًا للحالة الحالية للسوق. يدعم هيكل الإطار التحسين المستمر والتطبيق عبر أسواق منظمة متنوعة، مما يظهر قوته وفعاليته في توليد استراتيجيات تداول متفوقة مقارنة بالطرق التقليدية. تشير النتائج التجريبية إلى أن الإطار المدفوع بـ LLM يتفوق باستمرار على المعايير، محققًا عوائد كبيرة والحفاظ على الاستقرار عبر بيئات السوق المختلفة.

القيود

تتعدد قيود الإطار المقترح. أولاً، تعتمد فعاليته بشكل كبير على جودة الوثائق المدخلة، والتي قد تعزز عن غير قصد التحيزات الموجودة (Deb et al., 2017; Ashok et al., 2018). ثانيًا، قد تفتقر عوامل ألفا التي تولدها نماذج اللغة الكبيرة (LLMs) أحيانًا إلى الحدس المالي الذي يمتلكه المحللون البشر، مما يؤدي إلى عوامل قد تكون نظرية صحيحة، لكنها قد لا تكون عملية (Tuarob et al., 2017). بالإضافة إلى ذلك، تفترض طريقة تقييم الوكلاء المتعددة وجود علاقات تاريخية مستقرة بين ظروف السوق وأداء ألفا، وهو افتراض قد يتعثر خلال فترات تغير أنظمة السوق.

علاوة على ذلك، كانت عملية التحقق من صحة الإطار تركز بشكل أساسي على أسواق الأسهم، مما يشير إلى الحاجة إلى مزيد من الدراسات التجريبية لتقييم قابليته للتطبيق عبر فئات الأصول المختلفة. يجب أن تهدف الأبحاث المستقبلية إلى التخفيف من هذه القيود من خلال التحقيق في هياكل مزيج الخبراء (MoE) لتعزيز كفاءة التعلم (Masoudnia and Ebrahimpour, 2014)، وتطوير هياكل وكلاء قابلة للتكيف، واستخدام تقنيات التعلم الانتقالي (Wang et al., 2023)، والنظر في الآثار التنظيمية (Han and Xi, 2020)، وتحسين الكفاءة الحسابية في التنفيذ.

Journal: Findings of the Association for Computational Linguistics: EMNLP 2025
DOI: https://doi.org/10.18653/v1/2025.findings-emnlp.1005
Publication Date: 2025-01-01
Author(s): Zhizhuo Kou et al.
Primary Topic: Stock Market Forecasting Methods

Overview

This research presents a novel three-stage framework that integrates Large Language Models (LLMs) within a risk-aware multi-agent system to automate strategy development in quantitative finance. The framework addresses the limitations of traditional deep learning models by utilizing prompt-engineered LLMs to generate executable alpha factor candidates from diverse financial data. It incorporates a multimodal agent-based evaluation process to filter these factors based on market conditions and predictive quality while ensuring category balance. Additionally, dynamic weight optimization is employed to adapt to changing market environments. Experimental results indicate that this approach achieves a cumulative return of 53.17% on the SSE50 index from January 2023 to January 2024, significantly outperforming established benchmarks in terms of risk-adjusted performance and downside protection.

In conclusion, the proposed framework effectively combines LLMs and multi-agent architectures to mitigate the instability commonly found in traditional quantitative investment strategies. By generating diversified alpha factors and constructing risk-calibrated trading agents, the system demonstrates superior performance across both Chinese and US markets. The findings confirm that this framework not only surpasses state-of-the-art alpha generation methods but also provides a scalable and adaptive architecture for financial signal extraction, functioning autonomously across various market regimes.

Introduction

The introduction of this research paper discusses the transformative impact of large language models (LLMs) and multi-agent systems on quantitative finance. It highlights how LLMs have progressed from merely supporting analytical tools to becoming integral participants in financial decision-making, as exemplified by BloombergGPT’s capabilities in parsing market sentiment and generating trading actions based on contextual information from news and earnings reports. Concurrently, multi-agent systems enhance portfolio optimization through decentralized interactions, enabling agents to achieve well-diversified returns and improved risk-adjusted performance, addressing complex market dynamics that single-agent models cannot.

The paper identifies three critical challenges in alpha mining—rigidity of traditional methods, data diversity and integration issues, and adaptation to market variability. To address these challenges, the authors propose a novel LLM-driven framework comprising three components: Flexible Alpha Mining, which categorizes alpha factors from financial literature; Multi-agent Multimodal Market Evaluation, which conducts backtesting across diverse market conditions; and Dynamic Strategy Optimization, which adjusts alpha factor weights based on current market conditions. This interdisciplinary framework aims to enhance the identification and optimization of alpha factors across various asset classes, representing a significant advancement in developing robust, adaptive investment strategies. The authors also emphasize the importance of making their source code publicly available to support future research and reproducibility.

Methods

The methodology for portfolio construction in this study employs a daily reconstruction framework based on a top-k/drop-n selection strategy. Each trading session involves ranking securities by their alpha values, which serve as quantitative indicators of expected excess returns. The top $k$ stocks are selected for inclusion in the portfolio, targeting those with the strongest signals to exploit short-term market inefficiencies. An equal-weighting scheme is applied to distribute capital uniformly among the selected securities, enhancing diversification and reflecting the notion that alpha signals primarily indicate directional trends rather than precise return predictions.

To optimize transaction costs and operational efficiency, the methodology imposes a limit on portfolio turnover, allowing a maximum of $n$ securities to be adjusted per trading day. This constraint aims to maintain alignment with current alpha signals while minimizing trading friction costs. In the experimental setup detailed in Section 4.4, the parameters are set to $k = 13$ and $n = 5$, based on extensive backtesting. This configuration achieves a balance between diversification and signal dilution, while maintaining a daily turnover rate of approximately 38%, which is deemed efficient in terms of transaction cost modeling and the trade-off between signal utilization and implementation costs.

Discussion

The discussion section of the research paper outlines a comprehensive framework for developing alpha factor strategies in quantitative finance, addressing three primary challenges: formalizing alpha factors, generating dynamic seed alphas, and optimizing alpha strategies. The framework integrates large language models (LLMs) and multi-agent systems to enhance traditional quantitative trading methods. Alpha factors are mathematically represented as $\alpha (t)_{ij}$ for stock $i$ in category $j$ at time $t$, with their predictive power assessed using the Information Coefficient (IC), which measures the correlation between predicted alphas and actual future returns.

The paper details the representation of alpha factors through cross-section and time-series operators, allowing for the flexible combination of financial signals. A dynamic selection approach is proposed to identify relevant alphas based on confidence and risk evaluations, ensuring adaptability to changing market conditions. The final investment strategy is constructed as a weighted combination of selected alphas, with weights adjusted according to current market status. The framework’s architecture supports continuous refinement and application across various structured markets, demonstrating its robustness and effectiveness in generating superior trading strategies compared to traditional methods. Experimental results indicate that the LLM-driven framework consistently outperforms benchmarks, achieving significant returns and maintaining stability across different market environments.

Limitations

The limitations of the proposed framework are multifaceted. Firstly, its effectiveness is heavily reliant on the quality of input documents, which may inadvertently reinforce existing biases (Deb et al., 2017; Ashok et al., 2018). Secondly, the alpha factors generated by large language models (LLMs) sometimes lack the financial intuition that human analysts possess, leading to factors that, while theoretically valid, may not be practical (Tuarob et al., 2017). Additionally, the multi-agent evaluation approach assumes stable historical relationships between market conditions and alpha performance, an assumption that could falter during periods of market regime shifts.

Moreover, the validation of the framework has predominantly focused on equity markets, indicating a need for further empirical studies to assess its applicability across different asset classes. Future research should aim to mitigate these limitations by investigating Mixture of Experts (MoE) architectures to enhance learning efficiency (Masoudnia and Ebrahimpour, 2014), developing adaptive agent architectures, employing transfer learning techniques (Wang et al., 2023), considering regulatory implications (Han and Xi, 2020), and optimizing computational efficiency in implementations.