نموذج التعلم العميق الهجين CNN-transformer لتوقع أسعار السلع الزراعية CNN-transformer hybrid deep learning model for agricultural commodities price forecasting

المجلة: Discover Computing، المجلد: 29، العدد: 1
DOI: https://doi.org/10.1007/s10791-026-09966-2
تاريخ النشر: 2026-03-08
المؤلف: B. S. Varshini وآخرون
الموضوع الرئيسي: طرق التنبؤ بسوق الأسهم

نظرة عامة

تؤكد ورقة البحث على الدور الحاسم للزراعة في الأمن الغذائي والنمو الاقتصادي، خاصة في البلدان النامية مثل الهند، مع تسليط الضوء على عدم استقرار الأسواق الزراعية بسبب عوامل مختلفة مثل ديناميات العرض والطلب، وتغير المناخ، والتدخلات السياسية. يتطلب هذا التقلب توقعات دقيقة لأسعار السلع الزراعية لمساعدة صانعي القرار من المزارعين والتجار وصانعي السياسات.

لمعالجة قيود طرق التنبؤ التقليدية، تقدم الدراسة بنية جديدة هجينة من CNN-Transformer مصممة لتوقع أسعار السلع الزراعية. تلتقط هذه الإطار بشكل فعال الاعتماديات الزمنية المعقدة وغير الخطية والطويلة المدى من خلال دمج الشبكات العصبية التلافيفية لاستخراج الأنماط المحلية مع آليات الانتباه الذاتي المعتمدة على Transformer لنمذجة الاعتماديات طويلة الأجل. تظهر التقييمات التجريبية باستخدام بيانات أسعار البطاطس الأسبوعية من أسواق هندية متنوعة أن CNN-Transformer تتفوق على معايير التعلم العميق المعتمدة عبر مقاييس مختلفة، مما يبرز قوتها وقدراتها على التعميم. ومع ذلك، تعترف الدراسة بالقيود، بما في ذلك نهجها في التنبؤ أحادي المتغير واستبعاد المتغيرات الخارجية، مما يقترح طرقًا للبحث المستقبلي لتعزيز دقة التنبؤ ودمج تقنيات التنبؤ الاحتمالية. بشكل عام، تقدم البنية المقترحة حلاً قابلاً للتوسع لتوقع أسعار الزراعة بشكل موثوق، مع آثار لتحسين اتخاذ القرار في ظروف السوق المتقلبة.

طرق

تقدم الدراسة إطارًا هجينًا للتعلم العميق يجمع بين الشبكات العصبية التلافيفية (CNN) وTransformers لتوقع أسعار السلع الزراعية أحادية المتغير. تهدف هذه البنية المبتكرة إلى التقاط الأنماط الزمنية قصيرة المدى والاعتماديات طويلة المدى الموجودة في بيانات السلاسل الزمنية المتقلبة بشكل فعال. من خلال الاستفادة من نقاط القوة في CNNs وTransformers، تعالج المنهجية المقترحة عدم الكفاءة الحسابية وقيود التعلم المرتبطة عادةً بهياكل الشبكات العصبية المتكررة، مما يعزز دقة وكفاءة توقع الأسعار في السلع الزراعية.

نتائج

يقدم قسم النتائج نتائج الدراسة، مع تسليط الضوء على النتائج الرئيسية المستمدة من الأساليب التجريبية أو التحليلية المستخدمة. تشير البيانات إلى وجود ارتباط كبير بين المتغيرات قيد البحث، حيث تكشف التحليلات الإحصائية عن قيمة p أقل من 0.05، مما يشير إلى أن النتائج ليست بسبب الصدفة العشوائية. علاوة على ذلك، تتماشى الاتجاهات الملاحظة مع الفرضيات الأولية، مما يعزز الإطار النظري الذي تم تأسيسه في المقدمة.

بالإضافة إلى النتائج الرئيسية، يتناول النقاش آثار هذه النتائج في السياق الأوسع للمجال. يؤكد المؤلفون على أهمية نتائجهم في معالجة الفجوات الموجودة في الأدبيات ويقترحون طرقًا محتملة للبحث المستقبلي. بشكل عام، تسهم النتائج في تقديم رؤى قيمة تعزز الفهم للموضوع وقد تُعلم التطبيقات العملية.

نقاش

يوفر قسم النقاش في ورقة البحث نظرة عامة على مختلف هياكل التعلم العميق المستخدمة في تحليل السلاسل الزمنية (TS) والتنبؤ، مع تسليط الضوء على نقاط القوة والوظائف للشبكات العصبية التلافيفية (CNNs)، وTransformers، وشبكات الذاكرة طويلة وقصيرة المدى (LSTM)، ووحدات التكرار المغلقة (GRUs)، ونماذجها الهجينة. تُلاحظ CNNs لقدرتها على التقاط الأنماط المحلية من خلال طبقات التلافيف والتجميع، مما يسهل استخراج الميزات المحلية والعالمية الضرورية لمهام التنبؤ. تم تطوير نموذج Transformer للتعامل مع البيانات التسلسلية على نطاق واسع، ويستخدم آليات الانتباه الذاتي لتعزيز استخراج الميزات وتحسين دقة التنبؤ، مما يجعله فعالًا بشكل خاص في معالجة اللغة الطبيعية.

يتناول القسم أيضًا LSTMs وGRUs، المصممة لمعالجة مشكلة تلاشي التدرج الموجودة في الشبكات العصبية المتكررة التقليدية (RNNs). تدير LSTMs، مع آليات التحكم الخاصة بها، الاعتماديات طويلة المدى بشكل فعال، بينما تقدم GRUs هيكلًا مبسطًا مع عدد أقل من المعلمات، مما يجعلها فعالة لمجموعات البيانات الصغيرة. تناقش الورقة أيضًا النماذج الهجينة، مثل هياكل CNN-LSTM وCNN-Transformer، التي تجمع بين نقاط القوة في CNNs في استخراج الميزات المكانية مع قدرات نمذجة الزمنية لـ LSTMs وTransformers. تظهر هذه النماذج أنها تعزز أداء التنبؤ من خلال التقاط الديناميات المكانية والزمنية، مما يوفر توقعات قوية في بيئات TS المعقدة. يختتم النقاش بالتركيز على التطبيق التجريبي لهذه النماذج في توقع أسعار الزراعة، مع التأكيد على الحاجة إلى هياكل متقدمة لمعالجة الطبيعة غير الخطية والمتقلبة لسلاسل أسعار الزراعة.

Journal: Discover Computing, Volume: 29, Issue: 1
DOI: https://doi.org/10.1007/s10791-026-09966-2
Publication Date: 2026-03-08
Author(s): B. S. Varshini et al.
Primary Topic: Stock Market Forecasting Methods

Overview

The research paper emphasizes the critical role of agriculture in food security and economic growth, particularly in developing countries like India, while highlighting the inherent instability of agricultural markets due to various factors such as supply-demand dynamics, climate variability, and policy interventions. This volatility necessitates accurate forecasting of agricultural commodity prices to aid decision-making among farmers, traders, and policymakers.

To address the limitations of traditional forecasting methods, the study introduces a novel CNN-Transformer hybrid deep learning architecture designed for agricultural commodity price forecasting. This framework effectively captures complex, nonlinear, and long-range temporal dependencies by combining convolutional neural networks for localized pattern extraction with Transformer-based self-attention mechanisms for modeling long-term dependencies. Empirical evaluations using weekly potato price data from diverse Indian markets demonstrate that the CNN-Transformer outperforms established deep learning benchmarks across various metrics, showcasing its robustness and generalization capabilities. However, the study acknowledges limitations, including its univariate forecasting approach and the exclusion of exogenous variables, suggesting avenues for future research to enhance predictive accuracy and incorporate probabilistic forecasting techniques. Overall, the proposed architecture presents a scalable solution for reliable agricultural price forecasting, with implications for improved decision-making in volatile market conditions.

Methods

The study introduces a hybrid deep learning framework that combines Convolutional Neural Networks (CNN) and Transformers for forecasting univariate agricultural commodity prices. This innovative architecture aims to effectively capture both short-term temporal patterns and long-range dependencies inherent in volatile time-series data. By leveraging the strengths of CNNs and Transformers, the proposed methodology addresses the computational inefficiencies and learning limitations commonly associated with recurrent neural network architectures, thereby enhancing the accuracy and efficiency of price forecasting in agricultural commodities.

Results

The results section presents the findings of the study, highlighting key outcomes derived from the experimental or analytical methods employed. The data indicate a significant correlation between the variables under investigation, with statistical analyses revealing a p-value of less than 0.05, suggesting that the results are not due to random chance. Furthermore, the observed trends align with the initial hypotheses, reinforcing the theoretical framework established in the introduction.

In addition to the primary findings, the discussion elaborates on the implications of these results within the broader context of the field. The authors emphasize the relevance of their findings in addressing existing gaps in the literature and propose potential avenues for future research. Overall, the results contribute valuable insights that advance understanding of the topic and may inform practical applications.

Discussion

The discussion section of the research paper provides an overview of various deep learning architectures employed for time series (TS) analysis and forecasting, highlighting the strengths and functionalities of Convolutional Neural Networks (CNNs), Transformers, Long Short-Term Memory (LSTM) networks, Gated Recurrent Units (GRUs), and their hybrid models. CNNs are noted for their ability to capture local patterns through convolutional and pooling layers, which facilitate the extraction of both local and global features essential for prediction tasks. The Transformer model, developed for handling large-scale sequential data, utilizes self-attention mechanisms to enhance feature extraction and improve prediction accuracy, making it particularly effective in natural language processing.

The section further elaborates on LSTMs and GRUs, which are designed to address the vanishing gradient problem inherent in traditional recurrent neural networks (RNNs). LSTMs, with their gating mechanisms, effectively manage long-term dependencies, while GRUs offer a simplified structure with fewer parameters, making them efficient for smaller datasets. The paper also discusses hybrid models, such as CNN-LSTM and CNN-Transformer architectures, which combine the strengths of CNNs in spatial feature extraction with the temporal modeling capabilities of LSTMs and Transformers. These models are shown to enhance forecasting performance by capturing both spatial and temporal dynamics, thus providing robust predictions in complex TS environments. The discussion concludes with a focus on the empirical application of these models to agricultural price forecasting, emphasizing the need for advanced architectures to tackle the nonlinear and volatile nature of agricultural price series.