نماذج التعلم العميق الهجينة لاكتشاف الأخبار المزيفة: دراسة حالة على اللغتين العربية والإنجليزية Hybrid deep learning models for fake news detection: case study on Arabic and English languages

المجلة: Frontiers in Big Data، المجلد: 8
DOI: https://doi.org/10.3389/fdata.2025.1683786
PMID: https://pubmed.ncbi.nlm.nih.gov/41568095
تاريخ النشر: 2026-01-06
المؤلف: Baqer M. Merzah وآخرون
الموضوع الرئيسي: المعلومات المضللة وتأثيراتها

نظرة عامة

تتناول ورقة البحث القضية الملحة لاكتشاف الأخبار المزيفة (FND) باللغتين العربية والإنجليزية، مع تسليط الضوء على التحديات التي تطرحها الانتشار السريع للمعلومات المضللة في البيئات عبر الإنترنت. يقترح المؤلفون نموذج تعلم عميق جديد يجمع بين شبكة عصبية تلافيفية متعددة القنوات (CNN) مع شبكتين ثنائيتين من الذاكرة طويلة وقصيرة المدى (BiLSTM). تلتقط هذه البنية بفعالية كل من الميزات النصية الدلالية والمحلية من خلال نموذج FastText المدرب مسبقًا، تليها طبقة تجميع عالمية لتعزيز استخراج الميزات. تم تقييم النموذج بدقة على ثلاثة مجموعات بيانات مرجعية—AFND و ANS للعربية، و WELFake للإنجليزية—محققًا دقة مثيرة للإعجاب تبلغ 94.43% ± 0.19%، 71.63% ± 1.45%، و 98.85% ± 0.03%، على التوالي، متجاوزًا الأساليب المتطورة.

على الرغم من أدائه القوي، تعترف الدراسة بالقيود، خاصة فيما يتعلق بتعقيدات اللغة العربية، والتي تشمل الثراء الصرفي وتنوع اللهجات. كما أن تركيز النموذج على البيانات النصية يقيد قابليته للتطبيق، حيث أن المعلومات المضللة في العالم الحقيقي غالبًا ما تشمل محتوى متعدد الوسائط. من المخطط أن يتم العمل في المستقبل لتعزيز قدرات النموذج من خلال دمج لغات إضافية وتوسيع إطاره ليشمل الصور ومقاطع الفيديو، مما يعالج قيود الكشف القائم على النص فقط. تؤكد النتائج على إمكانيات النموذج للتطبيقات العملية في تحسين موثوقية المعلومات على الشبكات الاجتماعية عبر الإنترنت.

مقدمة

تناقش مقدمة الورقة ارتفاع الشبكات الاجتماعية عبر الإنترنت (OSNs) مثل فيسبوك، تويتر، وإنستغرام كمصادر رئيسية للمعلومات، مما أدى إلى تحديات كبيرة تتعلق بمصداقية المحتوى المشترك، وخاصة انتشار الأخبار المزيفة. الأخبار المزيفة، التي تشمل المعلومات المضللة والمعلومات الخاطئة، كانت بارزة بشكل ملحوظ خلال أحداث مثل انتخابات الرئاسة الأمريكية 2016، حيث تم تداول معلومات مضللة عن المرشحين على نطاق واسع. يبرز المؤلفون الحاجة الملحة إلى أنظمة فعالة لاكتشاف الأخبار المزيفة (FND)، وخاصة باللغة العربية، التي تقدم تحديات فريدة بسبب تركيبها المعقد، وقواعدها، ولهجاتها، بالإضافة إلى نقص الموارد مقارنة باللغة الإنجليزية.

تحدد الورقة فجوة في الأدبيات الحالية فيما يتعلق بتطبيق نماذج التعلم العميق (DL) لاكتشاف الأخبار المزيفة باللغة العربية، مشيرة إلى أنه على الرغم من تحقيق تقدم كبير في اللغة الإنجليزية، إلا أن دمج نماذج مثل الشبكات العصبية التلافيفية (CNNs) وشبكات الذاكرة طويلة وقصيرة المدى (LSTM) في العربية لا يزال غير مستكشف بشكل كاف. يقترح المؤلفون نموذجًا هجينًا جديدًا يجمع بين بنى BiLSTM المزدوجة وCNN متعددة القنوات لتعزيز استخراج الميزات وتحسين اكتشاف الأخبار المزيفة باللغتين العربية والإنجليزية. يهدف هذا النموذج إلى الاستفادة من نقاط القوة في كلا النهجين لإنشاء تمثيل ميزات أكثر قوة، مع معالجة قيود المنهجيات الحالية والمساهمة في مجال اكتشاف المعلومات المضللة. يتم توضيح هيكل الورقة، مما يشير إلى استكشاف شامل للأدبيات ذات الصلة، وتصميم النموذج، والنتائج التجريبية، واتجاهات البحث المستقبلية.

الطرق

في هذا القسم، يوضح المؤلفون إعداد التجارب وتكوينات المعلمات الفائقة المستخدمة لتقييم نموذجهم المقترح، الذي يعالج عدم توازن الفئات من خلال التحقق المتقاطع الطبقي 5-fold. تم إجراء التجارب باستخدام بايثون عبر Google Colab، مع تنفيذ نماذج التعلم العميق عبر TensorFlow و Keras. تتضمن البنية شبكة BiLSTM مكدسة بطبقتين مع 64 و 128 خلية ذاكرة وشبكة CNN متعددة القنوات مع ثلاث طبقات من 128 فلتر. لتخفيف الإفراط في التكيف، تم استخدام معدل إسقاط قدره 0.5، وتم استخدام مُحسِّن آدم مع حجم دفعة قدره 32. كما أعاد المؤلفون إنتاج معايير قوية من المحولات (AraBERT، MARBERT، BERT، و DistilBERT) تحت ظروف متطابقة للمقارنة العادلة.

تم تقييم أداء النموذج مقابل الأساليب المتطورة (SOTA) عبر ثلاث مجموعات بيانات: AFND، ANS، و WELFake. على مجموعة بيانات AFND، حقق النموذج المقترح دقة متوسطة قدرها \(94.43 \pm 0.19\%\)، متجاوزًا المنافسين بفوارق ملحوظة. بالنسبة لمجموعة بيانات ANS، أظهر دقة متوسطة قدرها \(71.63 \pm 1.45\%\) ودرجة F1 قدرها \(79.6 \pm 1.42\%\)، متفوقًا على نماذج SOTA السابقة. كانت نتائج مجموعة بيانات WELFake مثيرة للإعجاب بشكل خاص، حيث حقق النموذج دقة قدرها \(98.85 \pm 0.03\%\) ودرجة AUC-PR قدرها \(99.9\%\)، مما يدل على قوته في تحديد الأخبار المزيفة. بشكل عام، تشير النتائج إلى أن البنية الهجينة واستراتيجية الإفراط في العينة تعزز بشكل كبير أداء النموذج، خاصة في سيناريوهات البيانات غير المتوازنة.

النتائج

يقدم قسم النتائج النتائج الكمية من التجارب التي أجريت لتقييم النموذج المقترح. تشير البيانات المجمعة إلى تحسين كبير في مقاييس الأداء مقارنة بالنماذج الأساسية، مما يظهر فعالية النهج المقترح. يتم تفصيل نتائج عددية محددة، بما في ذلك معدلات الدقة، والدقة، والاسترجاع، مع تسليط الضوء على نقاط قوة النموذج في سيناريوهات الاختبار المختلفة.

بالإضافة إلى ذلك، تدعم التحليلات الإحصائية، مثل قيم p وفترات الثقة، قوة النتائج، مؤكدة أن التحسينات الملحوظة ذات دلالة إحصائية. بشكل عام، تدعم النتائج الفرضية القائلة بأن النموذج المقترح يتفوق على المنهجيات الحالية في التطبيقات المستهدفة.

المناقشة

تسلط قسم المناقشة في ورقة البحث الضوء على المشهد الحالي لأساليب اكتشاف الأخبار المزيفة (FND)، مع التركيز على التباين بين الأساليب للغتين الإنجليزية والعربية. تحدد فجوة كبيرة في أبحاث FND باللغة العربية مقارنة باللغة الإنجليزية، مع استكشاف نماذج وتقنيات متنوعة في الأدبيات. بالنسبة للغة الإنجليزية، تشمل المساهمات الملحوظة استخدام تحليل N-gram، وهياكل CNN-LSTM، ونماذج هجينة مثل BiLSTM مع آليات الانتباه الذاتي، محققة معدلات دقة عالية. في المقابل، ركزت دراسات FND باللغة العربية بشكل أساسي على أساليب التعلم العميق، مثل تركيبات BiLSTM وCNN، مع تحقيق بعض النماذج دقة تتجاوز 90%. يبرز القسم قيود الأساليب التقليدية مثل Bag of Words (BoW) و TF-IDF، التي تفشل في التقاط المعنى الدلالي، وينتقد النماذج الهجينة التسلسلية لفقدان المعلومات المحتمل.

يقترح المؤلفون بنية هجينة جديدة متوازية تستخرج الميزات بشكل مستقل باستخدام BiLSTM مزدوجة وCNN متعددة القنوات، بهدف تعزيز تمثيل البيانات النصية. يُفترض أن يحسن هذا النهج الدقة في تمييز الأخبار المزيفة عن الأخبار الحقيقية من خلال التقاط كل من الاعتماديات بعيدة المدى والأنماط المحلية بشكل فعال. تختتم المناقشة بالتأكيد على أهمية معالجة التحديات الفريدة التي تطرحها اللغة العربية في FND، داعية إلى نماذج أكثر شمولاً وقوة تستفيد من تقنيات التعلم العميق المتقدمة.

Journal: Frontiers in Big Data, Volume: 8
DOI: https://doi.org/10.3389/fdata.2025.1683786
PMID: https://pubmed.ncbi.nlm.nih.gov/41568095
Publication Date: 2026-01-06
Author(s): Baqer M. Merzah et al.
Primary Topic: Misinformation and Its Impacts

Overview

The research paper addresses the pressing issue of fake news detection (FND) in both Arabic and English, highlighting the challenges posed by the rapid dissemination of misinformation in online environments. The authors propose a novel deep learning model that combines a multi-channel Convolutional Neural Network (CNN) with dual Bidirectional Long Short-Term Memory (BiLSTM) networks. This architecture effectively captures both semantic and local textual features through a pre-trained FastText model, followed by a global maxpooling layer to enhance feature extraction. The model was rigorously evaluated on three benchmark datasets—AFND and ANS for Arabic, and WELFake for English—yielding impressive accuracies of 94.43% ± 0.19%, 71.63% ± 1.45%, and 98.85% ± 0.03%, respectively, surpassing state-of-the-art methods.

Despite its robust performance, the study acknowledges limitations, particularly regarding the complexities of the Arabic language, which include morphological richness and dialectal diversity. The model’s focus on textual data also restricts its applicability, as real-world misinformation often encompasses multimodal content. Future work is planned to enhance the model’s capabilities by incorporating additional languages and extending its framework to include images and videos, thereby addressing the limitations of text-only detection. The findings underscore the model’s potential for practical applications in improving information reliability on online social networks.

Introduction

The introduction of the paper discusses the rise of online social networks (OSNs) such as Facebook, Twitter, and Instagram as primary sources of information, which has led to significant challenges regarding the credibility of shared content, particularly the proliferation of fake news. Fake news, encompassing misinformation and disinformation, has been notably prevalent during events like the 2016 U.S. presidential elections, where misleading information about candidates circulated widely. The authors highlight the urgent need for effective fake news detection (FND) systems, particularly in Arabic, which presents unique challenges due to its complex morphology, grammar, and dialects, as well as a lack of resources compared to English.

The paper identifies a gap in the existing literature regarding the application of deep learning (DL) models for FND in Arabic, noting that while significant advancements have been made in English, the integration of models like Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) networks in Arabic remains underexplored. The authors propose a novel hybrid model that combines dual BiLSTM and multi-channel CNN architectures to enhance feature extraction and improve the detection of fake news in both Arabic and English. This model aims to leverage the strengths of both approaches to create a more robust feature representation, addressing the limitations of current methodologies and contributing to the field of misinformation detection. The structure of the paper is outlined, indicating a comprehensive exploration of related literature, model design, experimental results, and future research directions.

Methods

In this section, the authors detail the experimental setup and hyperparameter configurations used to evaluate their proposed model, which addresses class imbalance through stratified 5-fold cross-validation. The experiments were conducted in Python using Google Colab, with deep learning models implemented via TensorFlow and Keras. The architecture includes a two-layer stacked BiLSTM network with 64 and 128 memory cells and a multi-channel CNN with three layers of 128 filters. To mitigate overfitting, a dropout rate of 0.5 was employed, and the Adam optimizer was utilized with a batch size of 32. The authors also reproduced strong transformer baselines (AraBERT, MARBERT, BERT, and DistilBERT) under identical conditions for fair comparison.

The model’s performance was assessed against state-of-the-art (SOTA) methods across three datasets: AFND, ANS, and WELFake. On the AFND dataset, the proposed model achieved a mean accuracy of \(94.43 \pm 0.19\%\), surpassing competitors by notable margins. For the ANS dataset, it demonstrated an average accuracy of \(71.63 \pm 1.45\%\) and an F1 score of \(79.6 \pm 1.42\%\), outperforming previous SOTA models. The WELFake dataset results were particularly impressive, with the model achieving an accuracy of \(98.85 \pm 0.03\%\) and an AUC-PR score of \(99.9\%\), indicating its robustness in identifying fake news. Overall, the findings suggest that the hybrid architecture and oversampling strategy significantly enhance model performance, particularly in imbalanced data scenarios.

Results

The Results section presents the quantitative findings from the experiments conducted to evaluate the proposed model. The data collected indicates a significant improvement in performance metrics compared to baseline models, demonstrating the efficacy of the suggested approach. Specific numerical results, including accuracy rates, precision, and recall, are detailed, highlighting the model’s strengths in various test scenarios.

Additionally, statistical analyses, such as p-values and confidence intervals, support the robustness of the findings, confirming that the observed improvements are statistically significant. Overall, the results substantiate the hypothesis that the proposed model outperforms existing methodologies in the targeted applications.

Discussion

The discussion section of the research paper highlights the current landscape of fake news detection (FND) methodologies, particularly contrasting approaches for English and Arabic languages. It identifies a significant gap in Arabic FND research compared to English, with various models and techniques explored in the literature. For English, notable contributions include the use of N-gram analysis, CNN-LSTM architectures, and hybrid models like BiLSTM with self-attention mechanisms, achieving high accuracy rates. In contrast, Arabic FND studies have primarily focused on deep learning methods, such as BiLSTM and CNN combinations, with some models achieving accuracies exceeding 90%. The section emphasizes the limitations of traditional methods like Bag of Words (BoW) and TF-IDF, which fail to capture semantic meaning, and critiques sequential hybrid models for potential information loss.

The authors propose a novel parallel hybrid architecture that independently extracts features using dual BiLSTM and multi-channel CNNs, aiming to enhance the representation of textual data. This approach is posited to improve accuracy in distinguishing fake from real news by effectively capturing both long-range dependencies and local patterns. The discussion concludes by underscoring the importance of addressing the unique challenges posed by the Arabic language in FND, advocating for more comprehensive and robust models that leverage advanced deep learning techniques.