YOLO-BS: خوارزمية كشف علامات المرور تعتمد على YOLOv8 YOLO-BS: a traffic sign detection algorithm based on YOLOv8

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-88184-0
PMID: https://pubmed.ncbi.nlm.nih.gov/40038318
تاريخ النشر: 2025-03-04
المؤلف: Hong Zhang وآخرون
الموضوع الرئيسي: التعرف على لوحات ترخيص المركبات

نظرة عامة

تقدم الورقة YOLO-BS، وهو خوارزمية متقدمة لاكتشاف إشارات المرور مبنية على إطار عمل YOLOv8، تهدف إلى معالجة قيود طرق الكشف الحالية التي تعاني من الدقة والأداء في الوقت الحقيقي في البيئات الديناميكية. تتضمن الخوارزمية المقترحة طبقة اكتشاف الأجسام الصغيرة لتعزيز استخراج الميزات لإشارات المرور الصغيرة، التي غالبًا ما تكون صعبة الاكتشاف. علاوة على ذلك، تدمج شبكة هرم الميزات ثنائية الاتجاه (BiFPN) لتحسين قدرة النموذج في إدارة الأجسام متعددة المقاييس، مما يعزز بشكل كبير دقة الكشف.

تظهر التقييمات التجريبية التي أجريت على مجموعة بيانات TT100K أن YOLO-BS يتفوق على النماذج السائدة الحالية، محققًا دقة متوسطة (mAP) تبلغ 90.1% ومعدل إطارات في الثانية (FPS) يبلغ 78. تشير النتائج إلى أن التحسينات في YOLO-BS، وخاصة التركيز على اكتشاف الأجسام الصغيرة وإدارة الميزات متعددة المقاييس، تساهم في أدائه المتفوق. ستهدف الأبحاث المستقبلية إلى تحسين هذه الخوارزمية بشكل أكبر، واستكشاف تطبيقاتها المحتملة ضمن أنظمة النقل الذكية.

الطرق

توضح قسم المنهجية النهج المنهجي المستخدم في البحث للتحقيق في الفرضيات المحددة. استخدمت الدراسة مزيجًا من الطرق الكمية والنوعية، بما في ذلك التحليلات الإحصائية ودراسات الحالة، لضمان فهم شامل للظواهر قيد التحقيق. شملت جمع البيانات استبيانات وتجارب، مع التركيز على الحصول على نتائج موثوقة وصحيحة.

تم تطبيق تقنيات إحصائية، مثل تحليل الانحدار وANOVA، لتحليل البيانات الكمية، بينما تم استخدام التحليل الموضوعي لتفسير البيانات النوعية. تم تحديد حجم العينة بناءً على تحليل القوة لضمان قوة إحصائية كافية لاكتشاف التأثيرات المهمة. تم الالتزام بالاعتبارات الأخلاقية طوال الدراسة، بما في ذلك الموافقة المستنيرة وتدابير السرية للمشاركين. بشكل عام، تم تصميم المنهجية لاختبار أسئلة البحث بدقة وتقديم نتائج قوية.

النتائج

تظهر نتائج تجارب الإزالة الأداء المتفوق لنموذج YOLO-BS المقترح مقارنةً بنموذج YOLOv8 الأساسي ونسخته مع طبقة اكتشاف الأجسام الصغيرة. كما هو مفصل في الجدول 6، تم تقييم مقاييس رئيسية مثل GFLOPs، الدقة (P)، الاسترجاع (R)، متوسط الدقة عند 50 (mAP50)، متوسط الدقة من 50 إلى 95 (mAP50-95)، ومعدل الإطارات في الثانية (FPS). تشير النتائج إلى أن YOLOv8 حقق دقة تبلغ 81.7%، واسترجاع 73.8%، وmAP50 يبلغ 81.8%. أدت إضافة طبقة اكتشاف الأجسام الصغيرة إلى تحسين هذه المقاييس إلى 86.3% دقة، و79.2% استرجاع، و87.3% mAP50. النموذج الكامل YOLO-BS، الذي يدمج كل من طبقة اكتشاف الأجسام الصغيرة وBiFPN، عزز هذه القيم بشكل أكبر إلى 87.9% دقة، و80.5% استرجاع، و90.1% mAP50.

تؤكد النتائج أن YOLO-BS يتفوق بشكل كبير على YOLOv8 الأساسي عبر جميع المقاييس، على الرغم من انخفاض طفيف في FPS. لقد حسنت إضافة BiFPN وطبقة اكتشاف الأجسام الصغيرة بشكل ملحوظ قدرة النموذج على اكتشاف الأهداف الصغيرة ومتعددة المقاييس، وهو أمر حاسم لمهام الكشف في الوقت الحقيقي. تسهل آلية دمج الميزات الفعالة لـ BiFPN سرعات تشغيل عالية مع الحفاظ على الدقة، مما يبرز دورها الأساسي في تعزيز أنظمة الكشف. يسمح تدفق المعلومات ثنائي الاتجاه ودمج الميزات الموزونة في BiFPN بإدارة أفضل للميزات متعددة المقاييس، مما يحسن دقة الكشف عن الأجسام الصغيرة، مثل إشارات المرور التي يتم مواجهتها عادةً في سيناريوهات اكتشاف إشارات المرور.

المناقشة

خوارزمية الكشف YOLO-BS، كما تم مناقشتها في هذه الورقة، هي إطار عمل لاكتشاف إشارات المرور من مرحلة واحدة مصمم لتعزيز اكتشاف إشارات المرور، وخاصة الأصغر منها، من خلال بنية جديدة تتكون من العمود الفقري، والعنق، والرأس. يقوم العمود الفقري باستخراج الميزات من الصور على مقاييس متعددة، والتي يتم دمجها بعد ذلك في العنق قبل أن تتم معالجتها بواسطة رأس الكشف للتنبؤ بالموقع، والثقة، وتصنيف صناديق الكشف. تعتبر إضافة طبقة اكتشاف الأجسام الصغيرة ابتكارًا كبيرًا في هذا النموذج، حيث تستخدم خرائط ميزات عالية الدقة لتحسين الحساسية تجاه الأهداف الصغيرة، مما يعالج التحدي الذي تطرحه هيمنة إشارات المرور الصغيرة في السيناريوهات الواقعية.

تعمل دمج شبكة هرم الميزات ثنائية الاتجاه (BiFPN) على تحسين أداء النموذج من خلال تسهيل تدفق المعلومات ثنائي الاتجاه واستخدام آلية دمج الميزات الموزونة. وهذا يسمح بتجميع أكثر فعالية للميزات متعددة المقاييس، مما يعزز قدرة النموذج على اكتشاف الأهداف بأحجام مختلفة. تظهر النتائج التجريبية على مجموعة بيانات TT100K أن YOLO-BS تحقق مقاييس أداء متفوقة، بما في ذلك دقة تبلغ 87.9%، واسترجاع 80.5%، ومتوسط دقة (mAP) يبلغ 90.1%، متفوقًا على خوارزميات متقدمة أخرى مثل Faster R-CNN وYOLOv8. على الرغم من انخفاض طفيف في معدل الإطارات في الثانية (FPS)، فإن YOLO-BS يحافظ على توازن جيد بين السرعة والدقة، مما يجعله حلاً قابلاً للتطبيق لاكتشاف إشارات المرور في الوقت الحقيقي في أنظمة النقل الذكية. ستهدف الأعمال المستقبلية إلى تعزيز هذا التوازن بشكل أكبر من خلال تسريع الأجهزة وهياكل الشبكات الأكثر كفاءة.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-88184-0
PMID: https://pubmed.ncbi.nlm.nih.gov/40038318
Publication Date: 2025-03-04
Author(s): Hong Zhang et al.
Primary Topic: Vehicle License Plate Recognition

Overview

The paper presents YOLO-BS, an advanced traffic sign detection algorithm built upon the YOLOv8 framework, aimed at addressing the limitations of existing detection methods that struggle with accuracy and real-time performance in dynamic environments. The proposed algorithm incorporates a small object detection layer to enhance feature extraction for smaller traffic signs, which are often challenging to detect. Furthermore, it integrates a bidirectional feature pyramid network (BiFPN) to improve the model’s capability in managing multi-scale objects, thereby significantly boosting detection accuracy.

Experimental evaluations conducted on the TT100K dataset reveal that YOLO-BS outperforms current mainstream models, achieving a mean average precision (mAP) of 90.1% and a frames per second (FPS) rate of 78. The findings indicate that the enhancements in YOLO-BS, particularly the focus on small object detection and multi-scale feature handling, contribute to its superior performance. Future research will aim to refine this algorithm further, exploring its potential applications within intelligent transportation systems.

Methods

The methodology section outlines the systematic approach employed in the research to investigate the specified hypotheses. The study utilized a combination of quantitative and qualitative methods, including statistical analyses and case studies, to ensure a comprehensive understanding of the phenomena under investigation. Data collection involved surveys and experiments, with a focus on obtaining reliable and valid results.

Statistical techniques, such as regression analysis and ANOVA, were applied to analyze the quantitative data, while thematic analysis was used for qualitative data interpretation. The sample size was determined based on power analysis to ensure sufficient statistical power for detecting significant effects. Ethical considerations were adhered to throughout the study, including informed consent and confidentiality measures for participants. Overall, the methodology was designed to rigorously test the research questions and provide robust findings.

Results

The results of the ablation experiments demonstrate the superior performance of the proposed YOLO-BS model compared to the baseline YOLOv8 and its variant with a small object detection layer. As detailed in Table 6, key metrics such as GFLOPs, precision (P), recall (R), mean Average Precision at 50 (mAP50), mean Average Precision from 50 to 95 (mAP50-95), and frames per second (FPS) were evaluated. The findings indicate that YOLOv8 achieved a precision of 81.7%, recall of 73.8%, and mAP50 of 81.8%. The addition of the small object detection layer improved these metrics to 86.3% precision, 79.2% recall, and 87.3% mAP50. The full YOLO-BS model, which integrates both the small object detection layer and BiFPN, further enhanced these values to 87.9% precision, 80.5% recall, and 90.1% mAP50.

The results underscore that YOLO-BS significantly outperforms the baseline YOLOv8 across all metrics, despite a slight decrease in FPS. The incorporation of BiFPN and the small object detection layer has notably improved the model’s ability to detect small and multi-scale targets, which is crucial for real-time detection tasks. The efficient feature fusion mechanism of BiFPN facilitates high operational speeds while maintaining accuracy, highlighting its essential role in enhancing detection systems. The bidirectional information flow and weighted feature fusion in BiFPN allow for better management of multi-scale features, thereby improving detection accuracy for smaller objects, such as traffic signs commonly encountered in traffic sign detection scenarios.

Discussion

The YOLO-BS detection algorithm, as discussed in this paper, is a one-stage traffic sign detection framework designed to enhance the detection of traffic signs, particularly smaller ones, through a novel architecture comprising a backbone, neck, and head. The backbone extracts features from images at multiple scales, which are then fused in the neck before being processed by the detection head to predict the position, confidence, and classification of detection boxes. A significant innovation in this model is the introduction of a small object detection layer, which utilizes higher-resolution feature maps to improve sensitivity to small targets, addressing the challenge posed by the predominance of smaller traffic signs in real-world scenarios.

The integration of a Bidirectional Feature Pyramid Network (BiFPN) further optimizes the model’s performance by facilitating bidirectional information flow and employing a weighted feature fusion mechanism. This allows for a more effective aggregation of multi-scale features, enhancing the model’s ability to detect targets of varying sizes. Experimental results on the TT100K dataset demonstrate that YOLO-BS achieves superior performance metrics, including a precision of 87.9%, recall of 80.5%, and a mean Average Precision (mAP) of 90.1%, outperforming other advanced algorithms such as Faster R-CNN and YOLOv8. Despite a slight reduction in frames per second (FPS), YOLO-BS maintains a favorable balance between speed and accuracy, making it a viable solution for real-time traffic sign detection in intelligent transportation systems. Future work will aim to further enhance this balance through hardware acceleration and more efficient network architectures.