نظام كشف بصري عميق لسرطان الخلايا الحرشفية الفموية Deep visual detection system for oral squamous cell carcinoma

المجلة: Scientific Reports، المجلد: 16، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-34332-5
PMID: https://pubmed.ncbi.nlm.nih.gov/41554886
تاريخ النشر: 2026-01-19
المؤلف: K. Akram وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في اكتشاف السرطان

نظرة عامة

تقدم ورقة البحث نظام الكشف البصري العميق (DVDS) الذي يهدف إلى أتمتة الكشف عن سرطان الخلايا الحرشفية الفموية (OSCC) من خلال الصور الهيستوباثولوجية. غالبًا ما تعيق الطرق التشخيصية التقليدية قيود الوقت والتفسير الذاتي، مما يؤدي إلى تباين في النتائج. تستفيد الدراسة من التقدم في الذكاء الاصطناعي، وخاصة التعلم العميق، لتعزيز دقة وكفاءة التشخيص. تم تدريب ثلاثة نماذج من الشبكات العصبية التلافيفية (CNN) – EfficientNetB3 و DenseNet121 و ResNet50 – على مجموعتين من البيانات: مجموعة بيانات كشف سرطان الفم من كاجل ومجموعة بيانات NDB-UFES. تم استخدام تحسين البيانات واستراتيجيات تدريب متقدمة لتحسين أداء النموذج.

تشير النتائج إلى أن EfficientNetB3 تفوق على النماذج الأخرى، محققًا دقة اختبار تبلغ 97.05% في مهمة التصنيف الثنائي و 97.16% في المهمة متعددة الفئات، مع دقة واسترجاع ودرجات F1 متسقة. بالمقابل، أظهرت DenseNet121 و ResNet50 دقة أقل بكثير. تؤكد النتائج على أهمية بنية النموذج والمعالجة المسبقة في تصنيف الصور الطبية، مما يشير إلى أن DVDS، وخاصة مع EfficientNetB3، لديه إمكانات قوية للتطبيق السريري في تشخيص OSCC. ومع ذلك، تعترف الدراسة بالقيود مثل الحاجة إلى التجارب السريرية وتنوع مجموعة البيانات، مما يبرز ضرورة المزيد من التحقق لتعزيز موثوقية النظام ودمجه في سير العمل التشخيصي.

مقدمة

يعد سرطان الفم قضية صحية عالمية هامة، حيث يحتل المرتبة السادسة كأكثر أنواع السرطان شيوعًا، مع حساب سرطان الخلايا الحرشفية الفموية (OSCC) لما يقرب من 90% من الحالات العدوانية. تشير منظمة الصحة العالمية (WHO) إلى أنه يتم تشخيص حوالي 657,000 حالة جديدة من سرطان الفم كل عام، مما يؤدي إلى أكثر من 330,000 حالة وفاة سنويًا. تؤكد هذه الإحصائيات على الحاجة الملحة لفهم وإدارة سرطان الفم بشكل أفضل، وخاصة OSCC، للتخفيف من تأثيره على الصحة العامة.

الطرق

تتناول قسم المنهجية في هذه الدراسة تطوير نظام الكشف البصري العميق (DVDS) لتصنيف سرطان الخلايا الحرشفية الفموية (OSCC) باستخدام نموذج EfficientNetB3 المعدل بدقة. تم اختيار هذا النموذج لتحقيق توازن مثالي بين الكفاءة والأداء، مع دمج ميزات مثل قاعدة مدربة مسبقًا، وتطبيع الدفعات، وطبقة كثيفة منتظمة، والتخلي، ومصنف softmax. لأغراض التحليل المقارن، تم تنفيذ DenseNet121 و ResNet50 أيضًا تحت ظروف تدريب متطابقة. تشمل المنهجية استراتيجية التدريب، وتكوينات النموذج، ومقاييس التقييم، والإعداد التجريبي العام، مع توضيح سير العمل التدريبي الشامل في الشكل 3.

تم إجراء الإعداد التجريبي على جهاز MacBook Pro (2021) مزود بشريحة Apple M1 Max، مما يضمن قدرات حسابية قوية مع معالج مركزي مكون من 10 نوى ومعالج رسومي مكون من 32 نواة، بالإضافة إلى 32 جيجابايت من الذاكرة الموحدة و1 تيرابايت من تخزين SSD. تم استخدام بيئة البرمجيات macOS مع Python 3.9.6، مع الاستفادة من المكتبات المثبتة عبر pip، بما في ذلك Jupyter Lab ومجموعة متنوعة من أطر التعلم العميق. تم تنفيذ جميع العمليات، بما في ذلك المعالجة المسبقة، والتدريب، والتقييم، محليًا للحفاظ على الاتساق وقابلية التكرار.

تشير النتائج إلى أن DVDS، المعزز بتحسين البيانات، والمعالجة المسبقة، واستدعاءات التدريب، تفوق بشكل كبير على الطرق الحالية في كشف سرطان الفم، محققًا دقة تبلغ 97.04% على مجموعة بيانات كاجل و97.15% على مجموعة بيانات NDB-UFES. بالمقابل، حققت DenseNet121 و ResNet50 أداءً أقل، مما يثبت أن DVDS هو الأكثر دقة وموثوقية في تصنيف OSCC الهيستوباثولوجي.

النتائج

يقيم قسم النتائج أداء ثلاثة نماذج من التعلم العميق – EfficientNetB3 و DenseNet121 و ResNet50 – في تصنيف سرطان الفم باستخدام مجموعة بيانات مكونة من 5,192 صورة، تم تقسيمها إلى 70% للتدريب و30% للاختبار. تم تقييم النماذج بناءً على عدة مقاييس، بما في ذلك الدقة، والدقة، والاسترجاع، ودرجة F1، والخصوصية، والحساسية، وتحليل مصفوفة الارتباك، لتحديد قدراتها في التصنيف وتحديد الاتجاهات في الأخطاء في التصنيف.

تم إجراء جميع التجارب على جهاز MacBook Pro 2021 مزود بشريحة Apple M1 Max، و32 جيجابايت من ذاكرة الوصول العشوائي، و1 تيرابايت من SSD، باستخدام Python 3.9.6 ضمن بيئة Jupyter Lab. كانت مجموعة البيانات تتكون من فئتين: طبيعية وسرطان الخلايا الحرشفية الفموية (OSCC)، مما يوفر إطارًا واضحًا لتقييم فعالية النماذج في التمييز بين الأنسجة الصحية والسرطانية.

المناقشة

تسلط قسم المناقشة في ورقة البحث الضوء على العبء العالمي الكبير لسرطان الخلايا الحرشفية الفموية (OSCC)، خاصة في المناطق النامية مثل جنوب وجنوب شرق آسيا، حيث تكون معدلات الإصابة تقريبًا ضعف المتوسط العالمي. تمثل الهند وحدها ثلث حالات OSCC في العالم، حيث تُبلغ باكستان عن سرطان الفم كأكثر أنواع السرطان انتشارًا بين الذكور. تشير الورقة إلى وجود تفاوت ملحوظ بين الجنسين في معدل الإصابة بـ OSCC، حيث يكون الرجال أكثر عرضة لتطوير المرض بحوالي 2.5 مرة، ويرجع ذلك أساسًا إلى عوامل نمط الحياة مثل استخدام التبغ والكحول. على الرغم من التقدم في الرعاية الصحية، فإن معدل الإصابة بـ OSCC في البلدان المتقدمة في تزايد، حيث تتوقع جمعية السرطان الأمريكية أكثر من 54,000 حالة جديدة سنويًا في الولايات المتحدة. تشير قاعدة بيانات GLOBOCAN 2022 إلى أن OSCC شكلت 1.9% من جميع أنواع السرطان، مما أدى إلى حوالي 188,230 حالة وفاة في جميع أنحاء العالم، مما يبرز الحاجة الملحة لتحسين طرق الكشف والعلاج.

تؤكد الورقة على الدور الحاسم للكشف المبكر في تحسين نتائج المرضى، مما قد يزيد من معدل البقاء لمدة خمس سنوات من 20-30% في المراحل المتقدمة إلى ما يقرب من 80% عند التشخيص المبكر. ومع ذلك، تواجه الطرق التشخيصية التقليدية تحديات، بما في ذلك الاعتماد على أطباء الأمراض الخبراء والتقييمات الذاتية، مما قد يؤدي إلى تأخير أو تشخيصات غير صحيحة. لمعالجة هذه القضايا، تقدم البحث نظام الكشف البصري العميق (DVDS)، وهو إطار تعلم عميق يستفيد من EfficientNetB3 للكشف التلقائي عن OSCC من الصور الهيستوباثولوجية. حقق النموذج معدلات دقة مثيرة للإعجاب تبلغ 97.05% للتصنيف الثنائي و97.16% للتصنيف متعدد الفئات، متفوقًا على النماذج الحالية مثل DenseNet121 و ResNet50. تدعو نتائج الدراسة إلى دمج تقنيات الذكاء الاصطناعي المتقدمة في البيئات السريرية لتحسين دقة وكفاءة التشخيص، مما يعزز في النهاية رعاية المرضى في مكافحة OSCC.

Journal: Scientific Reports, Volume: 16, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-34332-5
PMID: https://pubmed.ncbi.nlm.nih.gov/41554886
Publication Date: 2026-01-19
Author(s): K. Akram et al.
Primary Topic: AI in cancer detection

Overview

The research paper presents a Deep Visual Detection System (DVDS) aimed at automating the detection of Oral Squamous Cell Carcinoma (OSCC) through histopathological images. Traditional diagnostic methods are often hindered by time constraints and subjective interpretation, leading to variability in results. The study leverages advancements in Artificial Intelligence, specifically deep learning, to enhance diagnostic accuracy and efficiency. Three convolutional neural network (CNN) models—EfficientNetB3, DenseNet121, and ResNet50—were trained on two datasets: the Kaggle Oral Cancer Detection dataset and the NDB-UFES dataset. Data augmentation and advanced training strategies were employed to improve model performance.

Results indicate that EfficientNetB3 outperformed the other models, achieving a test accuracy of 97.05% on the binary classification task and 97.16% on the multi-class task, with consistent precision, recall, and F1-scores. In contrast, DenseNet121 and ResNet50 exhibited significantly lower accuracy. The findings underscore the importance of model architecture and preprocessing in medical image classification, suggesting that the DVDS, particularly with EfficientNetB3, has strong potential for clinical application in OSCC diagnosis. However, the study acknowledges limitations such as the need for clinical trials and dataset diversity, emphasizing the necessity for further validation to enhance the system’s reliability and integration into diagnostic workflows.

Introduction

Oral cancer is a significant global health concern, ranking as the sixth most common cancer type, with oral squamous cell carcinoma (OSCC) accounting for nearly 90% of aggressive cases. The World Health Organization (WHO) reports that approximately 657,000 new cases of oral cancer are diagnosed each year, resulting in over 330,000 deaths annually. These statistics underscore the critical need for improved understanding and management of oral cancer, particularly OSCC, to mitigate its impact on public health.

Methods

The methodology section of this study details the development of a deep visual detection system (DVDS) for the classification of oral squamous cell carcinoma (OSCC) using a fine-tuned EfficientNetB3 model. This model was selected for its optimal balance between efficiency and performance, incorporating features such as a pre-trained base, batch normalization, a regularized dense layer, dropout, and a softmax classifier. For comparative analysis, DenseNet121 and ResNet50 were also implemented under identical training conditions. The methodology encompasses the training strategy, model configurations, evaluation metrics, and the overall experimental setup, with a comprehensive training workflow illustrated in Figure 3.

The experimental setup was conducted on a MacBook Pro (2021) equipped with an Apple M1 Max chip, ensuring robust computational capabilities with a 10-core CPU and a 32-core GPU, alongside 32 GB of unified memory and 1 TB of SSD storage. The software environment utilized macOS with Python 3.9.6, employing libraries installed via pip, including Jupyter Lab and various deep learning frameworks. All processes, including preprocessing, training, and evaluation, were executed locally to maintain consistency and reproducibility.

The results indicate that the DVDS, enhanced with data augmentation, preprocessing, and training callbacks, significantly outperformed existing methods in oral cancer detection, achieving accuracies of 97.04% on the Kaggle dataset and 97.15% on the NDB-UFES dataset. In contrast, DenseNet121 and ResNet50 yielded lower performance metrics, establishing the DVDS as the most accurate and reliable approach for histopathological OSCC classification.

Results

The results section evaluates the performance of three deep learning models—EfficientNetB3, DenseNet121, and ResNet50—in classifying oral cancer using a dataset of 5,192 images, which was split into 70% for training and 30% for testing. The models were assessed based on several metrics, including accuracy, precision, recall, F1-score, specificity, sensitivity, and confusion matrix analysis, to determine their classification capabilities and identify trends in misclassification.

All experiments were conducted on a MacBook Pro 2021 equipped with an Apple M1 Max chip, 32 GB of RAM, and a 1 TB SSD, utilizing Python 3.9.6 within the Jupyter Lab environment. The dataset consisted of two classes: Normal and Oral Squamous Cell Carcinoma (OSCC), providing a clear framework for evaluating the models’ effectiveness in distinguishing between healthy and cancerous tissue.

Discussion

The discussion section of the research paper highlights the significant global burden of Oral Squamous Cell Carcinoma (OSCC), particularly in developing regions such as South and Southeast Asia, where incidence rates are nearly double the global average. India alone accounts for one-third of the world’s OSCC cases, with Pakistan reporting oral cancer as the most prevalent cancer among males. The paper notes a pronounced gender disparity in OSCC incidence, with men being approximately 2.5 times more likely to develop the disease, primarily due to lifestyle factors like tobacco and alcohol use. Despite advancements in healthcare, the incidence of OSCC is rising in developed countries, with the American Cancer Society projecting over 54,000 new cases annually in the United States. The GLOBOCAN 2022 database indicates that OSCC accounted for 1.9% of all cancers, leading to approximately 188,230 deaths worldwide, underscoring the urgent need for improved detection and treatment methods.

The paper emphasizes the critical role of early detection in enhancing patient outcomes, potentially increasing the five-year survival rate from 20-30% at advanced stages to nearly 80% when diagnosed early. However, conventional diagnostic methods face challenges, including reliance on expert pathologists and subjective evaluations, which can lead to delayed or incorrect diagnoses. To address these issues, the research introduces the Deep Visual Detection System (DVDS), a deep learning framework leveraging EfficientNetB3 for automatic OSCC detection from histopathological images. The model achieved impressive accuracy rates of 97.05% for binary classification and 97.16% for multi-class classification, outperforming existing models like DenseNet121 and ResNet50. The study’s findings advocate for the integration of advanced AI techniques in clinical settings to improve diagnostic accuracy and efficiency, ultimately enhancing patient care in the fight against OSCC.