دمج نماذج CNN وطرق التعلم الآلي في تصنيف درجات الائتمان: تحويل الصور ثنائية الأبعاد واستخراج الميزات Integration of CNN Models and Machine Learning Methods in Credit Score Classification: 2D Image Transformation and Feature Extraction

المجلة: Computational Economics، المجلد: 65، العدد: 5
DOI: https://doi.org/10.1007/s10614-025-10893-5
تاريخ النشر: 2025-02-05
المؤلف: Yunus Emre Gür وآخرون
الموضوع الرئيسي: التنبؤ بالضغوط المالية والإفلاس

نظرة عامة

تتناول هذه الورقة البحثية التحدي الحاسم المتمثل في تصنيف درجات الائتمان بدقة، وهو أمر أساسي للمؤسسات المالية في تقييم الجدارة الائتمانية وإدارة المخاطر. تواجه طرق التصنيف التقليدية صعوبات مع مجموعات البيانات الكبيرة، مما يؤدي إلى عدم الكفاءة. للتغلب على هذه القيود، يقترح المؤلفون نموذجًا هجينًا جديدًا يدمج الشبكات العصبية التلافيفية (CNN) مع تقنيات التعلم الآلي. تتضمن المنهجية تحويل مجموعة بيانات أحادية البعد من بيانات نصية متسلسلة إلى صور رمادية ثنائية الأبعاد، مما يسمح بتطبيق هياكل CNN المختلفة (DenseNet201، GoogLeNet، MobileNetV2، ResNet18، ShuffleNet، وSqueezeNet) لاستخراج الميزات. يتم تعزيز النماذج ذات الأداء الأفضل مع طبقة متصلة بالكامل جديدة (NewFC)، ويتم إنشاء مجموعة ميزات قائمة على الفئات وإعادة تصنيفها باستخدام خوارزميات مثل KNN، LDA، Naive Bayes، وSVM. إن استخدام خوارزمية RelieF لاختيار الميزات يحسن بشكل كبير من دقة وكفاءة التصنيف.

يظهر النموذج المقترح دقة إجمالية مثيرة للإعجاب تبلغ 99.98%، مما يبرز إمكانيته في تعزيز اتخاذ القرار في المؤسسات المالية. تشمل المساهمات الرئيسية التحويل الفعال للبيانات المالية إلى تنسيق مناسب لاستخراج الميزات المتقدمة، وإنشاء نظام دعم قرار مستقل قائم على الذكاء الاصطناعي، والقدرة على تقديم تقييمات سريعة وجماعية لدرجات الائتمان عبر مناطق مختلفة. يركز تصميم النموذج على الجدوى الاقتصادية وسهولة التنفيذ على منصات الأجهزة المتنوعة. ستستكشف الأبحاث المستقبلية تطبيق مجموعات بيانات مالية مختلفة، وتقنيات معالجة مسبقة بديلة، ودمج نماذج المحولات لتعزيز قدرات النموذج وقابليته للتعميم بشكل أكبر.

مقدمة

تؤكد مقدمة هذه الورقة البحثية على الدور الحاسم لدرجات الائتمان في الأنظمة المالية الحديثة، حيث تعمل كعامل رئيسي للمقرضين في تقييم الجدارة الائتمانية. تعتبر نماذج تصنيف الائتمان ضرورية للتنبؤ بعجز العملاء والتخفيف من مخاطر الائتمان، ومع ذلك تواجه الطرق التقليدية قيودًا، خاصة في التعامل مع مجموعات البيانات الكبيرة والمعقدة. إن التعقيد المتزايد للبيانات المالية، الذي يتميز بطبيعته غير المتجانسة، يتطلب اعتماد أدوات تحليلية متقدمة، مثل خوارزميات التعلم العميق، التي أظهرت أداءً متفوقًا في التقاط الاتجاهات السوقية المعقدة وتعزيز دقة التنبؤ.

تقترح هذه الدراسة نهجًا جديدًا لتصنيف درجات الائتمان من خلال تحويل بيانات نصية متسلسلة أحادية البعد إلى صور رمادية ثنائية الأبعاد، مما يسمح بتطبيق الشبكات العصبية التلافيفية (CNN). باستخدام مجموعة بيانات من كاجل، تستخدم الدراسة ستة نماذج CNN لتصنيف درجات الائتمان إلى فئات “جيدة”، “سيئة”، و”قياسية”. تتضمن المنهجية تقنيات تقليل الميزات، مثل خوارزمية RelieF، لتعزيز أداء التصنيف. من خلال الاستفادة من أساليب التعلم العميق والتعلم الآلي، تهدف هذه الدراسة إلى معالجة قيود نماذج تصنيف الائتمان التقليدية، مقدمةً حلاً قابلاً للتوسع وفعالاً يتماشى مع الاتجاهات المعاصرة في تحليلات المالية. إن الاستخدام المبتكر لمسافة الكوساين لتمثيل البكسل يعزز الدراسة، مما يوفر منظورًا جديدًا حول دمج البيانات النصية في النمذجة التنبؤية ضمن مجال تصنيف الائتمان.

طرق

في هذا القسم، يتم تفصيل المنهجية المستخدمة للتحليل التجريبي لطرق CNN والتعلم الآلي. تم إجراء التحليل باستخدام MATLAB لتجميع الطرق وPython 3.6 لتحويل مجموعات بيانات CSV إلى رموز صور رمادية ثنائية الأبعاد. تضمنت مواصفات الأجهزة بطاقة فيديو NVIDIA بسعة 24 جيجابايت، ومعالج بسرعة 2.6 جيجاهرتز، وذاكرة وصول عشوائي سعتها 64 جيجابايت. تم تقييم أداء التصنيف للنماذج باستخدام مصفوفة الارتباك جنبًا إلى جنب مع عدة مقاييس أداء، بما في ذلك الدقة، والموثوقية، وF1-score، والخصوصية، كما هو محدد بواسطة الإيجابيات الحقيقية (TP)، والسلبيات الكاذبة (FP)، والسلبيات الحقيقية (FN)، والإيجابيات الكاذبة (TN).

لتحسين أداء التصنيف وكفاءته، تم استخدام خوارزمية RelieF لاختيار الميزات الخمس الأكثر أهمية من مجموعة أولية من ست ميزات. أظهرت النتائج تحسنًا ملحوظًا في دقة التصنيف مع مجموعة الميزات المخفضة. على وجه التحديد، حقق نموذج Cubic KNN 2260 تصنيفًا صحيحًا في فئة “الجيدة”، مع وجود حالتين فقط من السلبيات الكاذبة، بينما أظهرت نماذج أخرى مثل التمييز الخطي، وSVM الخطي، وGaussian Naive Bayes أيضًا دقة عالية ومعدلات تصنيف خاطئة منخفضة. تفوق نموذج Cubic KNN في جميع المقاييس التي تم تقييمها، محققًا دقة 100% لفئة القياسية وقيم عالية في الحساسية، والخصوصية، والدقة، وF-score، مما يثبت نفسه كنموذج الأفضل أداءً في تصنيف درجات الائتمان.

نتائج

يستخدم قسم النتائج في الدراسة اختبارات فريدمان ونمنيي لتحليل أداء النماذج المختلفة، مع التركيز على مقارنة الشبكات العصبية التلافيفية (CNN) مثل DenseNet201 وResNet-18 مع نموذج التعلم الآلي، Cubic KNN. أشار اختبار فريدمان إلى عدم وجود اختلافات كبيرة في مقاييس الأداء (F-score والدقة) بين النماذج، كما يتضح من القيم p التي تزيد عن 0.05 (0.1518 لـ F-score و0.1042 للدقة)، مما يستبعد الحاجة إلى اختبار نمنيي بعد ذلك. ومع ذلك، تم استخدام اختبار فيشر الدقيق لتحليل معدلات التصنيف الخاطئ بسبب وجود أعداد منخفضة في مصفوفات الارتباك.

كشف اختبار فيشر الدقيق عن اختلافات كبيرة في معدلات التصنيف الخاطئ بين نموذج Cubic KNN ونماذج CNN لفئات الجيدة (الفئة 1) والسيئة (الفئة 2)، مع قيم p تبلغ 3.44e-05 و1.15e-17، على التوالي، مما يشير إلى الأداء المتفوق لنموذج Cubic KNN في هذه الفئات. بالمقابل، لم يتم العثور على اختلاف كبير في الفئة القياسية (الفئة 3)، مع قيمة p تبلغ 0.125. تشير النتائج إلى أن أداء النماذج يختلف مع أعداد دعم الفئات، حيث قد تكون مجموعة البيانات الأكبر في الفئة القياسية قد أدت إلى توقعات أكثر استقرارًا لنماذج التعلم العميق. يبرز هذا أهمية حجم مجموعة البيانات وتوزيع الفئات في تقييم أداء النموذج، خاصة بين أساليب التعلم العميق والتعلم الآلي التقليدية.

مناقشة

تسلط قسم المناقشة في الورقة البحثية الضوء على تطور طرق تصنيف درجات الائتمان، مع التركيز على الانتقال من التقنيات الإحصائية التقليدية، مثل الانحدار اللوجستي وتحليل التمييز الخطي، إلى أساليب التعلم الآلي والتعلم العميق الحديثة. بينما أظهرت الطرق الكلاسيكية موثوقية ودقة في التمييز بين مخاطر الائتمان الجيدة والسيئة، فإنها غالبًا ما تكون محدودة من خلال افتراضاتها الخطية، التي لا تلتقط العلاقات المعقدة في مجموعات البيانات الكبيرة بشكل كافٍ. أظهرت التطورات الأخيرة في التعلم الآلي، وخاصة الشبكات العصبية والنماذج الهجينة، وعدًا في تعزيز الأداء التنبؤي. على سبيل المثال، تم استكشاف الشبكات العصبية التلافيفية (CNN) لقدرتها على معالجة وتصنيف بيانات الائتمان بشكل فعال، مع دراسات تشير إلى تحسينات كبيرة في الدقة ومقاييس الأداء.

علاوة على ذلك، تحدد الورقة فجوة ملحوظة في الأدبيات بشأن تطبيق الشبكات العصبية التلافيفية على تصنيف الائتمان، خاصة في تحويل البيانات الترتيبية أحادية البعد إلى صور رمادية ثنائية الأبعاد. يستفيد هذا النهج المبتكر من نقاط القوة في الشبكات العصبية التلافيفية في استخراج الميزات المكانية، مما قد يكشف عن أنماط معقدة قد تتجاهلها الطرق التقليدية. يقترح المؤلفون منهجية جديدة تدمج هذا التحويل مع تقنيات التعلم الآلي، مستفيدين من خوارزمية RelieF لاختيار الميزات لتعزيز أداء النموذج مع معالجة التحديات الحسابية المرتبطة بمجموعات البيانات المالية الكبيرة. لا تساهم هذه الدراسة فقط في زيادة الأبحاث حول التعلم العميق في المالية، بل تقدم أيضًا حلاً قابلاً للتوسع لتصنيف درجات الائتمان، مما يظهر إمكانيات الشبكات العصبية التلافيفية في تحسين الدقة التنبؤية في هذا المجال.

Journal: Computational Economics, Volume: 65, Issue: 5
DOI: https://doi.org/10.1007/s10614-025-10893-5
Publication Date: 2025-02-05
Author(s): Yunus Emre Gür et al.
Primary Topic: Financial Distress and Bankruptcy Prediction

Overview

This research paper addresses the critical challenge of accurately classifying credit scores, which is essential for financial institutions in assessing creditworthiness and managing risk. Traditional classification methods struggle with large datasets, leading to inefficiencies. To overcome these limitations, the authors propose a novel hybrid model that integrates convolutional neural networks (CNN) with machine learning techniques. The methodology involves transforming a 1D dataset of sequential text data into 2D greyscale images, allowing for the application of various CNN architectures (DenseNet201, GoogLeNet, MobileNetV2, ResNet18, ShuffleNet, and SqueezeNet) for feature extraction. The best-performing models are enhanced with a new fully connected layer (NewFC), and a class-based feature set is created and reclassified using algorithms such as KNN, LDA, Naive Bayes, and SVM. The use of the RelieF algorithm for feature selection significantly improves classification accuracy and efficiency.

The proposed model demonstrates an impressive overall accuracy of 99.98%, highlighting its potential to enhance decision-making in financial institutions. Key contributions include the effective transformation of financial data into a format conducive to advanced feature extraction, the establishment of an independent AI-based decision support system, and the ability to provide rapid and collective credit score assessments across various regions. The model’s design prioritizes cost-effectiveness and ease of implementation on diverse hardware platforms. Future research will explore the application of different financial datasets, alternative preprocessing techniques, and the incorporation of transformer models to further enhance the model’s capabilities and generalizability.

Introduction

The introduction of this research paper emphasizes the critical role of credit scores in modern financial systems, serving as a key determinant for lenders assessing creditworthiness. Credit scoring models are essential for predicting customer defaults and mitigating credit risks, yet traditional methods face limitations, particularly in handling large and complex datasets. The increasing complexity of financial data, characterized by its heterogeneous nature, necessitates the adoption of advanced analytical tools, such as deep learning algorithms, which have demonstrated superior performance in capturing intricate market trends and enhancing forecasting accuracy.

This study proposes a novel approach to credit score classification by transforming 1D sequential text data into 2D grayscale images, allowing for the application of Convolutional Neural Networks (CNNs). Utilizing a dataset from Kaggle, the research employs six CNN models to classify credit scores into categories of ‘Good’, ‘Poor’, and ‘Standard’. The methodology includes feature reduction techniques, such as the RelieF algorithm, to enhance classification performance. By leveraging deep learning and machine learning methods, this work aims to address the limitations of traditional credit scoring models, offering a scalable and efficient solution that aligns with contemporary trends in financial analytics. The innovative use of cosine distance for pixel representation further enriches the study, providing a fresh perspective on the integration of textual data in predictive modeling within the credit scoring domain.

Methods

In this section, the methodology employed for the experimental analysis of CNN and machine learning methods is detailed. The analysis was conducted using MATLAB for compiling methods and Python 3.6 for converting CSV datasets into 2D grayscale image codes. The hardware specifications included a 24 GB NVIDIA video card, a 2.6 GHz processor, and 64 GB of RAM. The classification performance of the models was evaluated using a confusion matrix alongside multiple performance metrics, including precision, accuracy, F1-score, and specificity, as defined by true positives (TP), false positives (FP), false negatives (FN), and true negatives (TN).

To enhance classification performance and efficiency, the RelieF algorithm was utilized to select the five most significant features from an initial set of six. The results indicated a marked improvement in classification accuracy with the reduced feature set. Specifically, the Cubic KNN model achieved 2260 correct classifications in the “good” class, with only two false negatives, while other models like linear discriminant, linear SVM, and Gaussian Naive Bayes also demonstrated high accuracy and reduced misclassification rates. The Cubic KNN model excelled in all evaluated metrics, achieving 100% accuracy for the Standard class and high values in sensitivity, specificity, precision, and F-score, thus establishing itself as the best-performing model for credit score classification.

Results

The results section of the study employs the Friedman and Nemenyi tests to analyze the performance of various models, specifically comparing convolutional neural networks (CNNs) like DenseNet201 and ResNet-18 with a machine learning model, Cubic KNN. The Friedman test indicated no significant differences in performance metrics (F-score and accuracy) among the models, as evidenced by p-values greater than 0.05 (0.1518 for F-score and 0.1042 for accuracy), thus precluding the need for a post-hoc Nemenyi test. However, the analysis of misclassification rates utilized Fisher’s Exact Test due to the presence of low counts in the confusion matrices.

Fisher’s Exact Test revealed significant differences in misclassification rates between the Cubic KNN and CNN models for the Good (Class 1) and Poor (Class 2) categories, with p-values of 3.44e-05 and 1.15e-17, respectively, indicating superior performance of the Cubic KNN model in these classes. Conversely, no significant difference was found in the Standard class (Class 3), with a p-value of 0.125. The results suggest that the performance of the models varies with class support numbers, as the larger dataset in the Standard class may have led to more stable predictions for the deep learning models. This highlights the importance of dataset size and class distribution in evaluating model performance, particularly between deep learning and traditional machine learning approaches.

Discussion

The discussion section of the research paper highlights the evolution of credit score classification methods, emphasizing the transition from traditional statistical techniques, such as logistic regression and linear discriminant analysis, to modern machine learning and deep learning approaches. While classical methods have demonstrated reliability and accuracy in distinguishing between good and bad credit risks, they are often limited by their linear assumptions, which inadequately capture complex relationships in large datasets. Recent advancements in machine learning, particularly neural networks and hybrid models, have shown promise in enhancing predictive performance. For instance, convolutional neural networks (CNNs) have been explored for their ability to process and classify credit data effectively, with studies indicating significant improvements in accuracy and performance metrics.

Moreover, the paper identifies a notable gap in the literature regarding the application of CNNs to credit scoring, particularly in transforming 1D ordinal data into 2D greyscale images. This innovative approach leverages CNNs’ strengths in spatial feature extraction, potentially revealing intricate patterns that traditional methods may overlook. The authors propose a novel methodology that integrates this transformation with machine learning techniques, utilizing the RelieF algorithm for feature selection to enhance model performance while addressing computational challenges associated with large financial datasets. This work not only contributes to the growing body of research on deep learning in finance but also offers a scalable solution for credit score classification, demonstrating the potential of CNNs to improve predictive accuracy in this domain.