نهج الشبكة العصبية لتعزيز توقعات التغيير باستخدام الترميز الفئوي والتقييس القياسي Neural network approach enhancing churn prediction with categorical encoding and standard scaling

المجلة: Scientific Reports، المجلد: 16، العدد: 1
DOI: https://doi.org/10.1038/s41598-026-37407-z
PMID: https://pubmed.ncbi.nlm.nih.gov/41593215
تاريخ النشر: 2026-01-27
المؤلف: Brijit Bhattacharjee وآخرون
الموضوع الرئيسي: تسرب العملاء والتجزئة

نظرة عامة

تناقش قسم ورقة البحث أهمية توقع فقدان العملاء كتطبيق حيوي لتعلم الآلة ضمن تحليلات الأعمال. وتفصل دراسة معيارية محكومة لنموذج متعدد الطبقات، تم تدريبه على مجموعة بيانات تتكون من 10,000 سجل عميل و12 ميزة. تشمل تقنيات معالجة البيانات المستخدمة الترميز الأحادي القياسي والتقييس القياسي، بهدف تعزيز قدرات تعميم النموذج.

تشير النتائج إلى أن النموذج يستخدم هذه الطرق المعالجة بشكل فعال لتحسين أدائه التنبؤي. تؤكد الدراسة على أهمية دمج تقنيات التعامل مع البيانات الفئوية القوية في نماذج تعلم الآلة لتحسين النتائج في توقع فقدان العملاء.

مقدمة

تتناول المقدمة القضية الحرجة لفقدان العملاء، والتي تُعرف بأنها فقدان عميل أو إنهاء علاقتهم مع عمل تجاري. يشكل هذا الظاهرة تحديات كبيرة للمنظمات التي تعمل في أسواق تنافسية ومشبعة للغاية. فهم العوامل التي تسهم في فقدان العملاء أمر ضروري للشركات التي تهدف إلى تعزيز الاحتفاظ بالعملاء والحفاظ على ميزة تنافسية.

الطرق

في هذا القسم، يناقش المؤلفون ظهور تقنيات تعلم الآلة، وخاصة الطرق التجميعية، لمعالجة القيود المرتبطة بالمصنفات الفردية. يشيرون إلى عمل بريمان، الذي أظهر أن استخدام الغابات العشوائية، المدعومة بالتجميع (التجميع القائم على البوتستراب)، يحسن بشكل كبير من دقة التنبؤ.

تتركز المنهجية الموضحة في الورقة على تحويل بيانات العملاء الخام إلى ميزات ذات مغزى مناسبة لتدريب النموذج والتنبؤ. يوضح المؤلفون العملية بأكملها، بدءًا من اختيار الميزات وانتهاءً بضبط المعلمات الفائقة، مدعومة بتمثيل بصري في الشكل 1. تهدف هذه الطريقة المنظمة إلى تعزيز فعالية النموذج التنبؤي من خلال ضمان أن البيانات المدخلة إليه معدة بشكل مثالي.

المناقشة

تسلط المناقشة الضوء على التحديات الحرجة التي تواجه قطاعات الاتصالات والبنوك والتجارة الإلكترونية القائمة على الاشتراك بسبب ارتفاع معدلات فقدان العملاء، مما يهدد الربحية والاستدامة. العواقب المالية لفقدان العملاء المخلصين كبيرة، حيث إن اكتساب عملاء جدد يكلف بشكل كبير أكثر من الاحتفاظ بالعملاء الحاليين. وبالتالي، أصبحت أنظمة توقع الفقدان ضرورية لهذه الصناعات، مما يمكّن المشغلين من تحديد العملاء المحتملين الذين قد يفقدونهم وتنفيذ تدخلات مستهدفة لتعزيز الاحتفاظ بالعملاء وزيادة القيمة مدى الحياة. بينما تكافح النماذج الإحصائية التقليدية، على الرغم من كونها قابلة للتفسير، مع العلاقات المعقدة وغير الخطية الموجودة في بيانات العملاء، مما أدى إلى تحول نحو تقنيات تعلم الآلة (ML) والتعلم العميق (DL)، مثل الغابات العشوائية، وآلات الدعم الناقل، والشبكات العصبية، وخاصة XGBoost، التي أظهرت أداءً تنبؤيًا متفوقًا.

على الرغم من التقدم في منهجيات توقع الفقدان، لا تزال هناك عدة تحديات قائمة، بما في ذلك التعامل مع المتغيرات الفئوية، وعدم التوازن في الفئات، وقابلية تفسير النماذج المعقدة. يمكن أن يؤدي المعالجة غير الكافية للبيانات الفئوية إلى إدخال تحيزات، بينما غالبًا ما يؤدي عدم التوازن في الفئات إلى نماذج تفضل الفئة الغالبة، مما يضعف اكتشاف العملاء الذين يفقدون. علاوة على ذلك، بينما تتفوق الشبكات العصبية في التنبؤ، فإن طبيعتها “الصندوق الأسود” تعقد الفهم الإداري واتخاذ القرار. تؤكد الورقة على الحاجة إلى استكشاف منهجي لتقنيات المعالجة المسبقة، خاصة فيما يتعلق بالشبكات العصبية، وتدعو إلى نهج متوازن يدمج بين طرق الترميز المتقدمة والتقييس مع النماذج التجميعية لتعزيز دقة التنبؤ وقابلية التفسير. تشمل اتجاهات البحث المستقبلية تطوير نماذج توقع الفقدان التكيفية التي تستفيد من البيانات في الوقت الحقيقي وتقنيات هندسة الميزات المتقدمة لالتقاط سلوكيات العملاء المتطورة بشكل أفضل.

القيود

تعتبر قيود نظام توقع الفقدان القائم على الشبكات العصبية، على الرغم من التقدم في الترميز المفصل والتقييس القياسي، ملحوظة. أولاً، يواجه النموذج تحديات تتعلق بالامتثال لخصوصية البيانات، والتحيز الخوارزمي، وقابلية تفسير الذكاء الاصطناعي، مما يثير مخاوف أخلاقية وتنظيمية في سياقات الأعمال. ثانيًا، يحد الاعتماد على مجموعات البيانات التاريخية من قدرة النموذج على التكيف مع سلوكيات العملاء المتغيرة بسرعة وظروف السوق المتطورة، مما يقلل من مرونته ومرونته على المدى الطويل. أخيرًا، تركز الدراسات الحالية بشكل أساسي على مقاييس الأداء الإحصائية، مثل الدقة ودرجة F1، بينما تتجاهل القيود العملية للأعمال وإمكانية التنفيذ. يخلق هذا الإغفال فجوة بين نتائج البحث وقابليتها للتطبيق في العالم الحقيقي، مما يؤدي غالبًا إلى عدم فعالية التشغيل.

Journal: Scientific Reports, Volume: 16, Issue: 1
DOI: https://doi.org/10.1038/s41598-026-37407-z
PMID: https://pubmed.ncbi.nlm.nih.gov/41593215
Publication Date: 2026-01-27
Author(s): Brijit Bhattacharjee et al.
Primary Topic: Customer churn and segmentation

Overview

The research paper section discusses the significance of customer churn prediction as a vital application of machine learning within business analytics. It details a controlled benchmarking study of a multilayer perceptron model, which was trained on a dataset comprising 10,000 customer records and 12 features. The data preprocessing techniques employed included one-hot encoding and standard scaling, aimed at enhancing the model’s generalization capabilities.

The findings indicate that the model effectively utilizes these preprocessing methods to improve its predictive performance. The study underscores the importance of integrating robust categorical data handling techniques in machine learning models to optimize outcomes in customer churn prediction.

Introduction

The introduction addresses the critical issue of customer churn, defined as the loss of a customer or the termination of their relationship with a business. This phenomenon poses significant challenges for organizations operating in highly competitive and saturated markets. Understanding the factors contributing to customer churn is essential for businesses aiming to enhance customer retention and maintain a competitive edge.

Methods

In this section, the authors discuss the emergence of machine learning techniques, particularly ensemble methods, to address the limitations associated with single classifiers. They reference Breiman’s work, which demonstrated that the use of Random Forests, facilitated by bagging (bootstrap aggregating), significantly improved forecasting accuracy.

The methodology outlined in the paper focuses on transforming raw customer data into meaningful features suitable for model training and prediction. The authors detail the entire process, beginning with feature selection and concluding with hyperparameter tuning, supported by a visual representation in Figure 1. This structured approach aims to enhance the effectiveness of the predictive model by ensuring that the data fed into it is optimally prepared.

Discussion

The discussion highlights the critical challenges faced by telecommunications, banking, and subscription-based e-commerce sectors due to high customer attrition rates, which threaten profitability and sustainability. The financial implications of losing loyal customers are significant, as acquiring new customers is substantially more costly than retaining existing ones. Consequently, churn prediction systems have become essential for these industries, enabling operators to identify potential churners and implement targeted interventions to enhance customer retention and maximize lifetime value. Traditional statistical models, while interpretable, have struggled with the complex, non-linear relationships present in customer data, leading to a shift towards machine learning (ML) and deep learning (DL) approaches, such as Random Forests, Support Vector Machines, and neural networks, particularly XGBoost, which have shown superior predictive performance.

Despite advancements in churn prediction methodologies, several challenges persist, including the handling of categorical variables, class imbalance, and the interpretability of complex models. Inadequate preprocessing of categorical data can introduce biases, while class imbalance often results in models that favor the majority class, undermining the detection of actual churners. Moreover, while neural networks excel in prediction, their “black box” nature complicates managerial understanding and decision-making. The paper emphasizes the need for systematic exploration of preprocessing techniques, particularly in relation to neural networks, and advocates for a balanced approach that integrates advanced encoding and scaling methods with ensemble models to enhance predictive accuracy and interpretability. Future research directions include the development of adaptive churn prediction models that leverage real-time data and advanced feature engineering techniques to better capture evolving customer behaviors.

Limitations

The limitations of the churn prediction system based on neural networks, despite advancements in discretized encoding and standard scaling, are notable. Firstly, the model faces challenges related to data privacy compliance, algorithmic bias, and the explainability of artificial intelligence, which raise ethical and regulatory concerns in business contexts. Secondly, reliance on historical data sets constrains the model’s ability to adapt to rapidly changing customer behaviors and evolving market conditions, thereby diminishing its long-term flexibility and resilience. Lastly, existing studies predominantly emphasize statistical performance metrics, such as accuracy and F1-score, while neglecting practical business constraints and deployment feasibility. This oversight creates a gap between research findings and their real-world applicability, often leading to ineffective operationalization.