تحليل تباين التنسور المعزز بذكاء: تمكين تصنيف السلاسل الزمنية متعددة الأبعاد في بيئات البيانات المنخفضة Intelligently Augmented Contrastive Tensor Factorization: Empowering multi-dimensional time series classification in low-data environments

المجلة: Expert Systems with Applications، المجلد: 287
DOI: https://doi.org/10.1016/j.eswa.2025.127889
تاريخ النشر: 2025-05-14
المؤلف: Anushiya Arunan وآخرون
الموضوع الرئيسي: تحليل الموتر وتطبيقاته

نظرة عامة

تقدم هذه الدراسة إطارًا جديدًا، وهو تحليل الموتر التبايني المعزز بذكاء (ITA-CTF)، يهدف إلى تحسين تصنيف بيانات السلاسل الزمنية متعددة الأبعاد، خاصة في السيناريوهات التي تحتوي على بيانات تدريب محدودة. يتناول الإطار التحديات المتعلقة بتعلم الميزات المعقدة، مثل الاعتماديات عبر الأبعاد والتباينات داخل الفئات، والتي غالبًا ما تعيقها مشكلة الإفراط في التكيف في أساليب التعلم العميق القياسية. يتعلم وحدة CTF داخل ITA-CTF بفعالية المكونات التفسيرية الأساسية واعتمادياتها المشتركة من خلال تحسين خسارة تباينية تعزز التعلم المتشابه والوعي بالفئة، مما يحسن أداء التصنيف.

لزيادة تعزيز عملية التعلم، تولد وحدة ITA تحسينات مستهدفة تبرز الأنماط الواقعية داخل الفئات مع الحفاظ على الخصائص حسب الفئة. يتم تحقيق ذلك من خلال أخذ عينات ديناميكية من نموذج فئة “ناعمة” لتوجيه تحسين كل عينة استعلام، مما يؤدي إلى تمثيل هجين يلتقط التباينات المعقدة داخل الفئات. تم تقييم الطريقة المقترحة بدقة عبر خمس مهام تصنيف، بما في ذلك السيناريوهات الصعبة مثل تحديد الأعطال مع أكثر من 50 فئة، محققة تحسينات في الأداء تصل إلى 18.7% مقارنة بتحليل الموتر القياسي ومعايير التعلم العميق المختلفة. تؤكد النتائج فعالية الإطار في استخراج الخصائص الثابتة من بيانات السلاسل الزمنية متعددة الأبعاد، مما يجعله قابلًا للتطبيق خارج السياقات الصناعية إلى مجالات مثل الرعاية الصحية الذكية، حيث يكون التعلم من بيانات محدودة معلمة أمرًا حيويًا.

مقدمة

تسلط مقدمة هذه الورقة البحثية الضوء على الزيادة في توليد بيانات السلاسل الزمنية متعددة الأبعاد من الشبكات الحسية المترابطة في الأنظمة الحرجة، مثل المصانع الصناعية وشبكات النقل. على الرغم من التقدم في التعلم العميق (DL) لتصنيف السلاسل الزمنية، فإن الاعتماد على مجموعات بيانات معلمة كبيرة يمثل تحديات كبيرة، خاصة في التطبيقات الواقعية حيث تكون البيانات المعلمة نادرة ومعقدة. يؤكد المؤلفون أن أساليب DL التقليدية غالبًا ما تفشل في التقاط التفاعلات الأساسية عبر الأبعاد والتباينات داخل الفئات، والتي تعتبر حاسمة للتصنيف الدقيق في بيانات الحساسات عالية الأبعاد.

لمعالجة هذه التحديات، تقترح الورقة إطارًا جديدًا يدمج تحسينات مستهدفة بذكاء مع تحليل الموتر التبايني الواعي بالفئة (CTF). يهدف هذا الإطار إلى تعزيز تعلم التمثيل من السلاسل الزمنية متعددة الأبعاد في ظروف البيانات المنخفضة. تولد وحدة التحسين المستهدفة بذكاء تحسينات بيانات ذات مغزى تعكس التباينات داخل الفئات، بينما يستفيد مستخرج الميزات CTF من التعلم التبايني للتركيز على الخصائص الثابتة عبر الفئات. تم تصميم الجمع بين هذه المكونات، إلى جانب مصنف متعدد الطبقات، لتحسين أداء التصنيف في السيناريوهات التي تتميز بتباين عالٍ داخل الفئات وبيانات تدريب محدودة. يؤكد المؤلفون أن نهجهم يعالج بفعالية أوجه القصور في الأساليب الحالية ويظهر أداءً قويًا عبر مهام تصنيف متعددة ذات تأثير عالٍ.

الطرق

في هذا القسم، يحدد المؤلفون الأساليب المرجعية المستخدمة لتقييم نهجهم المقترح لتصنيف السلاسل الزمنية. يقارنون طريقتهم مع مجموعة متنوعة من الأسس، بما في ذلك الجار الأقرب التقليدي (1-NN) وتحليل الزمن الديناميكي (DTW)، ونماذج التعلم العميق القياسية مثل الشبكات العصبية التلافيفية والمتكررة، وتقنيات التعلم العميق المتقدمة مثل MLSTM-FCN وTapNet. ومن الجدير بالذكر أنهم يقيمون أيضًا طرق التدريب الذاتي، وخاصة تلك المعتمدة على التعلم التبايني، مثل TS-TCC وTF-C، التي أظهرت فعاليتها في السيناريوهات التي تحتوي على بيانات معلمة محدودة. يبرز المؤلفون أن التعلم التبايني مفيد بشكل خاص لالتقاط الأنماط المميزة على مستوى الفئة، مما يجعله مناسبًا لمهام التصنيف.

تشمل الإعدادات التجريبية تطبيع Z-score لمجموعات بيانات QCAT وPSML لضمان الاتساق في تعلم المقياس القائم على المسافة، بينما لا تتطلب مجموعة بيانات PEMS، التي تتراوح قيمها بالفعل بين 0 و1، أي تعديل. يوضح المؤلفون عملية ضبط المعلمات الفائقة لطريقتهم المقترحة، بما في ذلك اختيار أحجام الدفعات الصغيرة، وأنواع DTW، ومعلمات مختلفة لوحدة استخراج الميزات. يبلغون عن الإعدادات المثلى للمعلمات الفائقة مثل عدد مكونات الموتر ومعدلات التعلم، ويصفون عملية التدريب، التي تشمل التوقف المبكر وتكرارات متعددة للبذور لضمان تقييم أداء قوي. بالنسبة للنماذج المرجعية، تم إجراء تحسين مماثل للمعلمات الفائقة، مع التركيز على الإعدادات الشائعة والتوصيات المحددة من الأدبيات.

النتائج

في هذا القسم، يقدم المؤلفون تحليل أداء مفصل لطريقتهم المقترحة، ITA-CTF، عبر مجموعات بيانات متنوعة، مع التأكيد على فعاليتها في مهام التصنيف. يتضمن التحليل دراسات إلغاء تقيم مساهمات التعلم التبايني (CL) واستراتيجيات تحسين البيانات المقترحة، التي تهدف إلى تعزيز قدرة النموذج على التعلم من بيانات التدريب المحدودة. ومن الجدير بالذكر أن مجموعة بيانات شبكة الكهرباء PSML تبرز لتباينها داخل الفئة، مما يشكل تحديات للتصنيف. تشير النتائج إلى أن ITA-CTF يتفوق على عدة طرق مرجعية عبر أربع من خمس مهام تصنيف، مما يظهر قوته وقدرته على التكيف في التعامل مع مجموعات بيانات معقدة.

تكشف دراسات الإلغاء أن فعالية CL تتأثر بشكل كبير بجودة تحسينات البيانات. بينما تؤدي تقنيات التحسين القياسية مثل الاهتزاز والترتيب إلى نتائج مختلطة، فإن تحسينات الطريقة المقترحة تلتقط بفعالية التباينات داخل الفئات، مما يؤدي إلى تحسين أداء التصنيف. بالإضافة إلى ذلك، يظهر تحليل حساسية المعلمات الفائقة أن حجم الدفعة الصغيرة، وعدد مكونات التحليل، ووزن خسارة التباين تلعب جميعها أدوارًا حاسمة في تحسين دقة التصنيف. يستنتج المؤلفون أن طريقتهم تحتل باستمرار مرتبة بين الأفضل، خاصة في المهام التي تتطلب تعلم ميزات دقيقة، مما يبرز أهمية التحسينات المستهدفة وبنية النموذج المنظمة جيدًا في تحسين نتائج التصنيف في سيناريوهات البيانات المنخفضة.

المناقشة

في هذا القسم، يناقش المؤلفون المفاهيم الأساسية التي تدعم إطارهم المقترح ITA-CTF، الذي يدمج تحليل الموتر، والتعلم التبايني، وتحليل الزمن الديناميكي لتعزيز تعلم التمثيل لبيانات السلاسل الزمنية متعددة الأبعاد، خاصة في السيناريوهات ذات البيانات المنخفضة. يتم تسليط الضوء على تحليل الموتر، وبشكل خاص التحليل متعدد الأبعاد القياسي (CPD)، لقدرتها على توفير تمثيلات فريدة وقابلة للتفسير للموترات عالية الأبعاد، والتقاط التفاعلات المكانية الزمنية الأساسية. يؤكد المؤلفون أنه بينما يقوم CPD بإعادة بناء البيانات بفعالية، فإنه لا يسهل بطبيعته تمثيلات واعية بالفئة، وهو أمر حاسم لمهام التصنيف.

يتم تقديم التعلم التبايني كطريقة لتعلم الميزات المميزة من خلال مقارنة عينات مشابهة وغير مشابهة، وتحسين دالة خسارة تباينية لتعزيز جودة التمثيلات المتعلمة. يتم الاعتراف بالتحدي المتمثل في تصميم استراتيجيات تحسين بيانات فعالة لبيانات السلاسل الزمنية، مع اقتراح تحليل الزمن الديناميكي (DTW) كتقنية لإنشاء تحسينات واقعية تحترم الاعتماديات الزمنية. يقترح المؤلفون استراتيجية تحسين مستهدفة بذكاء تستخدم نموذج فئة “ناعمة” يتم تحديدها ديناميكيًا لتوجيه عملية DTW، مما يؤدي إلى توليد تحسينات ذات مغزى تعكس التباينات داخل الفئات مع الحفاظ على خصائص الفئة. يتم دمج هذه العملية في إطار تحليل الموتر التبايني، الذي يهدف إلى تحسين أداء التصنيف من خلال تحسين التمثيلات المتعلمة في سياق بيانات التدريب المحدودة.

القيود

في قسم “القيود”، يعترف المؤلفون بمناطق رئيسية لتحسين طريقتهم، التي تظهر أداءً قويًا في التطبيقات الصناعية ذات البيانات المنخفضة. أولاً، يحددون الحاجة إلى تحسين وقت الحساب غير المتصل، خاصة في توليد تحسينات بيانات قائمة على نموذج فئة “ناعمة” للسلاسل الزمنية الأطول. بينما يكون وقت الاستدلال عبر الإنترنت فعالًا (حوالي \(10^{-5}\) ثوانٍ)، يمكن تحقيق مزيد من التقدم من خلال تنفيذ خوارزميات تحليل الزمن الديناميكي المتقدمة (DTW)، مثل DTW القابل للتوازي لتسريع GPU أو تقنيات قائمة على الحدود الدنيا للتقليم المبكر للمطابقات غير الواعدة.

ثانيًا، يبرز المؤلفون التحديات في مهمة تحديد الأعطال، المنسوبة إلى الاعتماديات المكانية الزمنية المعقدة، وعدد كبير من الفئات المستهدفة، وبيانات التدريب المحدودة. على الرغم من أن نموذجهم يتفوق على المعايير الحالية، لا يزال هناك مجال لتحسين أداء التحديد. يقترحون استكشاف نماذج التعلم المتقدمة، وخاصة التعلم الميتا، لتعزيز أداء التصنيف في السيناريوهات ذات البيانات القليلة مع الحفاظ على كفاءة البيانات. يقترح المؤلفون أن تعديل استراتيجيات التعلم الميتا الحالية، التي تم تطويرها أساسًا لرؤية الكمبيوتر، لتناسب تعقيدات السلاسل الزمنية متعددة الأبعاد يمكن أن يعالج هذه التحديات. بالإضافة إلى ذلك، يتم تقديم الاستفادة من المعرفة من مهام متنوعة لتسهيل التعديل السريع للمهام الجديدة ببيانات محدودة كاتجاه واعد للبحث المستقبلي.

Journal: Expert Systems with Applications, Volume: 287
DOI: https://doi.org/10.1016/j.eswa.2025.127889
Publication Date: 2025-05-14
Author(s): Anushiya Arunan et al.
Primary Topic: Tensor decomposition and applications

Overview

The research presents a novel framework, Intelligently Augmented Contrastive Tensor Factorization (ITA-CTF), aimed at enhancing the classification of multi-dimensional time series data, particularly in scenarios with limited training data. The framework addresses the challenges of learning complex features, such as cross-dimensional dependencies and intra-class variations, which are often hindered by model overfitting in standard deep learning approaches. The CTF module within ITA-CTF effectively learns core explanatory components and their joint dependencies through a contrastive loss optimization that promotes similarity learning and class-awareness, thereby improving classification performance.

To further bolster the learning process, the ITA module generates targeted augmentations that emphasize realistic intra-class patterns while maintaining classwise properties. This is achieved by dynamically sampling a “soft” class prototype to guide the augmentation of each query sample, resulting in a hybrid representation that captures complex intra-class variations. The proposed method was rigorously evaluated across five classification tasks, including challenging scenarios like fault localization with over 50 classes, achieving performance improvements of up to 18.7% compared to standard tensor factorization and various deep learning benchmarks. The findings underscore the framework’s effectiveness in extracting invariant properties from multi-dimensional time series data, making it applicable beyond industrial contexts to areas such as smart healthcare, where learning from limited labeled data is crucial.

Introduction

The introduction of this research paper highlights the increasing generation of multi-dimensional time series data from interconnected sensor networks in critical systems, such as industrial plants and transportation networks. Despite advancements in deep learning (DL) for time series classification, the reliance on large labeled datasets poses significant challenges, particularly in real-world applications where labeled data is scarce and complex. The authors emphasize that conventional DL methods often fail to capture essential cross-dimensional interactions and intra-class variations, which are crucial for accurate classification in high-dimensional sensor data.

To address these challenges, the paper proposes a novel framework that integrates intelligently targeted augmentations with class-aware contrastive tensor factorization (CTF). This framework aims to enhance representation learning from multi-dimensional time series under low-data conditions. The intelligently targeted augmentation module generates meaningful data augmentations that reflect intra-class variations, while the CTF feature extractor leverages contrastive learning to focus on invariant properties across classes. The combination of these components, along with a multi-layer perceptron classifier, is designed to improve classification performance in scenarios characterized by high intra-class variability and limited training samples. The authors assert that their approach effectively addresses the shortcomings of existing methods and demonstrates robust performance across multiple high-impact classification tasks.

Methods

In this section, the authors outline the benchmark methods used to evaluate their proposed approach for time series classification. They compare their method against a variety of baselines, including the classical one-nearest neighbor (1-NN) Dynamic Time Warping (DTW), standard deep learning (DL) models such as convolutional and recurrent neural networks, and advanced DL techniques like MLSTM-FCN and TapNet. Notably, they also assess self-supervised pre-training methods, particularly those based on contrastive learning, such as TS-TCC and TF-C, which have shown effectiveness in scenarios with limited labeled data. The authors highlight that contrastive learning is particularly advantageous for capturing high-level, class-discriminative patterns, making it suitable for classification tasks.

The experimental settings involve Z-score normalization of the QCAT and PSML datasets to ensure consistency in distance-based metric learning, while the PEMS dataset, with values already constrained between 0 and 1, does not require scaling. The authors detail the hyperparameter tuning process for their proposed method, including the selection of mini-batch sizes, DTW types, and various parameters for the feature extractor module. They report optimal settings for hyperparameters such as the number of tensor components and learning rates, and describe the training process, which includes early stopping and multiple seed repetitions to ensure robust performance evaluation. For benchmark models, similar hyperparameter optimization was conducted, with attention to common settings and specific recommendations from the literature.

Results

In this section, the authors present a detailed performance analysis of their proposed method, ITA-CTF, across various datasets, emphasizing its effectiveness in classification tasks. The analysis includes ablation studies that assess the contributions of contrastive learning (CL) and the proposed data augmentation strategies, which aim to enhance the model’s ability to learn from limited training data. Notably, the PSML Electric Grid dataset is highlighted for its intraclass variability, which poses challenges for classification. The results indicate that ITA-CTF outperforms several benchmark methods across four out of five classification tasks, demonstrating its robustness and adaptability in handling complex datasets.

The ablation studies reveal that the effectiveness of CL is significantly influenced by the quality of data augmentations. While standard augmentation techniques like jittering and permutation yield mixed results, the proposed method’s augmentations effectively capture intra-class variations, leading to improved classification performance. Additionally, the analysis of hyperparameter sensitivity shows that the mini-batch size, the number of decomposition components, and the contrastive loss weight all play critical roles in optimizing classification accuracy. The authors conclude that their method consistently ranks among the top performers, particularly in tasks requiring nuanced feature learning, thereby underscoring the importance of targeted augmentations and a well-structured model architecture in enhancing classification outcomes in low-data scenarios.

Discussion

In this section, the authors discuss the foundational concepts underpinning their proposed ITA-CTF framework, which integrates tensor factorization, contrastive learning, and dynamic time warping to enhance representation learning for multi-dimensional time series data, particularly in low-data scenarios. Tensor factorization, specifically Canonical Polyadic Decomposition (CPD), is highlighted for its ability to provide unique and interpretable representations of high-dimensional tensors, capturing essential spatio-temporal interactions. The authors emphasize that while CPD effectively reconstructs data, it does not inherently facilitate class-aware representations, which is crucial for classification tasks.

Contrastive learning is introduced as a method to learn discriminative features by contrasting similar and dissimilar samples, optimizing a contrastive loss function to enhance the quality of learned representations. The challenge of designing effective data augmentation strategies for time series data is acknowledged, with dynamic time warping (DTW) proposed as a technique to create realistic augmentations that respect temporal dependencies. The authors propose an intelligently targeted augmentation strategy that utilizes a dynamically determined “soft” class prototype to guide the DTW process, thereby generating meaningful augmentations that reflect intra-class variations while maintaining class characteristics. This augmentation process is integrated into the contrastive tensor factorization framework, which aims to improve classification performance by refining the learned representations in the context of limited training data.

Limitations

In the “Limitations” section, the authors acknowledge two primary areas for enhancement in their method, which demonstrates strong performance in low-data industrial applications. First, they identify the need for optimization of offline computation time, particularly in generating “soft” class prototype-based data augmentations for longer time series. While the online inference time is efficient (approximately \(10^{-5}\) seconds), further advancements could be achieved through the implementation of advanced Dynamic Time Warping (DTW) algorithms, such as parallelizable DTW for GPU acceleration or lower bound-based techniques for early pruning of unpromising matches.

Second, the authors highlight challenges in the fault localization task, attributed to complex spatio-temporal dependencies, a high number of target classes, and limited training data. Although their model surpasses existing benchmarks, there remains potential for improved localization performance. They propose exploring advanced learning paradigms, particularly meta-learning, to enhance classification performance in few-shot scenarios while maintaining data efficiency. The authors suggest that adapting existing meta-learning strategies, primarily developed for computer vision, to the complexities of multi-dimensional time series could address these challenges. Additionally, leveraging knowledge from various tasks to facilitate rapid fine-tuning for new tasks with limited data is presented as a promising direction for future research.