نتائج حول التعلم الآلي لتحديد السيراميك الأثري: مراجعة أدبية منهجية Findings on Machine Learning for Identification of Archaeological Ceramics: A Systematic Literature Review

المجلة: IEEE Access، المجلد: 12
DOI: https://doi.org/10.1109/access.2024.3429623
تاريخ النشر: 2024-01-01
المؤلف: Ziyao Ling وآخرون
الموضوع الرئيسي: تحليل مواد التراث الثقافي

نظرة عامة

تقدم الورقة البحثية مراجعة منهجية للأدبيات تتناول ثلاثة وثلاثين دراسة تركزت على تحديد الفخار الأثري من خلال تقنيات التعلم الآلي (ML) والتعلم العميق (DL). تسلط الضوء على الأهمية التاريخية للفخار في التراث الثقافي وتنتقد الطرق التقليدية التجريبية والتقنية لتحديده، والتي غالبًا ما تكون مكلفة وتستغرق وقتًا طويلاً. تشير النتائج إلى أن DL، وخاصة الشبكات العصبية التلافيفية (CNNs)، قد تفوقت على خوارزميات ML التقليدية مثل آلات الدعم الناقل (SVM) وأقرب الجيران (KNN) في تصنيف صور الفخار الأثري. لقد أظهرت نماذج CNN البارزة مثل VGG16 وMobileNet وResNet دقة متفوقة، مما يشير إلى تحول في هذا المجال نحو منهجيات DL.

تؤكد الورقة على إمكانية ML وDL لتعزيز كفاءة ودقة تصنيف الفخار، مما يكمل الطرق التقليدية. من خلال أتمتة عملية التصنيف، تسمح هذه الخوارزميات للآثاريين بتحليل مجموعات بيانات أكبر والتركيز على تحليلات وتفسيرات أكثر تعقيدًا. علاوة على ذلك، يمكن أن تكشف تقنيات ML عن أنماط خفية داخل بيانات الفخار، مما يوفر رؤى حول الممارسات الثقافية والسياقات التاريخية التي قد تظل غامضة من خلال طرق التعريف التقليدية. يدعو المؤلفون إلى تطوير مجموعات بيانات أكثر شمولاً ومعيارية لتعزيز تطبيق هذه التقنيات في البحث الأثري.

مقدمة

تحدد مقدمة الورقة أهمية الفخار كفئة حيوية من المواد التي استخدمها البشر لأكثر من 10,000 عام. يُلاحظ أن الفخار، الذي يشمل الفخار والخزف وأشكال أخرى، يتميز بمتانته وأهميته التاريخية، حيث يعمل كعلامات زمنية تساعد في فهم التطور الثقافي وشبكات التجارة بين المجتمعات القديمة. تؤكد الورقة على ضرورة تأريخ وتصنيف هذه القطع بدقة، مشددة على قيود طرق التعريف الحالية، التي تعتمد أساسًا على التجريب أو العلم. تعتمد الطرق التجريبية على تحليل الخبراء، مما يؤدي إلى عدم الاتساق، بينما تواجه التقنيات العلمية، مثل تأريخ الفلورسنس بالأشعة السينية وتأريخ اللمعان الحراري، تحديات تتعلق بالتلف والتكلفة واستهلاك الوقت.

لمعالجة هذه التحديات، يقترح المؤلفون استكشاف رؤية الكمبيوتر (CV) والذكاء الاصطناعي (AI) كأدوات مبتكرة لتحديد الفخار. لقد دفعت التقدمات الأخيرة في هذه المجالات الدراسات التي تحقق في تطبيقها في تحليل صور الفخار الأثري. تقدم الورقة مراجعة منهجية للأدبيات حول خوارزميات التعلم الآلي (ML) المستخدمة لهذا الغرض، تغطي الأبحاث المنشورة بين عامي 2013 و2023. وفقًا لمنهجية PRISMA، تحلل المراجعة 33 ورقة ذات صلة، مع التركيز على مجموعات البيانات وطرق معالجة الصور وخوارزميات ML. يتم توضيح هيكل الورقة، مما يشير إلى أن الأقسام التالية ستناقش مراجعات الأدبيات، وأسئلة البحث، والمنهجية، والنتائج، والاتجاهات المستقبلية في هذا المجال.

طرق

توضح قسم الطرق أسئلة البحث التي توجه الدراسة وتفاصيل تطبيق منهجية PRISMA (عناصر التقرير المفضلة للمراجعات المنهجية والتحليلات التلوية) لإجراء مراجعة منهجية. تم تصميم هذه المنهجية لضمان ملخص شامل وغير متحيز للأدبيات من خلال توفير نهج منظم لتحديد وتقييم وتجميع الدراسات ذات الصلة عبر قواعد بيانات مختلفة. يعزز إطار عمل PRISMA الشفافية وقابلية التكرار، مما يقلل من التحيز في عملية المراجعة.

بالإضافة إلى ذلك، يناقش القسم طرق معالجة الصور الحيوية لتصنيف صور الفخار القديمة، مشددًا على ثلاثة نهج رئيسية: استخراج الميزات، تقسيم الصور، وتحسين الصور. يتضمن استخراج الميزات تحديد كل من الخصائص الطبيعية والصناعية للصور، والتي تعتبر ضرورية لتقسيم الصور إلى مناطق ذات خصائص مشابهة. يعتمد تقسيم الصور بشكل أساسي على خصائص مثل السطوع ومكونات اللون، بينما تهدف تقنيات تحسين الصور إلى تحسين الجودة البصرية، مما يسهل استخراج البيانات. يشير القسم إلى تحول كبير نحو الشبكات العصبية التلافيفية (CNNs) لاستخراج الميزات وتقسيمها في الفخار الأثري منذ عام 2021، متجاوزًا الطرق التقليدية. على الرغم من مزايا CNNs، لا تزال هناك تحديات، مثل تباين جودة السطح وتعقيد القوام. يُقترح استخدام المحولات لاستخراج الميزات كمسار واعد للبحث المستقبلي، نظرًا لقدرتها على التقاط الاعتماديات بعيدة المدى وتحديد العلاقات المعقدة داخل الصور، مما قد يعزز فهم خصائص الفخار والسياق التاريخي.

نتائج

يوفر قسم “النتائج” تحليلًا كميًا للأوراق المسترجعة، مما يمهد الطريق لمناقشة مفصلة للنتائج البحثية. يتناول بشكل منهجي الأسئلة البحثية الثلاثة المطروحة في الدراسة، حيث يتم استكشاف كل سؤال في قسمه الخاص. يسمح هذا النهج المنظم بعرض واضح للبيانات والرؤى المستمدة من التحليل، مما يسهل فهمًا شاملاً لتداعيات النتائج.

مناقشة

في هذا القسم، تبحث الدراسة في تطبيق تقنيات التعلم الآلي (ML) لتحديد الفخار الأثري، مع معالجة ثلاثة أسئلة بحثية رئيسية: مجموعات البيانات المستخدمة (RQ1)، طرق معالجة الصور المستخدمة (RQ2)، وخوارزميات ML المطبقة (RQ3). قامت الدراسة بمراجعة منهجية لثلاثة وثلاثين مقالًا تمت مراجعتها من قبل الأقران، نشرت بين عامي 2013 و2023، مع التركيز على المصادر باللغة الإنجليزية من مكتبات رقمية بارزة. شملت عملية المراجعة منهجية اختيار صارمة، مما أسفر عن مجموعة بيانات نهائية تبرز هيمنة الخزف والفخار الصيني عبر فترات ودول مختلفة، مع تركيز ملحوظ على سلالة مينغ.

تكشف النتائج أن الغالبية العظمى من الدراسات التي تمت مراجعتها (97%) استخدمت تقنيات معالجة الصور، بشكل أساسي لاستخراج الميزات وتقسيمها، حيث كانت نماذج التعلم العميق، وخاصة الشبكات العصبية التلافيفية (CNNs)، هي الخوارزميات الأكثر استخدامًا (60.7%). كانت أداء هذه الخوارزميات عمومًا مرتفعًا، حيث تجاوزت دقة التصنيف 70%، وفي بعض الحالات، تجاوزت 90%. ومع ذلك، تحدد الدراسة أيضًا تحديات حاسمة، لا سيما ندرة البيانات والحاجة إلى مجموعات بيانات شاملة تتضمن توثيقًا مفصلًا لقطع الفخار. تشير الاتجاهات المستقبلية إلى ضرورة إنشاء ممارسات تسجيل معيارية وإمكانية استخدام توليد البيانات الاصطناعية لتعزيز قوة مجموعة البيانات، مما يحسن فعالية تطبيقات ML في السياقات الأثرية.

القيود

تتعدد قيود هذه المراجعة المنهجية، التي أجريت وفقًا لمنهجية PRISMA. أولاً، كانت المراجعة انتقائية، حيث ركزت فقط على المقالات العلمية من مجلات موثوقة وإجراءات المؤتمرات لضمان محتوى عالي الجودة، مما قد يحد من نطاق الرؤى. بالإضافة إلى ذلك، كان استبعاد تحليل الاقتباسات يهدف إلى الحفاظ على الإيجاز، لكن هذا الخيار قد يتجاهل الروابط القيمة داخل الأدبيات. يمكن أن تعزز الأبحاث المستقبلية شمولية المراجعات من خلال دمج مجموعة أوسع من المصادر واستخدام الرسوم البيانية المعرفية لتوضيح الاتجاهات ونقاط البحث الساخنة في تطبيق خوارزميات التعلم الآلي لتحديد الفخار الأثري.

علاوة على ذلك، حددت المراجعة مجالات بحث غير معالجة، لا سيما دمج خوارزميات التعلم العميق (DL) مع تقنيات أخرى، مثل المسح ثلاثي الأبعاد والواقع الافتراضي (VR)، مما قد يسهل تحليلات أكثر شمولاً للفخار الأثري. لا تزال الدراسات المقارنة عبر الثقافات غير مستكشفة بشكل كافٍ، لا سيما في كيفية تطبيق خوارزميات DL بشكل فعال لتحديد الفخار عبر ثقافات وسياقات تاريخية متنوعة. أخيرًا، تشمل المراجعة الدراسات المنشورة من 2013 إلى 2023، مما يشير إلى أن التحقيقات المستقبلية يمكن أن تستفيد من تضمين أبحاث أكثر حداثة من 2024 فصاعدًا، مما قد يؤدي إلى رؤى أعمق وتطبيقات أوسع في هذا المجال.

Journal: IEEE Access, Volume: 12
DOI: https://doi.org/10.1109/access.2024.3429623
Publication Date: 2024-01-01
Author(s): Ziyao Ling et al.
Primary Topic: Cultural Heritage Materials Analysis

Overview

The research paper provides a systematic literature review of thirty-three studies focused on the identification of archaeological ceramics through machine learning (ML) and deep learning (DL) techniques. It highlights the historical significance of ceramics in cultural heritage and critiques the traditional empirical and technical methods for their identification, which are often costly and time-consuming. The findings indicate that DL, particularly convolutional neural networks (CNNs), has outperformed traditional ML algorithms such as Support Vector Machines (SVM) and K-Nearest Neighbors (KNN) in classifying archaeological ceramic images. Notable CNN models like VGG16, MobileNet, and ResNet have demonstrated superior accuracy, suggesting a shift in the field towards DL methodologies.

The paper emphasizes the potential of ML and DL to enhance the efficiency and accuracy of ceramic classification, thereby complementing traditional methods. By automating the classification process, these algorithms allow archaeologists to analyze larger datasets and focus on more complex analyses and interpretations. Furthermore, ML techniques can reveal hidden patterns within ceramic data, providing insights into cultural practices and historical contexts that may remain obscured through conventional identification methods. The authors advocate for the development of more comprehensive and standardized datasets to further advance the application of these technologies in archaeological research.

Introduction

The introduction of the paper outlines the significance of ceramics as a vital category of materials that have been utilized by humans for over 10,000 years. Ceramics, which include pottery, porcelain, and other forms, are noted for their durability and historical importance, serving as chronological markers that aid in understanding cultural evolution and trade networks among ancient societies. The paper emphasizes the necessity of accurately dating and classifying these artifacts, highlighting the limitations of current identification methods, which are primarily empirical or scientific. Empirical methods rely on expert analysis, leading to inconsistencies, while scientific techniques, such as X-ray fluorescence and thermoluminescence dating, face challenges related to damage, cost, and time consumption.

To address these challenges, the authors propose the exploration of Computer Vision (CV) and Artificial Intelligence (AI) as innovative tools for ceramic identification. Recent advancements in these fields have prompted studies investigating their application in analyzing archaeological ceramic images. The paper presents a systematic literature review of machine learning (ML) algorithms used for this purpose, covering research published between 2013 and 2023. Following the PRISMA methodology, the review analyzes 33 relevant papers, focusing on datasets, image processing methods, and ML algorithms. The structure of the paper is outlined, indicating that subsequent sections will discuss literature reviews, research questions, methodology, results, and future directions in the field.

Methods

The Methods section outlines the research questions guiding the study and details the application of the PRISMA (Preferred Reporting Items for Systematic reviews and Meta-Analyses) methodology for conducting a systematic review. This methodology is designed to ensure a comprehensive, unbiased literature summary by providing a structured approach to identifying, evaluating, and synthesizing relevant studies across various databases. The PRISMA framework enhances transparency and reproducibility, thereby minimizing bias in the review process.

Additionally, the section discusses image processing methods critical for classifying ancient ceramic images, highlighting three main approaches: feature extraction, image segmentation, and image enhancement. Feature extraction involves identifying both natural and artificial attributes of images, which are essential for segmenting images into regions with similar properties. Image segmentation is primarily based on attributes such as luminance and color components, while image enhancement techniques aim to improve visual quality, facilitating data extraction. The section notes a significant shift towards Convolutional Neural Networks (CNNs) for feature extraction and segmentation in archaeological ceramics since 2021, surpassing traditional methods. Despite the advantages of CNNs, challenges remain, such as variability in surface quality and texture complexity. The potential use of Transformers for feature extraction is proposed as a promising avenue for future research, given their ability to capture long-range dependencies and identify intricate relationships within images, which could enhance the understanding of ceramic characteristics and historical context.

Results

The “Results” section provides a quantitative analysis of the retrieved papers, laying the groundwork for a detailed discussion of the research findings. It systematically addresses the three research questions posed in the study, with each question explored in its own subsection. This structured approach allows for a clear presentation of the data and insights derived from the analysis, facilitating a comprehensive understanding of the implications of the findings.

Discussion

In this section, the research investigates the application of machine learning (ML) techniques for identifying archaeological ceramics, addressing three primary research questions: the datasets used (RQ1), the image processing methods employed (RQ2), and the ML algorithms applied (RQ3). The study systematically reviewed thirty-three peer-reviewed articles published between 2013 and 2023, focusing on English-language sources from prominent digital libraries. The review process involved a rigorous selection methodology, resulting in a final dataset that highlights the predominance of Chinese porcelain and pottery across various periods and countries, with a notable emphasis on the Ming dynasty.

The findings reveal that a significant majority of the reviewed studies (97%) utilized image processing techniques, primarily for feature extraction and segmentation, with deep learning models, particularly Convolutional Neural Networks (CNNs), being the most frequently employed algorithms (60.7%). The performance of these algorithms was generally high, with classification accuracies exceeding 70%, and in some cases, surpassing 90%. However, the research also identifies critical challenges, particularly data scarcity and the need for comprehensive datasets that include detailed documentation of ceramic artefacts. Future directions suggest the establishment of standardized recording practices and the potential use of synthetic data generation to enhance dataset robustness, thereby improving the effectiveness of ML applications in archaeological contexts.

Limitations

The limitations of this systematic review, conducted following the PRISMA methodology, are multifaceted. Firstly, the review was selective, concentrating solely on scientific articles from reputable journals and conference proceedings to ensure high-quality content, which may restrict the breadth of insights. Additionally, the exclusion of citation analysis aimed to maintain conciseness, but this choice may overlook valuable connections within the literature. Future research could enhance the comprehensiveness of reviews by incorporating a wider range of sources and utilizing knowledge graphs to illustrate trends and research hotspots in the application of machine learning algorithms for archaeological ceramics identification.

Moreover, the review identified unaddressed research areas, notably the integration of deep learning (DL) algorithms with other technologies, such as 3D scanning and virtual reality (VR), which could facilitate more thorough analyses of archaeological ceramics. Cross-cultural comparative studies also remain underexplored, particularly in how DL algorithms can be effectively applied to identify ceramics across diverse cultures and historical contexts. Lastly, the review encompasses studies published from 2013 to 2023, indicating that future investigations could benefit from including more recent research from 2024 onwards, potentially yielding deeper insights and broader applications in the field.