تقييم عمر العظام الآلي وتوقع طول البالغين من صور الأشعة السينية ليد الأطفال عبر إطار عمل التعلم العميق المتسلسل Automated Bone Age Assessment and Adult Height Prediction from Pediatric Hand Radiographs via a Cascaded Deep Learning Framework

المجلة: Journal of Medical Systems، المجلد: 49، العدد: 1
DOI: https://doi.org/10.1007/s10916-025-02306-9
PMID: https://pubmed.ncbi.nlm.nih.gov/41296212
تاريخ النشر: 2025-11-26
المؤلف: Nihui Pei وآخرون
الموضوع الرئيسي: دراسات الأنثروبولوجيا الجنائية والبيوآركيولوجيا

نظرة عامة

تقدم هذه الدراسة نهجًا آليًا جديدًا لتقييم عمر العظام وتوقع طول البالغين باستخدام نموذج تعلم عميق متسلسل تم تطبيقه على أشعة اليد للأطفال. تعتمد الطرق التقليدية، التي تعتمد على التفسير اليدوي، غالبًا على التقدير الشخصي وتستغرق وقتًا طويلاً، مما يؤدي إلى تباين في النتائج. استخدمت الدراسة مجموعة بيانات تتكون من 8,242 صورة أشعة لليد اليسرى من أطفال صينيين، تم وضع علامات عليها من قبل أطباء الغدد الصماء للأطفال وفقًا لمعيار الصين-05. استخدم النموذج Yolact لتجزئة الكائنات لتحديد وتصنيف هياكل العظام، تلاه شبكات فرعية من ResNet-18 لتقييم مراكز التكلس. أظهرت النتائج ارتباطًا عاليًا لبيرسون قدره 0.98 لعمر العظام و0.94 لتوقعات طول البالغين، مع متوسط أخطاء مطلقة قدرها 0.25 سنة و1.75 سم، على التوالي. كان متوسط وقت الاستدلال 7.8 ثوانٍ، مما يشير إلى تحسين كبير في الكفاءة السريرية.

تخلص الدراسة إلى أن نموذج التعلم العميق المتسلسل المقترح لا يعزز الدقة في تقييم عمر العظام فحسب، بل يوفر أيضًا توقعات موثوقة لطول البالغين، متجاوزًا الطرق السابقة في الأداء. ومع ذلك، تشمل القيود تحيز مجموعة البيانات الجغرافي وتوزيع العمر غير المتوازن، مما قد يؤثر على قابلية تعميم النموذج واستقراره. يجب أن تركز الأبحاث المستقبلية على دمج مجموعات بيانات متنوعة ومتعددة المراكز لتحسين القوة والملاءمة عبر مختلف السكان الأطفال. بشكل عام، يبرز هذا النهج إمكانات تقنيات التعلم العميق في الأشعة للأطفال، مما يوفر خط أنابيب آلي بالكامل لاتخاذ القرارات السريرية في تقييم النمو.

مقدمة

تتناول مقدمة ورقة البحث القلق الاجتماعي المتزايد بشأن نمو الأطفال وتطورهم، وخاصة فيما يتعلق بقصر القامة ومعدلات النمو البطيئة. تبرز أهمية عمر العظام كمؤشر حاسم لنضج الهيكل العظمي، والذي يتم تقييمه من خلال أشعة اليد لتوفير قياس موضوعي لحالة النمو وتوقع الإمكانات المستقبلية للنمو. تُعتبر الطرق الحالية لتقييم عمر العظام، مثل أطلس Greulich-Pyle (GP) ونظام تسجيل Tanner-Whitehouse (TW)، غير كافية للأطفال الصينيين المعاصرين، مما أدى إلى تطوير طريقة الصين-05، التي تستخدم نظام تسجيل RUS-CHN المصمم لهذه الفئة السكانية.

تتناول المقدمة أيضًا قيود طرق تقييم عمر العظام التقليدية، بما في ذلك طبيعتها التي تستغرق وقتًا طويلاً وإمكانية التفسير الشخصي من قبل الأطباء. استجابةً لهذه التحديات، تقترح الورقة دمج تقنية التعلم العميق، وخاصة الشبكات العصبية التلافيفية، لتعزيز دقة وكفاءة تقييمات عمر العظام. تهدف الدراسة إلى تطوير نموذج شبكة متسلسل لا يقيم عمر العظام فحسب، بل يتوقع أيضًا طول البالغين، مما يقلل من التقدير الشخصي ويحسن من الملاءمة السريرية.

طرق

تحدد الدراسة نهجًا منهجيًا لتقييم عمر العظام وتوقع طول البالغين، كما هو موضح في الشكل 1. تنقسم المنهجية إلى ثلاث مراحل رئيسية: المعالجة المسبقة، معالجة الشبكة العصبية العميقة، والمعالجة اللاحقة.

في مرحلة المعالجة المسبقة، يتم توحيد بيانات الصورة لضمان الاتساق عبر المدخلات. تتضمن مرحلة معالجة الشبكة العصبية العميقة اكتشاف الكائنات وتصنيفها، حيث يقوم النموذج بتحديد الميزات ذات الصلة من الصور الموحدة. أخيرًا، تقوم مرحلة المعالجة اللاحقة بت quantifying المخرجات التي تم إنشاؤها بواسطة النموذج، مما يسهل تفسير النتائج. كل من هذه المراحل ضرورية لتعزيز دقة وموثوقية التوقعات التي يقدمها الخوارزم.

نتائج

قيمت الدراسة نموذجًا لتوقع عمر العظام وطول البالغين باستخدام مجموعة بيانات تتكون من 1,648 صورة أشعة لليد اليسرى، تم وضع علامات عليها جميعًا من قبل خبراء. أظهر النموذج ارتباطات إيجابية قوية مع علامات الخبراء، محققًا معاملات ارتباط بيرسون قدرها 0.98 (ص < 0.0001) لعمر العظام و0.94 (ص < 0.0001) لطول البالغين. كان حوالي 90% من نقاط التوقع قريبة من خطوط الانحدار الملائمة، مما يشير إلى أخطاء توقع طفيفة. تم تقييم عينة عشوائية من 100 صورة أشعة، مما كشف عن متوسط خطأ مطلق (MAE) قدره 0.25 سنة لعمر العظام، قابل للمقارنة مع أطباء الأشعة ذوي الخبرة، وانخفاض بنسبة 53.3% في الخطأ لتوقعات طول البالغين، مما يظهر دقة متفوقة. سلطت التجارب المقارنة الضوء على فعالية النموذج، حيث حقق MAE أقل قدره 0.25 سنة مقارنة بالطرق السابقة، مثل تلك التي قدمها Han et al. (MAE قدره 0.42 سنة) وSuh et al. (MAE قدره 0.48 سنة). أظهر الهيكل المتسلسل المقترح، الذي يدمج Yolact لتحديد مواقع مراكز التكلس وResNet لتصنيف النضج الهيكلي، أداءً محسنًا عبر مجموعات عمرية أطفال متنوعة. بالإضافة إلى ذلك، كان متوسط وقت معالجة النموذج 7.8 ثوانٍ لكل حالة، مما حسّن بشكل كبير من كفاءة سير العمل مقارنةً بأكثر من 10 دقائق المطلوبة للتقييمات اليدوية. تشير هذه النتائج إلى أن الطريقة المقترحة تقدم بديلاً واعدًا لتقييم عمر العظام التقليدي، مما قد يخفف من الأعباء السريرية مع الحفاظ على دقة التشخيص.

مناقشة

في هذه الدراسة، قدمنا نموذج شبكة تعلم عميق متسلسل جديد لتقييم عمر العظام وتوقع طول البالغين باستخدام صور أشعة اليد الأمامية والخلفية، مصمم خصيصًا لنظام التصنيف الصين-05. تتكون مجموعة بياناتنا من 8,242 صورة من أطفال تتراوح أعمارهم بين 3-18 سنة، وهي الأولى التي تشمل النطاق العمري الكامل لتطور طول الأطفال الصينيين. أظهر النموذج دقة عالية في تقييم عمر العظام وتوقع طول البالغين، خاصة في الفئة العمرية من 6-12 سنة، مع متوسط خطأ مطلق (MAE) قدره 0.25 سنة، متفوقًا على الطرق الحالية. حسّن نظام التحليل الذكي بشكل كبير من كفاءة المعالجة، حيث قلل من وقت تقييم الحالة من حوالي 10 دقائق إلى 7 ثوانٍ فقط.

يجمع الهيكل بين نموذج تجزئة الكائنات القائم على YOLACT مع مصنف ResNet101 متعدد المستويات، مما يسمح بتحديد وتصنيف دقيق لمراكز التكلس. تشير نتائجنا إلى قابلية تعميم قوية، حيث تتماشى توقعات النموذج بشكل وثيق مع تقييمات الخبراء، كما يتضح من الفرق المتوسط البالغ 0.19 سنة فقط في تقييمات عمر العظام. علاوة على ذلك، تشمل مخرجات النموذج خرائط تجزئة بصرية ومنحنيات نمو، مما يعزز من قابلية التفسير السريرية ويسهل التواصل الأفضل مع المرضى ومقدمي الرعاية. يجب أن تركز الأبحاث المستقبلية على توسيع مجموعة البيانات لتشمل مجموعات سكانية متنوعة ودمج ميزات هيكلية إضافية للعظام لتحسين قوة النموذج وملاءمته السريرية بشكل أكبر.

Journal: Journal of Medical Systems, Volume: 49, Issue: 1
DOI: https://doi.org/10.1007/s10916-025-02306-9
PMID: https://pubmed.ncbi.nlm.nih.gov/41296212
Publication Date: 2025-11-26
Author(s): Nihui Pei et al.
Primary Topic: Forensic Anthropology and Bioarchaeology Studies

Overview

This research presents a novel automated approach for bone age assessment and adult height prediction using a cascaded deep learning model applied to pediatric hand radiographs. Traditional methods, which depend on manual interpretation, are often subjective and time-consuming, leading to variability in results. The study utilized a dataset of 8,242 left-hand radiographs from Chinese children, annotated by pediatric endocrinologists according to the China-05 standard. The model employed Yolact for instance segmentation to identify and classify bone structures, followed by ResNet-18 subnetworks to evaluate ossification centers. The results demonstrated a high Pearson correlation of 0.98 for bone age and 0.94 for adult height predictions, with mean absolute errors of 0.25 years and 1.75 cm, respectively. The average inference time was 7.8 seconds, indicating a significant improvement in clinical efficiency.

The study concludes that the proposed cascaded deep learning model not only enhances accuracy in bone age assessment but also provides reliable adult height predictions, surpassing previous methods in performance. However, limitations include the dataset’s geographic bias and imbalanced age distribution, which may affect the model’s generalizability and stability. Future research should focus on incorporating diverse, multicenter datasets to improve robustness and applicability across different pediatric populations. Overall, this approach highlights the potential of deep learning technologies in pediatric radiology, offering a fully automated pipeline for clinical decision-making in growth evaluation.

Introduction

The introduction of the research paper addresses the growing societal concern regarding children’s growth and development, particularly in relation to short stature and slow growth rates. It highlights the significance of bone age as a critical indicator of skeletal maturity, which is assessed through hand X-rays to provide an objective measure of growth status and predict future growth potential. Current methods for bone age assessment, such as the Greulich-Pyle (GP) atlas and Tanner-Whitehouse (TW) scoring system, are noted to be inadequate for contemporary Chinese children, leading to the development of the China-05 method, which utilizes the RUS-CHN scoring system tailored for this population.

The introduction further discusses the limitations of traditional bone age assessment methods, including their time-consuming nature and the potential for subjective interpretation by clinicians. In response to these challenges, the paper proposes the integration of deep learning technology, specifically convolutional neural networks, to enhance the accuracy and efficiency of bone age assessments. The study aims to develop a cascaded network model that not only assesses bone age but also predicts adult height, thereby reducing subjectivity and improving clinical applicability.

Methods

The study outlines a systematic approach for bone age assessment and adult height prediction, as depicted in Figure 1. The methodology is divided into three key stages: preprocessing, deep neural network processing, and postprocessing.

In the preprocessing stage, image data is standardized to ensure consistency across inputs. The deep neural network processing stage involves object detection and classification, where the model identifies relevant features from the standardized images. Finally, the postprocessing stage quantifies the outputs generated by the model, facilitating the interpretation of results. Each of these stages is crucial for enhancing the accuracy and reliability of the predictions made by the algorithm.

Results

The study evaluated a model for bone age and adult height prediction using a dataset of 1,648 left-hand X-ray images, all of which were annotated by experts. The model demonstrated strong positive correlations with expert annotations, achieving Pearson correlation coefficients of 0.98 (p < 0.0001) for bone age and 0.94 (p < 0.0001) for adult height. Approximately 90% of prediction points were close to the fitted regression lines, indicating minimal prediction errors. A random sample of 100 X-ray images was assessed, revealing a mean absolute error (MAE) of 0.25 years for bone age, comparable to experienced radiologists, and a 53.3% reduction in error for adult height predictions, showcasing superior accuracy. Comparative experiments highlighted the model's effectiveness, achieving a lower MAE of 0.25 years compared to previous methods, such as those by Han et al. (MAE of 0.42 years) and Suh et al. (MAE of 0.48 years). The proposed cascaded architecture, integrating Yolact for ossification center localization and ResNet for skeletal maturity classification, demonstrated enhanced performance across diverse pediatric age groups. Additionally, the model's average processing time of 7.8 seconds per case significantly improved workflow efficiency compared to the over 10 minutes required for manual evaluations. These findings suggest that the proposed method offers a promising alternative to traditional bone age assessment, potentially alleviating clinical workloads while maintaining diagnostic accuracy.

Discussion

In this study, we introduced a novel cascaded deep learning network for bone age assessment and adult height prediction using hand anteroposterior X-ray images, specifically tailored to the China-05 grading system. Our dataset, comprising 8,242 images from children aged 3-18 years, is the first to encompass the full age range of Chinese pediatric height development. The model demonstrated high accuracy in bone age assessment and adult height prediction, particularly in the 6-12 year age group, with a mean absolute error (MAE) of 0.25 years, outperforming existing methods. The intelligent analysis system significantly enhanced processing efficiency, reducing case evaluation time from approximately 10 minutes to just 7 seconds.

The architecture integrates a YOLACT-based instance segmentation model with a multi-level ResNet101 classifier, allowing for precise localization and classification of ossification centers. Our results indicate strong generalizability, with the model’s predictions closely aligning with expert evaluations, as evidenced by a mean difference of only 0.19 years in bone age assessments. Furthermore, the model’s outputs include visual segmentation maps and growth curves, enhancing clinical interpretability and facilitating better communication with patients and caregivers. Future research should focus on expanding the dataset to include diverse populations and integrating additional structural features of bones to further improve model robustness and clinical applicability.