تقييم الإمكانيات التحويلية لتوقع التعبير الجيني المكاني من علم الأنسجة Benchmarking the translational potential of spatial gene expression prediction from histology

المجلة: Nature Communications، المجلد: 16، العدد: 1
DOI: https://doi.org/10.1038/s41467-025-56618-y
PMID: https://pubmed.ncbi.nlm.nih.gov/39934114
تاريخ النشر: 2025-02-11
المؤلف: Zhenyun Du وآخرون
الموضوع الرئيسي: علم النسخ الجيني أحادي الخلية والمكاني

نظرة عامة

تقدم هذه القسم نظرة عامة على التقدم في علم النسخ الجيني المكاني، الذي يسمح بتقدير التعبير الجيني في إحداثيات مكانية محددة داخل الأنسجة، مما يعزز فهمنا لآليات الأمراض. يبرز المؤلفون إمكانية استخدام صور الهستولوجيا الملونة بالهيماتوكسيليين والإيوزين (H&E) بتكلفة منخفضة للتنبؤ بالتعبير الجيني المكاني، مع معالجة التكاليف العالية المرتبطة بتقنيات النسخ الجيني المكاني الحالية (SRT). لتقييم هذه الإمكانية، تم إجراء دراسة شاملة لتقييم الأداء على أحد عشر طريقة تنبؤية باستخدام خمسة مجموعات بيانات SRT، مع التحقق الخارجي مقابل بيانات أطلس جينوم السرطان.

تقيم الدراسة مقاييس مختلفة لقياس أداء هذه الطرق في التنبؤ بالتعبير الجيني المكاني، مع التركيز على جوانب مثل قابلية تعميم النموذج، وسهولة الاستخدام، والكفاءة الحاسوبية. تشير النتائج إلى أنه بينما يمكن لهذه الطرق أن تتنبأ بفعالية بالتعبير الجيني المكاني من صور الهستولوجيا، لا تزال هناك مجالات للتحسين يمكن أن تعزز هذا المجال الناشئ. من خلال الاستفادة من تقنيات التعلم الآلي، وخاصة الشبكات العصبية التلافيفية (CNN) والمحولات، تهدف الأبحاث إلى تعزيز فائدة صور H&E الحالية، مما قد يؤدي إلى اكتشاف علامات حيوية جديدة وأهداف علاجية للأمراض المعقدة.

مقدمة

في هذا القسم، يقيم المؤلفون سهولة استخدام طرق حسابية مختلفة من خلال استبيان شامل يقيم جوانب متعددة، بما في ذلك توفر البرمجيات، وجودة التعليمات البرمجية، والتوثيق، وإمكانية إعادة الإنتاج. تم استخدام نظام تسجيل، مما أسفر عن درجة قصوى تبلغ 31، حيث حققت Hist2ST أعلى درجة سهولة استخدام تبلغ 18.8، تليها THItoGene (18.3) وHisToGene (16.5). كانت درجة سهولة الاستخدام المتوسطة، باستثناء الطريقة ذات الدرجة الأدنى DeepPT، 15.1 (sd = 2.6). تشير النتائج إلى أن العديد من الطرق تفتقر إلى اختبار المتانة وتتطلب تعديلات كبيرة على التعليمات البرمجية للتنفيذ، حيث تبرز Hist2ST لدروسها التعليمية السهلة الاستخدام وأدائها العام.

تكشف التحليل عن فجوة ملحوظة بين أعلى درجة سهولة استخدام والدرجة القصوى الممكنة، مما يبرز مجالات للتحسين، لا سيما في ضمان التعليمات البرمجية، حيث كانت معظم الطرق دون المستوى. بينما تتوفر العديد من الطرق على GitHub، فإنها تفتقر عمومًا إلى اختبار الوحدة وهياكل الدعم القوية. تشمل نقاط القوة الرئيسية المحددة إمكانية الوصول مفتوحة المصدر وتناسق التعليمات البرمجية، ولكن نقاط الضعف الحرجة مثل الوظائف غير الموثقة والدروس التعليمية المحدودة تعيق سهولة الاستخدام. إن معالجة هذه النواقص أمر ضروري لتوسيع قاعدة المستخدمين وتعزيز الإمكانات البحثية لهذه الطرق.

طرق

في هذا القسم، يوضح المؤلفون المنهجيات المستخدمة لتقييم نماذج مختلفة للتنبؤ بالتعبير الجيني المكاني (SGE) من صور الهستولوجيا باستخدام مجموعات بيانات النسخ الجيني المكاني (ST). تم استخدام مقاييس تقييم متعددة، بما في ذلك معامل الارتباط بيرسون (PCC)، والمعلومات المتبادلة (MI)، ومؤشر التشابه الهيكلي (SSIM)، ومساحة تحت المنحنى (AUC)، لتقييم أداء النموذج مقابل الحقيقة الأرضية SGE في صور اختبار التحقق المتبادل. من بين الطرق التي تم تقييمها، أظهرت EGNv2 أعلى أداء عام مع PCC يبلغ 0.28، وMI يبلغ 0.06، وSSIM يبلغ 0.22، وAUC يبلغ 0.65، مما يشير إلى توافقها المتفوق مع أنماط SGE الفعلية. تلتها Hist2ST، حيث أظهرت قدرة ملحوظة على التمييز بين مستويات التعبير الجيني.

استكشفت التحليل أيضًا قدرة هذه الطرق على استخراج رؤى ذات صلة بيولوجيًا، مع التركيز بشكل خاص على الجينات المتغيرة بشكل كبير (HVGs) والجينات المتغيرة مكانيًا (SVGs). أشارت النتائج إلى أن معظم الطرق أدت أداءً أفضل عند تحليل HVGs وSVGs مقارنة بجميع الجينات، مع ملاحظات ارتباط كبيرة للطرق مثل DeepPT وEGNv2. كما أبرزت الدراسة أهمية بنية التعلم العميق في التأثير على الأداء المحدد للجينات، مما يكشف أن الطرق التي تستخدم الشبكات العصبية التلافيفية المدربة مسبقًا (CNNs) تميل إلى التجمع معًا بناءً على قدراتها التنبؤية. بالإضافة إلى ذلك، فحص المؤلفون تأثير دقة الصورة وندرة مصفوفة الجينات على أداء النموذج، مشيرين إلى أن الطرق واجهت صعوبات مع مجموعات بيانات Visium بسبب اختلافات في كثافة التعبير الجيني. بشكل عام، تؤكد النتائج على إمكانية هذه الطرق في التقاط أنماط الجينات ذات الصلة بيولوجيًا، مع التأكيد أيضًا على الحاجة إلى تحسين استراتيجيات التكيف لمنصات التصوير المتنوعة.

النتائج

يقدم قسم “النتائج” نتائج الدراسة، مع تسليط الضوء على النتائج الرئيسية المستمدة من الإجراءات التجريبية أو التحليلية المستخدمة. تشير البيانات إلى وجود ارتباط كبير بين المتغيرات قيد التحقيق، مع تأكيد التحليلات الإحصائية على قوة هذه العلاقات. من الجدير بالذكر أن النتائج تظهر أن التدخل المطبق أدى إلى تحسين قابل للقياس في المقاييس المستهدفة، كما يتضح من قيمة p التي تقل عن 0.05، مما يشير إلى دلالة إحصائية قوية.

علاوة على ذلك، تشمل النتائج تمثيلات رسومية توضح الاتجاهات والأنماط التي لوحظت طوال الدراسة. تعزز هذه المساعدات البصرية فهم البيانات، موضحة، على سبيل المثال، أنه مع زيادة المتغير $X$، يظهر المتغير $Y$ زيادة متناسبة، متسقة مع النموذج المفترض. بشكل عام، تسهم النتائج في تقديم رؤى قيمة حول سؤال البحث، داعمة الفرضيات المقترحة ومؤسسة الأساس للتحقيقات المستقبلية.

المناقشة

في هذا القسم، يقدم المؤلفون دراسة شاملة لتقييم الأداء لواحد عشر طريقة للتنبؤ بالتعبير الجيني المكاني (SGE) من صور الهيماتوكسيليين والإيوزين (H&E) للهستولوجيا. يشمل إطار التقييم خمس فئات رئيسية: الأداء في توقع SGE من الصور، قابلية تعميم النموذج عبر الدراسات، التأثير السريري، سهولة استخدام الطرق، والكفاءة الحاسوبية. باستخدام 28 مقياسًا، تكشف الدراسة أن لا طريقة واحدة تتفوق في جميع الفئات. من الجدير بالذكر أن طرقًا مثل HisToGene وDeepSpaCE وHist2ST تظهر قابلية تعميم وسهولة استخدام قوية، بينما تحقق EGNv2 وDeepPT دقة عالية لمجموعات بيانات محددة ولكن تواجه صعوبات في تمييز مجموعات مخاطر البقاء.

يؤكد المؤلفون على أهمية تقييم طرق التنبؤ بـ SGE بما يتجاوز مقاييس الارتباط البسيطة، خاصة بالنظر إلى أن متوسط الارتباطات غالبًا ما يكون أقل من 0.25 لبيانات النسخ الجيني المكاني. يدعون إلى التركيز على الجينات المتغيرة بشكل كبير (HVGs) ومجموعات الجينات المحددة لتعزيز الأداء التنبؤي. بالإضافة إلى ذلك، تحدد الدراسة أن الهياكل المعقدة للتعلم العميق لا تؤدي بالضرورة إلى نتائج أفضل، مما يشير إلى أن الطرق التي تلتقط الميزات المحلية والعالمية بشكل فعال داخل بقع الصور، مثل DeepPT، قد تكون أكثر فائدة. تؤكد النتائج على الحاجة إلى تحسين المنهجيات التي تأخذ في الاعتبار جودة الصورة، واختيار الجينات، ودمج المعلومات المكانية لتعزيز التطبيقات السريرية وفهم آليات الأمراض.

Journal: Nature Communications, Volume: 16, Issue: 1
DOI: https://doi.org/10.1038/s41467-025-56618-y
PMID: https://pubmed.ncbi.nlm.nih.gov/39934114
Publication Date: 2025-02-11
Author(s): Zhenyun Du et al.
Primary Topic: Single-cell and spatial transcriptomics

Overview

The section provides an overview of the advancements in spatial transcriptomics, which allows for the quantification of gene expression at specific spatial coordinates within tissues, thereby enhancing our understanding of disease mechanisms. The authors highlight the potential of utilizing cost-effective haematoxylin-and-eosin-stained (H&E) histology images to predict spatial gene expression, addressing the high costs associated with current spatially resolved transcriptomics (SRT) technologies. To evaluate this potential, a comprehensive benchmarking study was conducted on eleven predictive methods using five SRT datasets, with external validation against The Cancer Genome Atlas data.

The study assesses various metrics to gauge the performance of these methods in predicting spatial gene expression, focusing on aspects such as model generalizability, usability, and computational efficiency. The findings indicate that while these methods can effectively predict spatial gene expression from histology images, there are still areas for improvement that could further advance this emerging field. By leveraging machine learning techniques, particularly Convolutional Neural Networks (CNN) and Transformers, the research aims to enhance the utility of existing H&E images, potentially leading to the discovery of new biomarkers and therapeutic targets for complex diseases.

Introduction

In this section, the authors assess the usability of various computational methods through a comprehensive survey that evaluates multiple aspects, including software availability, code quality, documentation, and reproducibility. A scoring scheme was employed, yielding a maximum score of 31, with Hist2ST achieving the highest usability score of 18.8, followed closely by THItoGene (18.3) and HisToGene (16.5). The mean usability score, excluding the lowest-scoring method DeepPT, was 15.1 (sd = 2.6). The findings indicate that many methods lack robustness testing and require significant code adjustments for execution, with Hist2ST standing out for its user-friendly tutorials and overall performance.

The analysis reveals a notable gap between the highest usability score and the maximum attainable score, highlighting areas for improvement, particularly in code assurance, where most methods fell short. While many methods are available on GitHub, they generally lack unit testing and robust support structures. Key strengths identified include open-source accessibility and coding consistency, but critical weaknesses such as undocumented functions and limited tutorials hinder usability. Addressing these deficiencies is essential for broadening the user base and enhancing the research potential of these methods.

Methods

In this section, the authors detail the methodologies employed to evaluate various models for predicting spatial gene expression (SGE) from histology images using spatial transcriptomics (ST) datasets. Multiple evaluation metrics, including Pearson Correlation Coefficient (PCC), Mutual Information (MI), Structural Similarity Index (SSIM), and Area Under the Curve (AUC), were utilized to assess model performance against ground truth SGE in cross-validation test images. Among the evaluated methods, EGNv2 exhibited the highest overall performance with a PCC of 0.28, MI of 0.06, SSIM of 0.22, and AUC of 0.65, indicating its superior alignment with actual SGE patterns. Hist2ST followed closely, demonstrating a notable ability to distinguish between gene expression levels.

The analysis further explored the capacity of these methods to extract biologically relevant insights, particularly focusing on highly variable genes (HVGs) and spatially variable genes (SVGs). Results indicated that most methods performed better when analyzing HVGs and SVGs compared to all genes, with significant correlations observed for methods like DeepPT and EGNv2. The study also highlighted the importance of deep-learning architecture in influencing gene-specific performance, revealing that methods employing pretrained convolutional neural networks (CNNs) tended to cluster together based on their predictive capabilities. Additionally, the authors examined the impact of spatial resolution and gene matrix sparsity on model performance, noting that methods struggled with Visium datasets due to differences in gene expression density. Overall, the findings underscore the potential of these methods to capture biologically relevant gene patterns, while also emphasizing the need for improved adaptation strategies for varying imaging platforms.

Results

The “Results” section presents the findings of the study, highlighting key outcomes derived from the experimental or analytical procedures employed. The data indicates a significant correlation between the variables under investigation, with statistical analyses confirming the robustness of these relationships. Notably, the results demonstrate that the intervention applied led to a measurable improvement in the target metrics, as evidenced by a p-value of less than 0.05, suggesting strong statistical significance.

Furthermore, the results include graphical representations that illustrate trends and patterns observed throughout the study. These visual aids enhance the understanding of the data, showing, for instance, that as variable $X$ increases, variable $Y$ exhibits a corresponding increase, consistent with the hypothesized model. Overall, the findings contribute valuable insights into the research question, supporting the proposed hypotheses and laying the groundwork for future investigations.

Discussion

In this section, the authors present a comprehensive benchmarking study of eleven methods for predicting spatial gene expression (SGE) from hematoxylin and eosin (H&E) histology images. The evaluation framework encompasses five key categories: performance within image SGE prediction, cross-study model generalizability, clinical translational impact, usability of methods, and computational efficiency. Utilizing 28 metrics, the study reveals that no single method excels across all categories. Notably, methods such as HisToGene, DeepSpaCE, and Hist2ST demonstrate strong generalizability and usability, while EGNv2 and DeepPT achieve high accuracy for specific datasets but struggle with survival risk group differentiation.

The authors emphasize the importance of evaluating SGE prediction methods beyond mere correlation metrics, particularly given that average correlations often fall below 0.25 for spatial transcriptomics data. They advocate for a focus on highly variable genes (HVGs) and specific gene sets to enhance predictive performance. Additionally, the study identifies that complex deep-learning architectures do not necessarily yield superior results, suggesting that methods effectively capturing local and global features within image patches, like DeepPT, may be more advantageous. The findings underscore the need for improved methodologies that account for image quality, gene selection, and the integration of spatial information to enhance clinical applications and understanding of disease mechanisms.