التحقيق فيما إذا كانت نماذج التعلم العميق للتجميع المشترك تتعلم فيزياء تفاعلات البروتين-الليغاند Investigating whether deep learning models for co-folding learn the physics of protein-ligand interactions

المجلة: Nature Communications، المجلد: 16، العدد: 1
DOI: https://doi.org/10.1038/s41467-025-63947-5
PMID: https://pubmed.ncbi.nlm.nih.gov/41053181
تاريخ النشر: 2025-10-06
المؤلف: Matthew R. Masters وآخرون
الموضوع الرئيسي: هيكل البروتين والديناميات

نظرة عامة

تقدم هذه القسم نظرة عامة على التقدمات الأخيرة في توقع هيكل البروتينات والليغاندات المعتمدة على التعلم العميق، مع تسليط الضوء بشكل خاص على نماذج الطي المشترك مثل AlphaFold 3 (AF3) وRoseTTAFold All-Atom (RFAA). لقد أظهرت هذه النماذج تحسينات كبيرة في توقع هياكل البروتينات التي تتفاعل مع مختلف الجزيئات الحيوية، محققة دقة عالية من خلال الاستفادة من بيانات التسلسل التطورية والقوالب الهيكلية. يمثل إدخال الطي المشترك، حيث يتم توقع هياكل البروتين والليغاند في وقت واحد، ابتكارًا ملحوظًا في هذا المجال.

ومع ذلك، تثير الدراسة مخاوف حاسمة بشأن التزام هذه النماذج بالمبادئ الفيزيائية الأساسية. من خلال استخدام أمثلة معادية تعكس المفاهيم الفيزيائية والكيميائية والبيولوجية الراسخة، يحدد المؤلفون تناقضات كبيرة في التوقعات الهيكلية لمجمعات البروتين-الليغاند. تشير هذه النتائج إلى أن النماذج قد تكون تتكيف بشكل مفرط مع ميزات معينة في بيانات التدريب الخاصة بها، مما يؤدي إلى قيود في قابليتها للتعميم عبر تفاعلات البروتين-الليغاند المتنوعة. يؤكد المؤلفون على أهمية دمج المبادئ الفيزيائية والكيميائية القوية في تطوير أدوات التنبؤ، داعين إلى نهج حذر في الاعتماد على نماذج التعلم العميق للتطبيقات في اكتشاف الأدوية وهندسة البروتينات، حيث يكون الفهم الشامل للخصائص الفيزيائية الأساسية أمرًا ضروريًا.

مقدمة

تناقش مقدمة هذه الورقة البحثية التقدمات في توقع هيكل البروتينات من خلال نماذج التعلم العميق، مع تسليط الضوء بشكل خاص على AlphaFold 2 وRoseTTAFold، التي حققت تحسينات كبيرة في الدقة. ومع ذلك، تشير إلى أن هذه النماذج تظهر نقاط ضعف، حيث يمكن أن تؤدي حتى الاضطرابات البيولوجية البسيطة إلى تناقضات كبيرة في الهياكل المتوقعة. تتماشى هذه المشكلة مع النتائج الأوسع في التعلم الآلي، حيث غالبًا ما تتداخل النماذج بشكل جيد ولكنها تكافح مع الاستقراء عندما تواجه مدخلات غير مألوفة. اعتمدت النسخ الأخيرة من AlphaFold وRoseTTAFold أساليب قائمة على الانتشار لنمذجة هياكل كيميائية متنوعة، محققة نتائج ملحوظة، بينما وصلت نماذج أخرى مثل Chai-1 وBoltz-1 إلى مستويات دقة قابلة للمقارنة.

تنتقد الورقة قابلية نماذج التعلم العميق للهجمات المعادية والعيوب غير الفيزيائية، مثل الاصطدامات الستيرية، مشددة على أن أدائها غالبًا ما يعتمد على قدرات العثور على الجيوب بدلاً من التفاعلات الجزيئية التفصيلية. تضمنت الطرق السابقة لتوليد الأمثلة المعادية بحثًا حسابيًا عن الاضطرابات التي غيرت المخرجات بشكل كبير، مما يكشف عن منحدرات دقة في هذه الشبكات. يقترح المؤلفون نهجًا جديدًا يولد أمثلة معادية تستند إلى المبادئ الفيزيائية والكيميائية والبيولوجية، مع التركيز على نمذجة مجمعات البروتين-الليغاند. من خلال تغيير بقايا مواقع الربط وتعديل الليغاندات لتعطيل التفاعلات، تهدف هذه الطريقة إلى توفير قابلية تفسير أكبر وتسليط الضوء على قيود النماذج، مثل اعتمادها على الارتباطات الإحصائية بدلاً من فهم حقيقي للعمليات الفيزيائية.

طرق

في هذه الدراسة، أجرينا سلسلة من التحديات المعادية لتقييم المتانة الفيزيائية لأربعة نماذج متقدمة من الطي المشترك: AlphaFold3 وRosettaFold All-Atom وChai-1 وBoltz-1. تم تقييم كل نموذج من خلال توليد أمثلة ذات نتائج فيزيائية محددة مسبقًا ومقارنة التوقعات التي قدمتها النماذج مع هذه التوقعات. النماذج المختارة معروفة بأدائها التنافسي عبر معايير مختلفة، ومع ذلك لا تزال مرونتها تجاه السيناريوهات المعادية غير مفحوصة. بالنسبة للتجارب، تم تنفيذ جميع النماذج تحت الإعدادات الافتراضية دون فرض أي قيود إضافية قد تؤثر على أدائها.

نتائج

يقدم قسم “النتائج” في الورقة البحثية النتائج الرئيسية المستمدة من التجارب أو التحليلات التي أجريت. يسلط الضوء على النتائج المهمة التي تدعم الفرضيات أو الأسئلة البحثية المطروحة في الدراسة. عادةً ما يتم توضيح النتائج من خلال أشكال مختلفة من تمثيل البيانات، مثل الجداول أو الرسوم البيانية أو التحليلات الإحصائية، التي توفر فهمًا بصريًا واضحًا للنتائج.

قد يتضمن القسم أيضًا مقارنات بين المجموعات التجريبية، والأهمية الإحصائية للنتائج، وأي اتجاهات أو أنماط تم ملاحظتها. بالإضافة إلى ذلك، قد يناقش الآثار المترتبة على هذه النتائج بالنسبة للأدبيات الموجودة، مشددًا على كيفية مساهمتها في المجال الأوسع للدراسة. بشكل عام، تعتبر النتائج حاسمة للتحقق من البحث وتوجيه التحقيقات المستقبلية.

مناقشة

يقيم قسم المناقشة في الورقة البحثية أداء نماذج الطي المشترك في توقع تفاعلات البروتين-الليغاند، خاصةً في ظل ظروف طفرات مواقع الربط وتعديلات الليغاند. تكشف الدراسة أنه بينما تظهر هذه النماذج، بما في ذلك AlphaFold3 وRosettaFold All-Atom، دقة عالية في توقع أوضاع ربط الليغاند للهياكل الأصلية، إلا أنها تظهر تحيزات كبيرة عند مواجهة الاضطرابات المتعمدة. على وجه التحديد، تميل النماذج إلى الحفاظ على هندسة ربط الليغاند الأصلية حتى بعد تغييرات كبيرة في موقع الربط أو هيكل الليغاند، مما يؤدي إلى توقعات غير واقعية تتميز بالاصطدامات الستيرية والتفاعلات غير المواتية. يشير هذا إلى أن النماذج تعتمد بشكل أساسي على الأنماط المتعلمة من بيانات التدريب بدلاً من التقاط المبادئ الفيزيائية الحيوية الأساسية التي تحكم التعرف الجزيئي بدقة.

تشير النتائج إلى أن النماذج تكافح للتكيف مع التغييرات في موقع الربط أو الليغاند، مما يؤدي غالبًا إلى توقعات لا تعكس الواقع الفيزيائي للنظام. على سبيل المثال، في التحديات التي تم فيها تغيير بقايا مواقع الربط إلى الجلايسين أو الفينيل ألانين، استمرت النماذج في وضع الليغاندات في مواقع الربط الأصلية على الرغم من غياب التفاعلات المواتية. بالإضافة إلى ذلك، تسلط الدراسة الضوء على أن اعتماد النماذج على محاذاة التسلسلات المتعددة والميزات المستندة إلى القوالب قد يسهم في عدم قدرتها على الاستجابة بشكل مناسب للطفرات. يدعو المؤلفون إلى نهج حذر في استخدام هذه النماذج في التطبيقات الحرجة مثل اكتشاف الأدوية، مشددين على الحاجة إلى التحقق الإضافي من خلال طرق قائمة على الفيزياء وتقييم شامل لتقييم متانة ودقة النموذج. بشكل عام، بينما تمثل نماذج الطي المشترك تقدمًا كبيرًا في نمذجة الجزيئات الحيوية، فإن قيودها تبرز ضرورة تحسينات منهجية إضافية ودمج المبادئ الفيزيائية لتعزيز قدراتها التنبؤية.

Journal: Nature Communications, Volume: 16, Issue: 1
DOI: https://doi.org/10.1038/s41467-025-63947-5
PMID: https://pubmed.ncbi.nlm.nih.gov/41053181
Publication Date: 2025-10-06
Author(s): Matthew R. Masters et al.
Primary Topic: Protein Structure and Dynamics

Overview

The section provides an overview of recent advancements in deep-learning-based protein-ligand structure prediction, particularly highlighting the co-folding models such as AlphaFold 3 (AF3) and RoseTTAFold All-Atom (RFAA). These models have demonstrated significant improvements in predicting the structures of proteins interacting with various biomolecules, achieving high accuracy by leveraging evolutionary sequence data and structural templates. The introduction of co-folding, where protein and ligand structures are predicted simultaneously, marks a notable innovation in the field.

However, the study raises critical concerns regarding the adherence of these models to fundamental physical principles. Through the use of adversarial examples that reflect established physical, chemical, and biological concepts, the authors identify substantial discrepancies in the structural predictions of protein-ligand complexes. These findings suggest that the models may be overfitting to specific features in their training data, leading to limitations in their generalizability across diverse protein-ligand interactions. The authors emphasize the importance of incorporating robust physical and chemical priors in the development of predictive tools, advocating for a cautious approach in relying on deep-learning models for applications in drug discovery and protein engineering, where a thorough understanding of underlying physical properties is essential.

Introduction

The introduction of this research paper discusses the advancements in protein structure prediction through deep learning models, particularly highlighting AlphaFold 2 and RoseTTAFold, which have achieved significant accuracy improvements. However, it notes that these models exhibit vulnerabilities, as even minor biologically plausible perturbations can lead to substantial discrepancies in predicted structures. This issue is consistent with broader findings in machine learning, where models often interpolate well but struggle with extrapolation when faced with unfamiliar inputs. Recent iterations of AlphaFold and RoseTTAFold have adopted diffusion-based approaches to model diverse chemical structures, achieving notable results, while other models like Chai-1 and Boltz-1 have reached comparable accuracy levels.

The paper critiques the susceptibility of deep learning models to adversarial attacks and non-physical artifacts, such as steric clashes, emphasizing that their performance often relies on pocket-finding capabilities rather than detailed molecular interactions. Previous methods for generating adversarial examples involved computational searches for perturbations that significantly altered outputs, revealing accuracy cliffs in these networks. The authors propose a novel approach that generates adversarial examples grounded in physical, chemical, and biological principles, focusing on the modeling of protein-ligand complexes. By mutating binding site residues and modifying ligands to disrupt interactions, this method aims to provide greater interpretability and highlight the models’ limitations, such as their reliance on statistical correlations rather than a true understanding of physical processes.

Methods

In this study, we conducted a series of adversarial challenges to evaluate the physical robustness of four advanced co-folding models: AlphaFold3, RosettaFold All-Atom, Chai-1, and Boltz-1. Each model was assessed by generating examples with predetermined physical outcomes and comparing the predictions made by the models against these expectations. The selected models are recognized for their competitive performance across various benchmarks, yet their resilience to adversarial scenarios remains unexamined. For the experiments, all models were executed under default settings without imposing any additional constraints that could influence their performance.

Results

The “Results” section of the research paper presents the key findings derived from the conducted experiments or analyses. It highlights the significant outcomes that support the hypotheses or research questions posed in the study. The results are typically illustrated through various forms of data representation, such as tables, graphs, or statistical analyses, which provide a clear visual understanding of the findings.

The section may also include comparisons between experimental groups, statistical significance of the results, and any observed trends or patterns. Additionally, it may discuss the implications of these findings in relation to existing literature, emphasizing how they contribute to the broader field of study. Overall, the results are crucial for validating the research and guiding future investigations.

Discussion

The discussion section of the research paper evaluates the performance of co-folding models in predicting protein-ligand interactions, particularly under conditions of binding site mutagenesis and ligand modifications. The study reveals that while these models, including AlphaFold3 and RosettaFold All-Atom, demonstrate high accuracy in predicting ligand binding modes for wild-type structures, they exhibit significant biases when faced with intentional perturbations. Specifically, the models tend to maintain the original ligand binding geometry even after substantial alterations to the binding site or ligand structure, leading to unrealistic predictions characterized by steric clashes and unfavorable interactions. This suggests that the models primarily rely on learned patterns from training data rather than accurately capturing the underlying biophysical principles governing molecular recognition.

The findings indicate that the models struggle to adapt to changes in the binding site or ligand, often resulting in predictions that do not reflect the physical realities of the system. For instance, in challenges where binding site residues were mutated to glycine or phenylalanine, the models continued to place ligands in their original binding sites despite the absence of favorable interactions. Additionally, the study highlights that the models’ reliance on multiple-sequence alignments and template-based features may contribute to their inability to respond appropriately to mutations. The authors advocate for a cautious approach to using these models in critical applications such as drug discovery, emphasizing the need for supplementary validation through physics-based methods and a comprehensive evaluation pipeline to assess model robustness and accuracy. Overall, while co-folding models represent a significant advancement in biomolecular modeling, their limitations underscore the necessity for further methodological improvements and integration of physical principles to enhance their predictive capabilities.