توقعات AlphaFold للأشكال المتغيرة مدفوعة بتذكر الهيكل AlphaFold predictions of fold-switched conformations are driven by structure memorization

المجلة: Nature Communications، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41467-024-51801-z
PMID: https://pubmed.ncbi.nlm.nih.gov/39181864
تاريخ النشر: 2024-08-24
المؤلف: Devlina Chakravarty وآخرون
الموضوع الرئيسي: هيكل البروتين والديناميات

نظرة عامة

تبحث الدراسة في القدرات التنبؤية لـ AlphaFold (AF)، وخاصة في سياق بروتينات تغيير الطي، التي يمكن أن تتبنى أشكال هيكلية متعددة. تكشف الدراسة أن AF هو متنبئ ضعيف لتغيير الطي، حيث حقق معدل نجاح يبلغ 35% فقط للبروتينات التي من المحتمل أن تكون مدرجة في مجموعات التدريب الخاصة به. ومن الجدير بالذكر أن نجاحات النموذج تبدو ناتجة عن حفظ بيانات التدريب بدلاً من فهم قوي لطاقات البروتين. على الرغم من أخذ عينات واسعة من أكثر من 280,000 نموذج، فشل AF2 في التنبؤ بدقة بالهياكل المؤكدة تجريبياً لتغيير الطي خارج بيانات التدريب الخاصة به، حيث تم التقاط واحدة فقط من بين سبع حالات من هذا القبيل. بالإضافة إلى ذلك، لم تكن مقاييس الثقة لـ AF2 فعالة في التمييز بين التشكيلات ذات الطاقة المنخفضة والعالية، مما يشير إلى قيود كبيرة في نطاقه التنبؤي.

تؤكد النتائج على ضرورة وجود طرق أكثر استناداً إلى الفيزياء قادرة على التنبؤ بتشكيلات بروتين متعددة. بينما أظهرت خوارزميات التعلم العميق مثل AF2 وخليفته AF3 دقة ملحوظة في التنبؤ بهياكل البروتين من التسلسلات، تشير قيودها في التقاط سلوكيات البروتين المعقدة إلى أنها قد تعتمد بشكل أساسي على التعرف على الأنماط بدلاً من فهم عميق لفيزياء الطي. يثير هذا تساؤلات حول قدرة AF على نمذجة المناظر الطاقية المعقدة للبروتينات المطوية، والتي تتميز بتنوع وتكرار تشكيلاتها.

الطرق

تحدد قسم الطرق في ورقة البحث التصميم التجريبي والتقنيات التحليلية المستخدمة للتحقيق في سؤال البحث. يتناول اختيار المشاركين، بما في ذلك معايير الإدراج والاستبعاد، بالإضافة إلى حسابات حجم العينة لضمان القوة الإحصائية. استخدمت الدراسة تصميم تجربة عشوائية محكومة، حيث تم تخصيص المشاركين إما إلى المجموعة التجريبية أو المجموعة الضابطة.

تُوصف طرق جمع البيانات، بما في ذلك استخدام استبيانات موثقة وقياسات معيارية لتقييم النتائج الرئيسية. تم إجراء التحليلات الإحصائية باستخدام برامج مناسبة، مع إيلاء اهتمام خاص لافتراضات الاختبارات المستخدمة. يسلط القسم الضوء أيضًا على الطرق المستخدمة للتعامل مع البيانات المفقودة والنهج المتبع لتحليلات الحساسية للتحقق من قوة النتائج. بشكل عام، تم تصميم المنهجية لضمان موثوقية وصلاحية النتائج المستخلصة من الدراسة.

النتائج

في هذا القسم، يقيم المؤلفون أداء AlphaFold (AF) في التنبؤ بتشكيلات بروتينات تغيير الطي، مع التركيز بشكل خاص على قدرة AF2.3.1 و AF3 على نمذجة هذه البروتينات بدقة استنادًا إلى مجموعة بيانات من 92 بروتينًا تم تحديدها تجريبيًا. تشير النتائج إلى أن AF2.3.1 تنبأت بنجاح بكلا التشكيلتين لـ 11 من 92 بروتينًا لتغيير الطي، بينما كان أداء AF3 أقل، حيث التقط 7 فقط. ومن الجدير بالذكر أن AF2_multimer أظهر أداءً محسنًا من خلال التنبؤ بكلا التشكيلتين لـ 12 من 92 بروتينًا. كما تسلط الدراسة الضوء على أن طرق أخذ العينات المحسنة مثل SPEACH_AF و AF-cluster، على الرغم من توليد عدد كبير من الهياكل، فشلت في التنبؤ بتغيير الطي بشكل متسق، بمعدلات نجاح بلغت 8% و 20% على التوالي.

يحقق المؤلفون أيضًا في قيود تنبؤات AF، وخاصة مقاييس الثقة الخاصة به (plDDT و pTM scores)، التي لم تميز بشكل فعال بين التنبؤات الدقيقة وغير الدقيقة لبروتينات تغيير الطي. وجدوا أن تنبؤات AF غالبًا ما تفضل الهياكل غير المتوافقة مع البيانات التجريبية، مما يشير إلى ميل نحو حفظ هياكل مجموعة التدريب بدلاً من الاعتماد على المعلومات التعاونية. بالإضافة إلى ذلك، عند تقييم بروتينات تغيير الطي خارج مجموعة تدريب AF، فشلت النماذج في التنبؤ بالتشكيلات البديلة، مما يثير القلق بشأن موثوقية AF في التنبؤ بهياكل البروتين الجديدة. بشكل عام، تؤكد النتائج على التحديات التي تواجه AF في نمذجة الطبيعة الديناميكية لبروتينات تغيير الطي وقيود المنهجيات الحالية في التقاط تنوعها التشكلي.

المناقشة

تسلط المناقشة الضوء على قيود نماذج AlphaFold (AF)، وخاصة AF2 و AF3، في التنبؤ بدقة بالتشكيل الثنائي للبروتين المناعي XCL1 وبروتينات تغيير الطي الأخرى. بينما يتنبأ AF2 بنجاح بالهيكل الثنائي لـ XCL1 استنادًا إلى حفظ التسلسل بدلاً من المعلومات التعاونية، يفشل AF3 في ذلك، حيث يتنبأ بهيكل غير صحيح متبادل النطاق. تشير الدراسة إلى أن كلا النموذجين AF يواجهان صعوبات مع بروتينات تغيير الطي، حيث حققا معدل نجاح يبلغ 35% فقط في التنبؤ بالتشكيلات الصحيحة من مجموعة بيانات تضم 92 بروتينًا، مع معدلات نجاح أقل بكثير لأولئك خارج مجموعة تدريبهم. يشير هذا إلى أن نماذج AF قد تعتمد بشكل كبير على الهياكل المحفوظة من بيانات التدريب الخاصة بها، بدلاً من فهم قوي لمناظر طاقة البروتين.

علاوة على ذلك، تكشف النتائج أن نماذج AF تظهر معدلات إيجابية خاطئة عالية، بمتوسط 43%، مما يعقد التحقق من التنبؤات. يجادل المؤلفون بأن إخفاقات AF التنبؤية تنبع من ارتباطات غير كافية بين التسلسل والهيكل، خاصة بالنسبة لبروتينات تغيير الطي. يقترحون أن تحسين قدرة النماذج على استخدام الإشارات التعاونية من محاذاة تسلسلات متعددة (MSAs) ذات أعماق متباينة قد يحسن دقة التنبؤ. بشكل عام، تؤكد الدراسة على الحاجة إلى تقنيات أخذ عينات أكثر تطورًا وفهم أعمق لديناميات البروتين لتعزيز القدرات التنبؤية لنماذج AF في تحديد بروتينات تغيير الطي الجديدة.

Journal: Nature Communications, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41467-024-51801-z
PMID: https://pubmed.ncbi.nlm.nih.gov/39181864
Publication Date: 2024-08-24
Author(s): Devlina Chakravarty et al.
Primary Topic: Protein Structure and Dynamics

Overview

The research investigates the predictive capabilities of AlphaFold (AF), particularly in the context of fold-switching proteins, which can adopt multiple structural forms. The study reveals that AF is a weak predictor of fold switching, achieving only a 35% success rate for proteins likely included in its training sets. Notably, the model’s successes appear to stem from memorization of training data rather than a robust understanding of protein energetics. Despite extensive sampling of over 280,000 models, AF2 failed to accurately predict experimentally confirmed fold-switching structures outside its training data, capturing only one out of seven such instances. Additionally, AF2’s confidence metrics did not effectively differentiate between low and high energy conformations, indicating significant limitations in its predictive scope.

The findings underscore the necessity for more physically grounded methods capable of predicting multiple protein conformations. While deep learning algorithms like AF2 and its successor AF3 have demonstrated remarkable accuracy in predicting protein structures from sequences, their limitations in capturing complex protein behaviors suggest that they may primarily rely on pattern recognition rather than a deep understanding of folding physics. This raises questions about the potential of AF to accurately model the intricate energy landscapes of folded proteins, which are characterized by the diversity and frequency of their conformations.

Methods

The Methods section of the research paper outlines the experimental design and analytical techniques employed to investigate the research question. It details the selection of participants, including inclusion and exclusion criteria, as well as the sample size calculations to ensure statistical power. The study utilized a randomized controlled trial design, with participants assigned to either the experimental group or the control group.

Data collection methods are described, including the use of validated questionnaires and standardized measures to assess the primary outcomes. Statistical analyses were performed using appropriate software, with specific attention given to the assumptions of the tests employed. The section also highlights the methods for handling missing data and the approach taken for sensitivity analyses to validate the robustness of the findings. Overall, the methodology is designed to ensure the reliability and validity of the results obtained from the study.

Results

In this section, the authors evaluate the performance of AlphaFold (AF) in predicting the conformations of fold-switching proteins, specifically focusing on the ability of AF2.3.1 and AF3 to accurately model these proteins based on a dataset of 92 experimentally determined fold switchers. The results indicate that AF2.3.1 successfully predicted both conformations for 11 out of 92 fold switchers, while AF3 underperformed, capturing only 7. Notably, AF2_multimer showed improved performance by predicting both conformations for 12 of the 92 proteins. The study also highlights that enhanced sampling methods like SPEACH_AF and AF-cluster, while generating a large number of structures, failed to consistently predict fold switching, with success rates of only 8% and 20%, respectively.

The authors further investigate the limitations of AF’s predictions, particularly its confidence metrics (plDDT and pTM scores), which did not effectively discriminate between accurate and inaccurate predictions for fold switchers. They found that AF’s predictions often favored structures inconsistent with experimental data, suggesting a tendency towards memorization of training set structures rather than reliance on coevolutionary information. Additionally, when assessing fold switchers outside of AF’s training set, the models failed to predict alternative conformations, raising concerns about AF’s reliability in predicting novel protein structures. Overall, the findings underscore the challenges faced by AF in accurately modeling the dynamic nature of fold-switching proteins and the limitations of current methodologies in capturing their conformational diversity.

Discussion

The discussion highlights the limitations of AlphaFold (AF) models, particularly AF2 and AF3, in accurately predicting the dimeric conformation of the immune protein XCL1 and other fold-switching proteins. While AF2 successfully predicts the dimeric structure of XCL1 based on sequence memorization rather than coevolutionary information, AF3 fails to do so, predicting an incorrect domain-swapped structure. The study indicates that both AF models struggle with fold-switching proteins, achieving only a 35% success rate in predicting correct conformations from a dataset of 92 proteins, with even lower success rates for those outside their training set. This suggests that AF models may rely heavily on memorized structures from their training data, rather than a robust understanding of protein energy landscapes.

Furthermore, the findings reveal that AF models exhibit high false positive rates, with an average of 43%, complicating the validation of predictions. The authors argue that AF’s predictive failures stem from inadequate associations between sequence and structure, particularly for fold-switching proteins. They propose that enhancing the models’ ability to utilize coevolutionary signals from multiple sequence alignments (MSAs) of varying depths could improve prediction accuracy. Overall, the study underscores the need for more sophisticated sampling techniques and a deeper understanding of protein dynamics to enhance the predictive capabilities of AF models in identifying novel fold-switching proteins.