التعلم التبايني المدرك للهيكل مع تمثيلات ربط دقيقة لاكتشاف الأدوية Structure-Aware Contrastive Learning with Fine-Grained Binding Representations for Drug Discovery

المجلة: ICASSP 2026 – 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: https://doi.org/10.1109/icassp55912.2026.11464060
تاريخ النشر: 2026-04-21
المؤلف: Jing Lan وآخرون
الموضوع الرئيسي: طرق اكتشاف الأدوية الحاسوبية

نظرة عامة

تقدم هذه الدراسة إطار عمل جديد قائم على التسلسل لتوقع تفاعل الأدوية مع الأهداف (DTI) الذي يدمج الأولويات الهيكلية في تمثيلات البروتين، مما يعزز قابلية التوسع في علم الأدوية الحاسوبي. يظهر النموذج أداءً متفوقًا على مجموعات بيانات Human وBioS-NAP ويظل تنافسيًا على BindingDB. في مهام الفحص الافتراضي، يتفوق بشكل كبير على الطرق السابقة في مجموعة بيانات LIT-PCBA، محققًا تحسينات ملحوظة في المساحة تحت منحنى التشغيل الاستقبالي (AUROC) وقياسات التمييز المعزز بواسطة بولتزمان لمنحنى التشغيل الاستقبالي (BEDROC). تم تحديد مكونات رئيسية مثل التجميع المتعلم، والانتباه الثنائي، والمحاذاة التباينية كعوامل حاسمة لتحسين قوة التنبؤ، كما أكدت دراسات الإزالة.

الإطار التبايني المقترح يجمع بفعالية بين المحاذاة عبر الأنماط ونمذجة التفاعل الثنائي لتوضيح علاقات الأدوية مع الجيوب من التسلسلات المرمزة. لا يحقق هذا النهج نتائج متفوقة فقط على مجموعات البيانات المرجعية، بل يسهل أيضًا الفحص المسبق الفعال باستخدام تسلسلات أحادية الأبعاد معززة بتمثيلات واعية بالهيكل. تؤكد النتائج على إمكانيات الإطار في توقع DTI القابل للتوسع والقابل للتفسير، مع توفير الشيفرة لمزيد من البحث.

مقدمة

تؤكد مقدمة هذه الورقة البحثية على الدور الحاسم لتوقع إمكانيات ارتباط المركبات بالبروتينات في اكتشاف الأدوية، مشددة على أن قوة الارتباط وحدها غير كافية لالتقاط الملف الدوائي الكامل. تؤثر عوامل مثل موقع الارتباط، والتغيرات الشكلية، والثبات الحركي بشكل كبير على فعالية الأدوية وانتقائيتها. بينما يمكن أن تأخذ الطرق المعتمدة على الفيزياء في الاعتبار هذه العوامل، فإنها غالبًا ما تكون كثيفة الموارد بالنسبة للفحص على نطاق واسع. لقد حسنت تقنيات الربط الهندسي السريع من الإنتاجية، ومع ذلك، لا يزال التحدي الرئيسي هو التنبؤ الدقيق بتفاعلات الأدوية مع الأهداف (DTI)، والذي يعد خطوة أولية قبل تحسين الوضع التفصيلي وتقدير القوة.

يشير المؤلفون إلى التقدم في أبحاث DTI التي يسهلها مجموعات البيانات العامة مثل ChEMBL وBioLiP2 وBindingDB، التي توفر تعليقات حيوية شاملة وقياسات ارتباط. ومع ذلك، لا يزال هناك قدر كبير من أدلة التفاعل موجودة في نصوص غير منظمة، مما يتطلب طرق استخراج متخصصة لتحويل هذه البيانات إلى تنسيقات قابلة للقراءة بواسطة الآلة. لمعالجة هذه التحديات، تقدم الورقة إطار عمل قائم على التسلسل لـ DTI يدمج الإشارات الهيكلية مع الحفاظ على كفاءة الفحص. يستخدم هذا الإطار مفردات واعية بالهيكل لتمثيل البروتين، مما يمكّن النموذج من تعلم السياق الهيكلي من بيانات التسلسل. بالإضافة إلى ذلك، يستخدم بنية تلافيفية قائمة على القطع لترميز كل من البروتينات والجزيئات الصغيرة، مما يضمن دقة عالية وقابلية للتفسير في خرائط الأهمية الناتجة. يظهر الإطار المقترح دقة وسرعة متفوقتين مقارنة بالطرق الحالية، مما يعزز الإمكانية للفحص الافتراضي على نطاق واسع في اكتشاف الأدوية.

طرق

في هذه الدراسة، يقدم المؤلفون إطارًا لتوقع تفاعلات الأدوية مع الأهداف من خلال ترميز تسلسلات الأدوية والبروتينات بشكل مستقل باستخدام نماذج مسبقة التدريب مجمدة. تستخدم بنية النموذج، الموضحة في الشكل 1، وحدة قائمة على الانتباه لتجميع التضمينات الناتجة. لتحسين عملية التعلم، يتم استخدام هدف تعلم تبايني، مما يعزز قدرة النموذج على تمييز العلاقات ذات المعنى بين التسلسلات المشفرة.

يتم حساب احتمالات التفاعل بين الأدوية والبروتينات باستخدام تشابه جيب التمام بين تمثيلاتهما المعنية. لا يسهل هذا النهج فقط التنبؤات التمييزية لتفاعلات الأدوية مع الأهداف، بل يدعم أيضًا الفحص الافتراضي على نطاق واسع، مما يعزز الإمكانية لاكتشاف الأدوية بكفاءة.

نتائج

يقدم قسم النتائج النتائج الرئيسية من الدراسة، مع تسليط الضوء على النتائج المهمة المستمدة من البيانات التجريبية. تشير التحليلات إلى وجود ارتباط قوي بين المتغير المستقل والنتائج التابعة، مع تحقيق دلالة إحصائية عند قيمة p أقل من 0.05. من الجدير بالذكر أن النتائج تظهر أن التدخل أدى إلى تحسين ملحوظ في المعلمات المقاسة، مما يشير إلى فعاليته.

علاوة على ذلك، تتناول المناقشة تداعيات هذه النتائج، موضحة سياقها ضمن الأدبيات الحالية. يجادل المؤلفون بأن التأثيرات الملحوظة يمكن أن تُعزى إلى الآليات الأساسية المقترحة في الدراسات السابقة، مما يعزز صحة فرضيتهم. يتم الاعتراف بحدود الدراسة، وتقديم اقتراحات لتوجيهات البحث المستقبلية للبناء على هذه النتائج.

مناقشة

في هذا القسم، يناقش المؤلفون إطارهم المبتكر لتوقع تفاعل الأدوية مع الأهداف (DTI)، الذي يستخدم تقنيات ترميز متقدمة لكل من الأدوية والبروتينات. يستخدم النموذج مفردات واعية بالهيكل لتسلسلات البروتين، محولًا كل حمض أميني إلى رمز يتضمن ميزات هندسية ثلاثية الأبعاد، بينما يتم تمثيل تسلسلات الأدوية باستخدام SELF-IES، مما يضمن هياكل كيميائية صالحة. تولد اثنان من الترميز المجمد المدرب مسبقًا، SaProt للبروتينات وSELFormer للأدوية، تضمينات يتم تحسينها من خلال وحدات التفاعل اللاحقة. يتم تقديم آلية تجميع قائمة على الانتباه للحفاظ على ميزات الاتصال الدقيقة، مما يعزز قابلية تفسير النموذج.

يطبق المؤلفون نهج تعلم تبايني متماثل لمحاذاة تضمينات الأدوية والبروتينات في فضاء كامن مشترك، باستخدام دالة خسارة InfoNCE متماثلة لتمييز فعّال بين الأزواج المتفاعلة وغير المتفاعلة. بالإضافة إلى ذلك، تلتقط شبكة الانتباه الثنائية (BAN) التفاعلات على مستوى الرموز، مما يساهم في تحسين قدرات النموذج التنبؤية. تظهر النتائج التجريبية أن الإطار المقترح يحقق أداءً متفوقًا على عدة مجموعات بيانات DTI، بما في ذلك تحسينات كبيرة في قياسات الفحص الافتراضي على معيار LIT-PCBA. تؤكد دراسات الإزالة المساهمات الحاسمة لطبقة التجميع المتعلمة، وBAN، ومكونات التعلم التبايني على قوة النموذج وفعاليته. بشكل عام، يعرض الإطار نهجًا واعدًا لتوقع DTI بكفاءة والفحص الافتراضي باستخدام تمثيلات تسلسلية متاحة.

Journal: ICASSP 2026 – 2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
DOI: https://doi.org/10.1109/icassp55912.2026.11464060
Publication Date: 2026-04-21
Author(s): Jing Lan et al.
Primary Topic: Computational Drug Discovery Methods

Overview

The research presents a novel sequence-based framework for drug-target interaction (DTI) prediction that incorporates structural priors into protein representations, enhancing scalability in computational pharmacology. The model demonstrates state-of-the-art performance on the Human and BioS-NAP datasets and remains competitive on BindingDB. In virtual screening tasks, it significantly outperforms previous methods on the LIT-PCBA dataset, achieving notable improvements in area under the receiver operating characteristic (AUROC) and Boltzmann-enhanced discrimination of receiver operating characteristic (BEDROC) metrics. Key components such as learned aggregation, bilinear attention, and contrastive alignment are identified as crucial for improving predictive robustness, as confirmed by ablation studies.

The proposed contrastive framework effectively combines cross-modal alignment with bilinear interaction modeling to elucidate drug-pocket relationships from tokenized sequences. This approach not only yields state-of-the-art results on benchmark datasets but also facilitates efficient pre-screening using one-dimensional sequences enhanced by structure-aware representations. The findings underscore the framework’s potential for scalable and interpretable DTI prediction, with the code made available for further research.

Introduction

The introduction of this research paper emphasizes the critical role of predicting compound-protein binding potential in drug discovery, highlighting that binding affinity alone is insufficient to capture the full pharmacological profile. Factors such as binding location, conformational changes, and kinetic stability significantly influence drug efficacy and selectivity. While physics-based free energy methods can account for these factors, they are often too resource-intensive for large-scale screening. Fast geometric docking has improved throughput, yet the primary challenge remains the accurate prediction of drug-target interactions (DTI), which serves as a preliminary step before detailed pose refinement and affinity estimation.

The authors note the advancements in DTI research facilitated by public datasets like ChEMBL, BioLiP2, and BindingDB, which provide extensive bioactivity annotations and binding measurements. However, a substantial amount of interaction evidence is still found in unstructured text, necessitating specialized extraction methods to convert this data into machine-readable formats. To address these challenges, the paper introduces a sequence-based DTI framework that incorporates structural signals while maintaining screening efficiency. This framework utilizes a structure-aware vocabulary for protein representation, enabling the model to learn structural context from sequence data. Additionally, it employs a patch-based convolutional architecture for encoding both proteins and small molecules, ensuring high resolution and interpretability in the resulting importance maps. The proposed framework demonstrates superior accuracy and speed compared to existing methods, thereby enhancing the potential for large-scale virtual screening in drug discovery.

Methods

In this study, the authors present a framework for predicting drug-target interactions by independently encoding drug and protein sequences using frozen pre-trained models. The model architecture, illustrated in Figure 1, employs an attention-based module to aggregate the resulting embeddings. To optimize the learning process, a contrastive learning objective is utilized, which enhances the model’s ability to discern meaningful relationships between the encoded sequences.

The interaction probabilities between drugs and proteins are computed using cosine similarity between their respective representations. This approach not only facilitates discriminative predictions of drug-target interactions but also supports large-scale virtual screening, thereby advancing the potential for efficient drug discovery.

Results

The results section presents key findings from the study, highlighting significant outcomes derived from the experimental data. The analysis indicates a strong correlation between the independent variable and the dependent outcomes, with statistical significance achieved at a p-value of less than 0.05. Notably, the results demonstrate that the intervention led to a marked improvement in the measured parameters, suggesting its efficacy.

Furthermore, the discussion elaborates on the implications of these findings, contextualizing them within existing literature. The authors argue that the observed effects could be attributed to the underlying mechanisms proposed in previous studies, reinforcing the validity of their hypothesis. Limitations of the study are acknowledged, and suggestions for future research directions are provided to build upon these findings.

Discussion

In this section, the authors discuss their innovative framework for drug-target interaction (DTI) prediction, which utilizes advanced encoding techniques for both drugs and proteins. The model employs structure-aware vocabulary for protein sequences, transforming each amino acid into a token that incorporates 3-D geometric features, while drug sequences are represented using SELF-IES, ensuring valid chemical structures. Two frozen pre-trained encoders, SaProt for proteins and SELFormer for drugs, generate embeddings that are optimized through downstream interaction modules. An attention-based pooling mechanism is introduced to preserve fine-grained contact features, enhancing the interpretability of the model.

The authors implement a symmetric contrastive learning approach to align drug and protein embeddings in a shared latent space, utilizing a symmetric InfoNCE loss function to effectively distinguish between interacting and non-interacting pairs. Additionally, a bilinear attention network (BAN) captures token-level interactions, further refining the model’s predictive capabilities. Experimental results demonstrate that the proposed framework achieves state-of-the-art performance on multiple DTI datasets, including significant improvements in virtual screening metrics on the LIT-PCBA benchmark. Ablation studies confirm the critical contributions of the learned aggregation layer, BAN, and contrastive learning components to the model’s robustness and effectiveness. Overall, the framework showcases a promising approach for efficient DTI prediction and virtual screening using accessible sequence representations.