الذكاء الاصطناعي التوليدي في اكتشاف الأدوية: الإطار الأساسي، التقدمات الحديثة، التحديات، والفرص Generative artificial intelligence in drug discovery: basic framework, recent advances, challenges, and opportunities

المجلة: Frontiers in Pharmacology، المجلد: 15
DOI: https://doi.org/10.3389/fphar.2024.1331062
PMID: https://pubmed.ncbi.nlm.nih.gov/38384298
تاريخ النشر: 2024-02-07
المؤلف: Amit Gangwal وآخرون
الموضوع الرئيسي: طرق اكتشاف الأدوية الحاسوبية

نظرة عامة

تقدم هذه القسم نظرة عامة على منهجيتين رئيسيتين لاكتشاف أو تصميم جزيئات الأدوية الصغيرة: تحسين المركبات الموجودة من خلال علاقات الهيكل-النشاط الكمية والفحص الافتراضي، وتوليد جزيئات جديدة عبر تصميم الأدوية de novo أو علاقات الهيكل-النشاط العكسية الكمية. تهدف كلا الطريقتين إلى تحسين الخصائص الدوائية والدوائية الحيوية، ومع ذلك، تظل عملية تطوير الأدوية مكلفة وطويلة، بمتوسط حوالي 2.5 مليار دولار. تعتبر الفحص الشامل المطلوب لتحديد المرشحين الآمنين والفعالين عقبة كبيرة في هذه العملية.

لقد حول ظهور الذكاء الاصطناعي (AI) مجالات مختلفة، بما في ذلك علوم الأدوية، وخاصة في اكتشاف الأدوية. تشمل تطبيقات الذكاء الاصطناعي الآن توقع خصائص الجزيئات، وتوليد الجزيئات، والفحص الافتراضي، وتخطيط التخليق، وإعادة استخدام الأدوية. ومن الجدير بالذكر أن الذكاء الاصطناعي التوليدي قد ظهر كأداة قوية قادرة على إنشاء بيانات جديدة، بما في ذلك الجزيئات الكيميائية، وقد أظهرت نتائج واعدة في اكتشاف الأدوية. تستعرض هذه المقالة مراجعة الأسس والأطر لنماذج الذكاء الاصطناعي التوليدي المختلفة في سياق اكتشاف الأدوية، مع التركيز على نهج تصميم الأدوية de novo. تناقش النماذج الأساسية والمتقدمة، وتطبيقاتها الحديثة، وتسلط الضوء على التحديات المستمرة والجهود للاستفادة من الذكاء الاصطناعي التوليدي لتوليد جزيئات أدوية جديدة بشكل فعال وبتكلفة معقولة.

مقدمة

تسلط مقدمة هذه الورقة البحثية الضوء على التحديات وعدم الكفاءة في عملية تطوير الأدوية التقليدية، التي تتميز بتكاليف عالية ومعدلات نجاح منخفضة. تؤكد على الإمكانات التحويلية للذكاء الاصطناعي (AI)، وخاصة التعلم العميق (DL) والذكاء الاصطناعي التوليدي (GAI)، في تعزيز مختلف مراحل اكتشاف الأدوية. تظهر التطورات الأخيرة، مثل AlphaFold من DeepMind، قدرة الذكاء الاصطناعي على حل المشكلات البيولوجية المعقدة، مما يسرع البحث والتطوير في مجال الأدوية. تشير الورقة إلى أن شركات الأدوية تتعاون بشكل متزايد مع الشركات الناشئة في مجال الذكاء الاصطناعي وتدمج منهجيات الذكاء الاصطناعي لتبسيط اكتشاف الأدوية، من اختيار الأهداف إلى تحديد المشاركين في التجارب السريرية.

تتوسع هذه القسم في دور GAI في توليد هياكل جزيئية جديدة، وهي عملية تعرف باسم توليد الجزيئات de novo. تناقش مختلف أطر GAI وتطبيقاتها، بما في ذلك القدرة على توقع خصائص الجزيئات وتصميم مركبات جديدة بخصائص مرغوبة. تشمل الأمثلة البارزة نموذج GAI GENTRL الذي طورته Insilico Medicine، والذي نجح في تحديد مثبطات كيناز جديدة في فترة زمنية قصيرة بشكل ملحوظ. على الرغم من القدرات الواعدة لـ GAI، تعترف الورقة بالتحديات الكبيرة، مثل الحاجة إلى مجموعات بيانات تدريب واسعة وتعقيد الأنظمة البيولوجية، والتي قد تعيق قابلية تعميم هذه النماذج. تمهد المقدمة الطريق لمراجعة شاملة لتطبيقات GAI في تطوير الأدوية، بهدف استكشاف كل من النجاحات والتحديات المستمرة في هذا المجال.

نقاش

في مناقشة نماذج الذكاء الاصطناعي التوليدي (GAI)، تصنف الورقة هذه الأدوات إلى نماذج غير ذاتية الانحدار ونماذج ذاتية الانحدار، مع تسليط الضوء على تطبيقاتها في اكتشاف الأدوية. تولد النماذج غير ذاتية الانحدار، مثل الشبكات التنافسية التوليدية (GANs) والمشفّرات التلقائية التغيرية (VAEs)، البيانات في وقت واحد، بينما تقوم النماذج ذاتية الانحدار، التي تمثلها الشبكات العصبية المتكررة (RNNs)، بإنشاء البيانات بشكل تكراري. لقد اكتسبت VAEs شهرة في تطوير الأدوية بسبب قدرتها على إنشاء مساحة كامنة مستمرة، مما يسمح بتوليد جزيئات شبيهة بالأدوية جديدة من خلال أخذ عينات من هذه المساحة. تؤكد الورقة على فعالية VAEs في إنتاج هياكل جزيئية واقعية مع معالجة التحديات مثل الإفراط في التكيف والاستمرارية في التمثيلات الكامنة.

تغطي المناقشة أيضًا التقدم في GANs، التي تستخدم التدريب التنافسي لتوليد مركبات اصطناعية بخصائص مرغوبة. على الرغم من إمكانياتها، تواجه GANs مشكلات مثل انهيار الوضع وعدم استقرار التدريب، مما يجعلها أقل تفضيلًا مقارنة بـ VAEs لبعض التطبيقات. بالإضافة إلى ذلك، تقدم الورقة نماذج قائمة على التدفق ونماذج التوليد الانتشاري، التي تقدم طرقًا بديلة لتوليد الجزيئات، مع التأكيد على قدرتها على نمذجة التوزيعات المعقدة وتعزيز عمليات اكتشاف الأدوية. يتم تسليط الضوء على الشبكات العصبية الرسومية (GNNs) لقدرتها على معالجة الرسوم البيانية الجزيئية، مما يمكّن من مهام مثل توقع الخصائص والفحص الافتراضي، على الرغم من أن التحديات مثل الإفراط في التنعيم وقيود التمثيل لا تزال قائمة. بشكل عام، تؤكد الورقة على المشهد المتطور لنماذج GAI في اكتشاف الأدوية، داعية إلى استمرار البحث وتحسين هذه المنهجيات.

Journal: Frontiers in Pharmacology, Volume: 15
DOI: https://doi.org/10.3389/fphar.2024.1331062
PMID: https://pubmed.ncbi.nlm.nih.gov/38384298
Publication Date: 2024-02-07
Author(s): Amit Gangwal et al.
Primary Topic: Computational Drug Discovery Methods

Overview

This section provides an overview of two primary methodologies for discovering or designing small drug molecules: the refinement of existing compounds through quantitative structure-activity relationships and virtual screening, and the generation of new molecules via de novo drug design or inverse quantitative structure-activity relationships. Both approaches aim to optimize pharmacokinetic and pharmacodynamic properties, yet the drug development process remains costly and lengthy, averaging around $2.5 billion. A significant hurdle in this process is the extensive screening required to identify safe and effective candidates.

The advent of artificial intelligence (AI) has transformed various fields, including pharmaceutical sciences, particularly in drug discovery. AI applications now encompass molecular property prediction, molecule generation, virtual screening, synthesis planning, and drug repurposing. Notably, generative artificial intelligence has emerged as a powerful tool capable of creating novel data, including chemical molecules, and has shown promising results in drug discovery. This review article examines the fundamentals and frameworks of various generative AI models within the context of drug discovery, focusing on the de novo drug design approach. It discusses both basic and advanced models, their recent applications, and highlights ongoing challenges and efforts to leverage generative AI for the efficient and cost-effective generation of new drug molecules.

Introduction

The introduction of this research paper highlights the challenges and inefficiencies in the traditional drug development process, characterized by high costs and low success rates. It emphasizes the transformative potential of artificial intelligence (AI), particularly deep learning (DL) and generative AI (GAI), in enhancing various stages of drug discovery. Recent advancements, such as DeepMind’s AlphaFold, demonstrate AI’s capability to solve complex biological problems, thereby accelerating research and development in pharmaceuticals. The paper notes that pharmaceutical companies are increasingly collaborating with AI startups and integrating AI methodologies to streamline drug discovery, from target selection to clinical trial participant identification.

The section further elaborates on the role of GAI in generating novel molecular structures, a process known as de novo molecular generation. It discusses various GAI frameworks and their applications, including the ability to predict molecular properties and design new compounds with desired characteristics. Notable examples include the GAI model GENTRL developed by Insilico Medicine, which successfully identified novel kinase inhibitors in a remarkably short timeframe. Despite the promising capabilities of GAI, the paper acknowledges significant challenges, such as the need for extensive training datasets and the complexity of biological systems, which may hinder the generalizability of these models. The introduction sets the stage for a comprehensive review of GAI applications in drug development, aiming to explore both successes and ongoing challenges in the field.

Discussion

In the discussion of generative AI (GAI) models, the paper categorizes these tools into non-autoregressive and autoregressive models, highlighting their applications in drug discovery. Non-autoregressive models, such as generative adversarial networks (GANs) and variational autoencoders (VAEs), generate data simultaneously, while autoregressive models, exemplified by recurrent neural networks (RNNs), construct data iteratively. VAEs have gained prominence in drug development due to their ability to create a continuous latent space, allowing for the generation of novel drug-like molecules by sampling from this space. The paper emphasizes the effectiveness of VAEs in producing realistic molecular structures while addressing challenges like overfitting and continuity in latent representations.

The discussion also covers the advancements in GANs, which utilize adversarial training to generate synthetic compounds with desired characteristics. Despite their potential, GANs face issues such as mode collapse and training instability, making them less favorable compared to VAEs for certain applications. Additionally, the paper introduces flow-based models and diffusion generative models, which offer alternative approaches to molecular generation, emphasizing their ability to model complex distributions and enhance drug discovery processes. Graph neural networks (GNNs) are highlighted for their capacity to process molecular graphs, enabling tasks such as property prediction and virtual screening, although challenges like over-smoothing and representation limitations remain. Overall, the paper underscores the evolving landscape of GAI models in drug discovery, advocating for continued research and refinement of these methodologies.