تجميع الجينوم من طرف التيلومير إلى طرف التيلومير مقترنًا ببيانات متعددة الأوميك يوفر رؤى حول تطور القمح الخبزي سداسي الصيغة الصبغية A telomere-to-telomere genome assembly coupled with multi-omic data provides insights into the evolution of hexaploid bread wheat

المجلة: Nature Genetics، المجلد: 57، العدد: 4
DOI: https://doi.org/10.1038/s41588-025-02137-x
PMID: https://pubmed.ncbi.nlm.nih.gov/40195562
تاريخ النشر: 2025-04-01
المؤلف: Shoucheng Liu وآخرون
الموضوع الرئيسي: التvariations الكروموسومية والجينية

نظرة عامة

يقدم القسم تحليلًا مقارنًا لتجميعات جينوم القمح CS-IAAS و CS RefSeq (الإصدار 2.1)، مع تسليط الضوء على الفروق المهمة في إحصائيات جينومية مختلفة. يتكون تجميع CS-IAAS من 14.51 مليار قاعدة مثبتة، متجاوزًا 14.01 مليار قاعدة في CS RefSeq بنسبة 3.57%. ومن الجدير بالذكر أن CS-IAAS يظهر انخفاضًا ملحوظًا في الفجوات (0 مقارنة بـ 183,603 في CS RefSeq) وانخفاضًا كبيرًا في عدد القطع (21 مقابل 183,624)، مما يؤدي إلى N50 أعلى بكثير للقطع يبلغ 723.78 Mbp مقارنة بـ 0.35 Mbp لـ CS RefSeq.

فيما يتعلق بتعليق الجينات، يحتوي CS-IAAS على 141,035 جينًا مشفرًا للبروتينات ذات الثقة العالية، وهو زيادة بنسبة 24.19% مقارنة بـ 106,915 جينًا في CS RefSeq. على العكس من ذلك، فإن الجينات المشفرة للبروتينات ذات الثقة المنخفضة أقل في CS-IAAS (109,932) مقارنة بـ CS RefSeq (159,846)، مما يشير إلى انخفاض بنسبة 45.40%. كما يكشف التحليل عن اختلافات في المجالات الهيكلية (SDs)، حيث يظهر CS-IAAS نسبة أعلى من SDs (64.84%) وقواعد SD (9.41 Gbp) مقارنة بـ CS RefSeq (62.37% و 8.87 Gbp، على التوالي). ومع ذلك، تشير مقاييس SD الأساسية والجوهرية إلى انخفاض في كل من الطول والعدد في CS-IAAS مقارنة بـ CS RefSeq، مما يشير إلى هيكل جينومي أكثر دقة في الأول.

النتائج

يقدم قسم “النتائج” في ورقة البحث النتائج الرئيسية المستمدة من التجارب أو التحليلات التي تم إجراؤها. عادةً ما يتضمن بيانات كمية، وتحليلات إحصائية، وتمثيلات بصرية مثل الرسوم البيانية أو الجداول لتوضيح النتائج. غالبًا ما تتم مقارنة النتائج مع الفرضيات أو الدراسات السابقة لتسليط الضوء على الفروق المهمة أو التأكيدات.

في هذا القسم، قد يبلغ المؤلفون عن مقاييس محددة، مثل المتوسطات، والانحرافات المعيارية، أو قيم p، لدعم مطالباتهم. بالإضافة إلى ذلك، يتم مناقشة أي اتجاهات أو أنماط ملحوظة في البيانات، مما يوفر رؤى حول تداعيات النتائج. بشكل عام، تعتبر النتائج أساسًا للنقاشات اللاحقة والاستنتاجات المستخلصة في الورقة.

المناقشة

تقدم الدراسة تجميعًا شاملاً لجينوم صنف القمح الخبز CS، باستخدام نهج هجين يجمع بين تسلسل القراءة الطويلة الفائقة من PacBio HiFi وتقنيات Oxford Nanopore (ONT) مع التقاط تكوين الكروموسومات (Hi-C)، وبيانات Illumina، وBionano. أسفر عملية التجميع عن طول جينوم إجمالي يبلغ 14.51 Gbp، مثبتًا على 21 كروموسوم، مع حجم N50 للقطع يبلغ 723.78 Mbp، مما يمثل تحسينًا كبيرًا مقارنة بالتجميعات السابقة. تم تقييم اكتمال جينوم CS-IAAS باستخدام Benchmarking Universal Single-Copy Orthologs (BUSCO)، محققًا اكتمالًا ملحوظًا بنسبة 99.96%، متجاوزًا التجميعات السابقة.

تتعمق الدراسة أيضًا في الديناميات التطورية لفرعي جينوم القمح، مع تسليط الضوء على إعادة ترتيب الكروموسومات التي حدثت خلال تعدد الصبغيات. ومن الجدير بالذكر أنه تم تحديد 23 انقلابًا رئيسيًا في القمح سداسي الصبغيات، مما أثر على أكثر من 2,000 جين، مع تداعيات على الصفات التكيفية مثل التمثيل الضوئي وتوليد الطاقة. كما كشف التجميع عن الخصائص الهيكلية للسنتروميرات والعناصر التكرارية، بما في ذلك العناصر القابلة للنقل (TEs) والتكرارات القطاعية، التي تلعب أدوارًا حاسمة في الابتكار الجينومي. تؤكد النتائج على تعقيد تطور جينوم القمح وتوفر إطارًا قويًا للدراسات الجينية المستقبلية وبرامج التربية.

Journal: Nature Genetics, Volume: 57, Issue: 4
DOI: https://doi.org/10.1038/s41588-025-02137-x
PMID: https://pubmed.ncbi.nlm.nih.gov/40195562
Publication Date: 2025-04-01
Author(s): Shoucheng Liu et al.
Primary Topic: Chromosomal and Genetic Variations

Overview

The section presents a comparative analysis of the wheat genome assemblies CS-IAAS and CS RefSeq (v.2.1), highlighting significant differences in various genomic statistics. The CS-IAAS assembly comprises 14.51 billion anchored bases, surpassing the 14.01 billion bases of CS RefSeq by 3.57%. Notably, CS-IAAS exhibits a remarkable reduction in gaps (0 compared to 183,603 in CS RefSeq) and a substantial decrease in the number of contigs (21 versus 183,624), resulting in a significantly higher contig N50 of 723.78 Mbp compared to 0.35 Mbp for CS RefSeq.

In terms of gene annotation, CS-IAAS contains 141,035 high-confidence predicted protein-coding genes, which is a 24.19% increase over the 106,915 genes in CS RefSeq. Conversely, the low-confidence predicted protein-coding genes are fewer in CS-IAAS (109,932) compared to CS RefSeq (159,846), indicating a 45.40% decrease. The analysis also reveals differences in structural domains (SDs), with CS-IAAS showing a higher percentage of SDs (64.84%) and SD bases (9.41 Gbp) than CS RefSeq (62.37% and 8.87 Gbp, respectively). However, the elementary and core SD metrics indicate a decrease in both length and number in CS-IAAS compared to CS RefSeq, suggesting a more refined genomic structure in the former.

Results

The “Results” section of the research paper presents key findings derived from the conducted experiments or analyses. It typically includes quantitative data, statistical analyses, and visual representations such as graphs or tables to illustrate the outcomes. The results are often compared against the hypotheses or previous studies to highlight significant differences or confirmations.

In this section, the authors may report specific metrics, such as means, standard deviations, or p-values, to substantiate their claims. Additionally, any observed trends or patterns in the data are discussed, providing insights into the implications of the findings. Overall, the results serve as a foundation for the subsequent discussion and conclusions drawn in the paper.

Discussion

The research presents a comprehensive assembly of the bread wheat cultivar CS genome, utilizing a hybrid approach that combines PacBio HiFi and Oxford Nanopore Technologies (ONT) ultra-long read sequencing with chromosome conformation capture (Hi-C), Illumina, and Bionano data. The assembly process resulted in a total genome length of 14.51 Gbp, anchored to 21 chromosomes, with a contig N50 size of 723.78 Mbp, marking a significant improvement over previous assemblies. The completeness of the CS-IAAS genome was assessed using Benchmarking Universal Single-Copy Orthologs (BUSCO), achieving a remarkable 99.96% completeness, surpassing earlier assemblies.

The study also delves into the evolutionary dynamics of wheat subgenomes, highlighting chromosomal rearrangements that have occurred during polyploidization. Notably, 23 major inversions were identified in the hexaploid wheat, affecting over 2,000 genes, with implications for adaptive traits such as photosynthesis and energy generation. The assembly further revealed the structural characteristics of centromeres and repetitive elements, including transposable elements (TEs) and segmental duplications, which play crucial roles in genomic innovation. The findings underscore the complexity of wheat genome evolution and provide a robust framework for future genetic studies and breeding programs.

كلمات مفتاحية: DNA، التطور، الجزيئي، التيلومير، الجينوم، الجينوم، نبات، السنترومير، المتعددات الجينية، ترانسكريبتوم، تعدد الصيغ الصبغية، تيلومير، جين، خبز، علم الأحياء، علم الأحياء التطوري، علم الأحياء الحاسوبي، علم الوراثة، عناصر قابلة للنقل في الحمض النووي، قمح، كروموسومات، نبات