CMAB: مجموعة بيانات متعددة السمات للبناء في الصين CMAB: A Multi-Attribute Building Dataset of China

المجلة: Scientific Data، المجلد: 12، العدد: 1
DOI: https://doi.org/10.1038/s41597-025-04730-5
PMID: https://pubmed.ncbi.nlm.nih.gov/40075090
تاريخ النشر: 2025-03-12
المؤلف: Yecheng Zhang وآخرون
الموضوع الرئيسي: النمذجة ثلاثية الأبعاد في التطبيقات الجغرافية

نظرة عامة

تتناول هذه الفقرة تطوير أول مجموعة بيانات لمباني متعددة الخصائص على نطاق وطني (CMAB)، والتي تعتبر ضرورية للتحليل الحضري وصنع السياسات. تشمل مجموعة البيانات 3,667 مدينة، و31 مليون مبنى، و23.6 مليار متر مربع من الأسطح، محققة درجة F1 مثيرة للإعجاب تبلغ 89.93% في الاستخراج باستخدام OCRNet. إجمالي مخزون المباني الممثل في مجموعة البيانات هو حوالي 363 مليار متر مكعب.

لإنشاء هذه المجموعة الشاملة من البيانات، استخدم المؤلفون نماذج XGBoost المجمعة التي تعتمد على ميزات تتعلق بتصنيفات الإدارة الحضرية، والشكل، والموقع، والوظيفة. من خلال الاستفادة من مجموعة واسعة من البيانات متعددة المصادر، بما في ذلك مليارات الصور من الاستشعار عن بعد و60 مليون صورة من عرض الشارع، أنتجت الدراسة سمات تفصيلية لكل مبنى، مثل شكل السطح، والارتفاع، والبنية، والوظيفة، والأسلوب، والعمر، والجودة. تم التحقق من دقة هذه السمات مقابل معايير النماذج والتقييمات اليدوية، حيث تجاوزت معظم النتائج 80%. تعتبر النتائج من هذه البحث مهمة لدعم أهداف التنمية المستدامة العالمية (SDGs) وتعزيز جهود التخطيط الحضري.

مقدمة

تسلط مقدمة هذه الورقة البحثية الضوء على التحولات الكبيرة التي شهدتها المدن بسبب التحضر السريع على مدى العقدين الماضيين، مما يبرز الحاجة إلى فهم دقيق للهياكل الحضرية من خلال مجموعات بيانات المباني التفصيلية. تصنف سمات المباني إلى سمات هندسية (مثل، الأقدام، الارتفاعات) وسمات دالة (مثل، الوظائف، الأساليب، الأعمار)، وكلاهما ضروريان للتخطيط الحضري وجهود الاستدامة. على الرغم من التقدم في تقنيات الاستشعار عبر الأقمار الصناعية والتقنيات الحاسوبية، تشير الورقة إلى ندرة مجموعات بيانات المباني ثلاثية الأبعاد على نطاق واسع، خاصة تلك التي تشمل سمات دالة شاملة، مما يعيق التحليل والتخطيط الحضري الفعال.

يحدد المؤلفون التحديات في استخراج بيانات المباني الدقيقة، مثل التكاليف العالية، والحد من الدقة المكانية، وعدم كفاية الأساليب الحالية التي تركز بشكل أساسي على الخصائص الهندسية. يجادلون من أجل تطوير أساليب استخراج عالمية للسمات المتعددة لدعم الدراسات الحضرية وأهداف التنمية المستدامة (SDGs). تقدم الورقة نهجًا مبتكرًا يدمج أدوات التحليل الجغرافي وتعلم الآلة لإنشاء مجموعة بيانات شاملة لمباني متعددة الخصائص (CMAB) تغطي 32 مليون مبنى عبر 3,667 مدينة في الصين. تهدف هذه المجموعة إلى تعزيز دقة النمذجة الحضرية وتوفير مورد أساسي للبحوث المستقبلية ومبادرات التخطيط الحضري.

الطرق

تتكون المنهجية الموضحة في هذا البحث من أربع خطوات أساسية تهدف إلى استخراج والتحقق من سمات المباني في المناطق الحضرية في الصين. تتضمن الخطوة الأولى التحضير، حيث يتم تحديد حدود المدن المكانية لتحديد نطاق استخراج البيانات. يتم تصنيف عينات المباني إلى خمس فئات بناءً على مناطق المناخ والمستويات الإدارية.

في الخطوة الثانية، يتم استخراج السمات الهندسية باستخدام عينات الأسطح التي تم وضع علامات عليها يدويًا لتدريب نموذج OCRNet. يتم تطبيق طريقة تجميع مكاني لالتقاط أسطح المباني عبر المدن المحددة، مما يسمح بحساب خصائص شكلية وكثافة وموقع متنوعة. ثم يتم تدريب نماذج مصممة لكل فئة إدارية لاشتقاق ارتفاعات المباني، مما يسهل إنشاء تمثيلات ثلاثية الأبعاد للمباني.

تركز الخطوة الثالثة على استخراج السمات الدالة، باستخدام بيانات متعددة المصادر للتنبؤ بوظائف المباني بناءً على الارتفاع. يتضمن ذلك دمج بيانات الأسطح غير النفاذة والمطابقة المكانية لتعيين عمر المبنى وجودته، إلى جانب ضبط نموذج متعدد الوسائط لالتقاط السمات الهيكلية والأسلوبية. أخيرًا، تتحقق الخطوة الرابعة من هذه السمات من خلال تقييمات النماذج ووضع علامات يدوية على مجموعة بيانات التحقق، مما يضمن دقة أسطح المباني المستخرجة، والارتفاعات، والوظائف، والجودة، والأعمار. يتم تلخيص عملية الإنتاج العامة بصريًا في شكل مرجعي في النص.

المناقشة

في هذه الفقرة، يناقش المؤلفون المنهجيات المستخدمة لتعزيز دقة وكفاءة استخراج سمات المباني عبر الصين، مع معالجة التحديات المتعلقة بالقوة الحاسوبية وقيود البيانات. يؤكدون على استخدام حدود المدن المكانية المستمدة من مجموعات بيانات متنوعة، بما في ذلك صور الإضاءة الليلية وبيانات تغطية الأرض، لتعريف المناطق الحضرية بدقة أكبر. يسمح هذا النهج بتحليل مركز على المناطق المبنية، مما يقلل من التحيزات الموجودة في البيانات متعددة المصادر، خاصة في المناطق الريفية. تستخدم الدراسة نظام تصنيف يعتمد على التسلسل الإداري الحضري في الصين لتصنيف مواقع المباني، مما يساعد في تدريب النماذج للتنبؤ بارتفاع المباني ووظيفتها.

يستعرض المؤلفون مصادر بياناتهم، بما في ذلك صور الأقمار الصناعية عالية الدقة من Google Earth ومجموعات بيانات متنوعة لسمات المباني، مثل الارتفاع، والوظيفة، والجودة. يستخدمون تقنيات تعلم الآلة المتقدمة، وبشكل خاص خوارزمية XGBoost، للتنبؤ بهذه السمات بناءً على ميزات متعددة الأبعاد مستخرجة من البيانات. كما تسلط الدراسة الضوء على أهمية التحقق من النماذج من خلال تدقيقات يدوية ومقارنات مع مجموعات بيانات موجودة، مما يوضح قوة أساليبهم. بشكل عام، تسهم هذه البحث في مجموعة بيانات شاملة تتضمن سمات مباني تفصيلية، وهو أمر ضروري للتخطيط الحضري وتحليل التنمية في الصين.

Journal: Scientific Data, Volume: 12, Issue: 1
DOI: https://doi.org/10.1038/s41597-025-04730-5
PMID: https://pubmed.ncbi.nlm.nih.gov/40075090
Publication Date: 2025-03-12
Author(s): Yecheng Zhang et al.
Primary Topic: 3D Modeling in Geospatial Applications

Overview

This section outlines the development of the first national-scale Multi-Attribute Building dataset (CMAB), which is crucial for urban analysis and policy-making. The dataset encompasses 3,667 cities, 31 million buildings, and 23.6 billion square meters of rooftops, achieving an impressive F1-Score of 89.93% in extraction using OCRNet. The total building stock represented in the dataset is approximately 363 billion cubic meters.

To create this comprehensive dataset, the authors employed bootstrap aggregated XGBoost models that utilized features related to city administrative classifications, morphology, location, and function. By leveraging a vast array of multisource data, including billions of remote sensing images and 60 million street view images, the study generated detailed attributes for each building, such as rooftop shape, height, structure, function, style, age, and quality. The accuracy of these attributes was validated against model benchmarks and manual assessments, with most results exceeding 80%. The findings from this research are significant for advancing global Sustainable Development Goals (SDGs) and enhancing urban planning efforts.

Introduction

The introduction of this research paper highlights the significant transformations cities have undergone due to rapid urbanization over the past two decades, emphasizing the need for a nuanced understanding of urban structures through detailed building datasets. It categorizes building attributes into geometric (e.g., footprints, heights) and indicative (e.g., functions, styles, ages), both of which are essential for urban planning and sustainability efforts. Despite advancements in satellite sensing and computational technologies, the paper notes a scarcity of large-scale 3D building datasets, particularly those encompassing comprehensive indicative attributes, which hampers effective urban analysis and planning.

The authors identify challenges in extracting accurate building data, such as high costs, limited spatial resolution, and the inadequacy of existing methods that primarily focus on geometric properties. They argue for the development of universal extraction methods for multi-attributes to support urban studies and Sustainable Development Goals (SDGs). The paper presents a novel approach that integrates geographic analysis tools and machine learning to create a comprehensive multi-attribute building dataset (CMAB) covering 32 million buildings across 3,667 cities in China. This dataset aims to enhance the accuracy of urban modeling and provide a foundational resource for future research and urban planning initiatives.

Methods

The methodology outlined in this research consists of four essential steps aimed at extracting and validating building attributes in urban areas of China. The first step involves preparation, where the boundaries of spatial cities are defined to establish the data extraction range. Building samples are categorized into five classes based on climate zones and administrative levels.

In the second step, geometric attributes are extracted using rooftop samples that are manually labeled to train the OCRNet model. A spatial aggregation method is applied to capture building rooftops across the defined cities, allowing for the calculation of various morphological, density, and locational characteristics. Models tailored to each administrative class are then trained to derive building heights, facilitating the creation of 3D building representations.

The third step focuses on extracting indicative attributes, utilizing multi-source data to predict building functions based on height. This involves integrating impervious surface data and spatial matching to assign building age and quality, alongside fine-tuning a multimodal model to capture structural and stylistic attributes. Finally, the fourth step validates these attributes through model evaluations and manual labeling of a validation dataset, ensuring the accuracy of the extracted building rooftops, heights, functions, quality, and ages. The overall production process is visually summarized in a figure referenced in the text.

Discussion

In this section, the authors discuss the methodologies employed to enhance the accuracy and efficiency of building attribute extraction across China, addressing challenges related to computational power and data limitations. They emphasize the use of spatial city boundaries derived from various datasets, including night light imagery and land cover data, to define urban areas more accurately. This approach allows for a focused analysis on built-up regions, mitigating biases inherent in multi-source data, particularly in rural areas. The study utilizes a classification system based on China’s urban administrative hierarchy to categorize building locations, which aids in the training of models for predicting building height and function.

The authors detail their data sources, including high-resolution satellite imagery from Google Earth and various datasets for building attributes, such as height, function, and quality. They employ advanced machine learning techniques, specifically the XGBoost algorithm, to predict these attributes based on multi-dimensional features extracted from the data. The study also highlights the importance of model validation through manual audits and comparisons with existing datasets, demonstrating the robustness of their methods. Overall, the research contributes to a comprehensive dataset that includes detailed building attributes, which is essential for urban planning and development analysis in China.