Ming Gao, Chaofan Tu, Miaomiao Liu, Jiandong Chen, Xingyu Chen, Hong Zou, Thomas Shiu Tong, Long Chen, Shuke Fu
{"title":"2013-2021年中国县级月二氧化碳排放量。","authors":"Ming Gao, Chaofan Tu, Miaomiao Liu, Jiandong Chen, Xingyu Chen, Hong Zou, Thomas Shiu Tong, Long Chen, Shuke Fu","doi":"10.1038/s41597-025-05461-3","DOIUrl":null,"url":null,"abstract":"<p><p>The top-down method is widely used to estimate China's CO<sub>2</sub> emissions at the county level. However, studies have relied on a single indicator of regional total nighttime light brightness as an instrumental variable for prediction, leading to the assumption that there is a positive correlation between CO<sub>2</sub> emissions and total nighttime light brightness in all regions within the same province. This assumption overlooks other heterogeneous relationships and does not correspond to reality. Therefore, this study constructed a dataset of potential feature variables based on multisource data (improved and calibrated nighttime light data, urban and rural human settlement data, and socioeconomic indicator data based on statistical yearbooks). After the main feature variables were identified, a hybrid regression algorithm combining deep neural networks and CatBoost was constructed to generate instrumental variable for predicting CO<sub>2</sub> emissions. Compared with the total nighttime brightness, it has a stronger linear relationship with CO<sub>2</sub> emissions. Using the top-down algorithm, we estimated China's monthly CO<sub>2</sub> emissions at the county level from 2013 to 2021. This dataset provides a solid foundation for predicting the achievement of China's county-level \"dual carbon\" strategy. The methods used in this study can be generalized to other global regions.</p>","PeriodicalId":21597,"journal":{"name":"Scientific Data","volume":"12 1","pages":"1217"},"PeriodicalIF":6.9000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12259840/pdf/","citationCount":"0","resultStr":"{\"title\":\"China's county-level monthly CO<sub>2</sub> emissions during 2013-2021.\",\"authors\":\"Ming Gao, Chaofan Tu, Miaomiao Liu, Jiandong Chen, Xingyu Chen, Hong Zou, Thomas Shiu Tong, Long Chen, Shuke Fu\",\"doi\":\"10.1038/s41597-025-05461-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The top-down method is widely used to estimate China's CO<sub>2</sub> emissions at the county level. However, studies have relied on a single indicator of regional total nighttime light brightness as an instrumental variable for prediction, leading to the assumption that there is a positive correlation between CO<sub>2</sub> emissions and total nighttime light brightness in all regions within the same province. This assumption overlooks other heterogeneous relationships and does not correspond to reality. Therefore, this study constructed a dataset of potential feature variables based on multisource data (improved and calibrated nighttime light data, urban and rural human settlement data, and socioeconomic indicator data based on statistical yearbooks). After the main feature variables were identified, a hybrid regression algorithm combining deep neural networks and CatBoost was constructed to generate instrumental variable for predicting CO<sub>2</sub> emissions. Compared with the total nighttime brightness, it has a stronger linear relationship with CO<sub>2</sub> emissions. Using the top-down algorithm, we estimated China's monthly CO<sub>2</sub> emissions at the county level from 2013 to 2021. This dataset provides a solid foundation for predicting the achievement of China's county-level \\\"dual carbon\\\" strategy. The methods used in this study can be generalized to other global regions.</p>\",\"PeriodicalId\":21597,\"journal\":{\"name\":\"Scientific Data\",\"volume\":\"12 1\",\"pages\":\"1217\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2025-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12259840/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific Data\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.1038/s41597-025-05461-3\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Data","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1038/s41597-025-05461-3","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
China's county-level monthly CO2 emissions during 2013-2021.
The top-down method is widely used to estimate China's CO2 emissions at the county level. However, studies have relied on a single indicator of regional total nighttime light brightness as an instrumental variable for prediction, leading to the assumption that there is a positive correlation between CO2 emissions and total nighttime light brightness in all regions within the same province. This assumption overlooks other heterogeneous relationships and does not correspond to reality. Therefore, this study constructed a dataset of potential feature variables based on multisource data (improved and calibrated nighttime light data, urban and rural human settlement data, and socioeconomic indicator data based on statistical yearbooks). After the main feature variables were identified, a hybrid regression algorithm combining deep neural networks and CatBoost was constructed to generate instrumental variable for predicting CO2 emissions. Compared with the total nighttime brightness, it has a stronger linear relationship with CO2 emissions. Using the top-down algorithm, we estimated China's monthly CO2 emissions at the county level from 2013 to 2021. This dataset provides a solid foundation for predicting the achievement of China's county-level "dual carbon" strategy. The methods used in this study can be generalized to other global regions.
期刊介绍:
Scientific Data is an open-access journal focused on data, publishing descriptions of research datasets and articles on data sharing across natural sciences, medicine, engineering, and social sciences. Its goal is to enhance the sharing and reuse of scientific data, encourage broader data sharing, and acknowledge those who share their data.
The journal primarily publishes Data Descriptors, which offer detailed descriptions of research datasets, including data collection methods and technical analyses validating data quality. These descriptors aim to facilitate data reuse rather than testing hypotheses or presenting new interpretations, methods, or in-depth analyses.