Francesco Biancalani, Giorgio Gnecco, Rodolfo Metulini, Massimo Riccaboni
{"title":"基于矩阵完成优化问题,预测国家和部门的年二氧化碳排放量","authors":"Francesco Biancalani, Giorgio Gnecco, Rodolfo Metulini, Massimo Riccaboni","doi":"10.1007/s11590-023-02052-2","DOIUrl":null,"url":null,"abstract":"Abstract In the recent past, annual CO $$_2$$ <mml:math xmlns:mml=\"http://www.w3.org/1998/Math/MathML\"> <mml:msub> <mml:mrow /> <mml:mn>2</mml:mn> </mml:msub> </mml:math> emissions at the international level were examined from various perspectives, motivated by rising concerns about pollution and climate change. Nevertheless, to the best of the authors’ knowledge, the problem of dealing with the potential inaccuracy/missingness of such data at the country and economic sector levels has been overlooked. Thereby, in this article we apply a supervised machine learning technique called Matrix Completion (MC) to predict, for each country in the available database, annual CO $$_2$$ <mml:math xmlns:mml=\"http://www.w3.org/1998/Math/MathML\"> <mml:msub> <mml:mrow /> <mml:mn>2</mml:mn> </mml:msub> </mml:math> emissions data at the sector level, based on past data related to all the sectors, and more recent data related to a subset of sectors. The core idea of MC consists in the formulation of a suitable optimization problem, namely the minimization of a proper trade-off between the approximation error over a set of observed elements of a matrix (training set) and a proxy of the rank of the reconstructed matrix, e.g., its nuclear norm. In the article, we apply MC to the imputation of (artificially) missing elements of country-specific matrices whose elements come from annual CO $$_2$$ <mml:math xmlns:mml=\"http://www.w3.org/1998/Math/MathML\"> <mml:msub> <mml:mrow /> <mml:mn>2</mml:mn> </mml:msub> </mml:math> emission levels related to different sectors, after proper pre-processing at the sector level. Results highlight typically a better performance of the combination of MC with suitably-constructed baseline estimates with respect to the baselines alone. Potential applications of our analysis arise in the prediction of currently missing elements of matrices of annual CO $$_2$$ <mml:math xmlns:mml=\"http://www.w3.org/1998/Math/MathML\"> <mml:msub> <mml:mrow /> <mml:mn>2</mml:mn> </mml:msub> </mml:math> emission levels and in the construction of counterfactuals, useful to estimate the effects of policy changes able to influence the annual CO $$_2$$ <mml:math xmlns:mml=\"http://www.w3.org/1998/Math/MathML\"> <mml:msub> <mml:mrow /> <mml:mn>2</mml:mn> </mml:msub> </mml:math> emission levels of specific sectors in selected countries.","PeriodicalId":49720,"journal":{"name":"Optimization Letters","volume":"4 1","pages":"0"},"PeriodicalIF":1.1000,"publicationDate":"2023-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of annual CO2 emissions at the country and sector levels, based on a matrix completion optimization problem\",\"authors\":\"Francesco Biancalani, Giorgio Gnecco, Rodolfo Metulini, Massimo Riccaboni\",\"doi\":\"10.1007/s11590-023-02052-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract In the recent past, annual CO $$_2$$ <mml:math xmlns:mml=\\\"http://www.w3.org/1998/Math/MathML\\\"> <mml:msub> <mml:mrow /> <mml:mn>2</mml:mn> </mml:msub> </mml:math> emissions at the international level were examined from various perspectives, motivated by rising concerns about pollution and climate change. Nevertheless, to the best of the authors’ knowledge, the problem of dealing with the potential inaccuracy/missingness of such data at the country and economic sector levels has been overlooked. Thereby, in this article we apply a supervised machine learning technique called Matrix Completion (MC) to predict, for each country in the available database, annual CO $$_2$$ <mml:math xmlns:mml=\\\"http://www.w3.org/1998/Math/MathML\\\"> <mml:msub> <mml:mrow /> <mml:mn>2</mml:mn> </mml:msub> </mml:math> emissions data at the sector level, based on past data related to all the sectors, and more recent data related to a subset of sectors. The core idea of MC consists in the formulation of a suitable optimization problem, namely the minimization of a proper trade-off between the approximation error over a set of observed elements of a matrix (training set) and a proxy of the rank of the reconstructed matrix, e.g., its nuclear norm. In the article, we apply MC to the imputation of (artificially) missing elements of country-specific matrices whose elements come from annual CO $$_2$$ <mml:math xmlns:mml=\\\"http://www.w3.org/1998/Math/MathML\\\"> <mml:msub> <mml:mrow /> <mml:mn>2</mml:mn> </mml:msub> </mml:math> emission levels related to different sectors, after proper pre-processing at the sector level. Results highlight typically a better performance of the combination of MC with suitably-constructed baseline estimates with respect to the baselines alone. Potential applications of our analysis arise in the prediction of currently missing elements of matrices of annual CO $$_2$$ <mml:math xmlns:mml=\\\"http://www.w3.org/1998/Math/MathML\\\"> <mml:msub> <mml:mrow /> <mml:mn>2</mml:mn> </mml:msub> </mml:math> emission levels and in the construction of counterfactuals, useful to estimate the effects of policy changes able to influence the annual CO $$_2$$ <mml:math xmlns:mml=\\\"http://www.w3.org/1998/Math/MathML\\\"> <mml:msub> <mml:mrow /> <mml:mn>2</mml:mn> </mml:msub> </mml:math> emission levels of specific sectors in selected countries.\",\"PeriodicalId\":49720,\"journal\":{\"name\":\"Optimization Letters\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2023-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Optimization Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s11590-023-02052-2\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATHEMATICS, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optimization Letters","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s11590-023-02052-2","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS, APPLIED","Score":null,"Total":0}
Prediction of annual CO2 emissions at the country and sector levels, based on a matrix completion optimization problem
Abstract In the recent past, annual CO $$_2$$ 2 emissions at the international level were examined from various perspectives, motivated by rising concerns about pollution and climate change. Nevertheless, to the best of the authors’ knowledge, the problem of dealing with the potential inaccuracy/missingness of such data at the country and economic sector levels has been overlooked. Thereby, in this article we apply a supervised machine learning technique called Matrix Completion (MC) to predict, for each country in the available database, annual CO $$_2$$ 2 emissions data at the sector level, based on past data related to all the sectors, and more recent data related to a subset of sectors. The core idea of MC consists in the formulation of a suitable optimization problem, namely the minimization of a proper trade-off between the approximation error over a set of observed elements of a matrix (training set) and a proxy of the rank of the reconstructed matrix, e.g., its nuclear norm. In the article, we apply MC to the imputation of (artificially) missing elements of country-specific matrices whose elements come from annual CO $$_2$$ 2 emission levels related to different sectors, after proper pre-processing at the sector level. Results highlight typically a better performance of the combination of MC with suitably-constructed baseline estimates with respect to the baselines alone. Potential applications of our analysis arise in the prediction of currently missing elements of matrices of annual CO $$_2$$ 2 emission levels and in the construction of counterfactuals, useful to estimate the effects of policy changes able to influence the annual CO $$_2$$ 2 emission levels of specific sectors in selected countries.
期刊介绍:
Optimization Letters is an international journal covering all aspects of optimization, including theory, algorithms, computational studies, and applications, and providing an outlet for rapid publication of short communications in the field. Originality, significance, quality and clarity are the essential criteria for choosing the material to be published.
Optimization Letters has been expanding in all directions at an astonishing rate during the last few decades. New algorithmic and theoretical techniques have been developed, the diffusion into other disciplines has proceeded at a rapid pace, and our knowledge of all aspects of the field has grown even more profound. At the same time one of the most striking trends in optimization is the constantly increasing interdisciplinary nature of the field.
Optimization Letters aims to communicate in a timely fashion all recent developments in optimization with concise short articles (limited to a total of ten journal pages). Such concise articles will be easily accessible by readers working in any aspects of optimization and wish to be informed of recent developments.