{"title":"WIP:数据架构的实时重构","authors":"W. Macey, Dali Wang, P. Thornton, A. Mockus","doi":"10.1109/SE4Science.2017.14","DOIUrl":null,"url":null,"abstract":"In large-scale Earth System simulation codes, such asthe Accelerated Climate Model for Energy (ACME), complex user derived data types (containing large numberof variables) are designed to represent the interactionsof atmosphere, ocean, land, ice, and biosphere toproject global climate under a wide variety of conditions. The following is our proposed approach to restructurethe data architecture of a land component within theACME project while the project is undergoing activedevelopment. The data architect for the land subsystemdefines the new datatype requirements that wouldgreatly simplify the implementation of terrestrial landsubmodels by converting more than 50 to just eight primarydata-types. Since the code is developed with thecommunity governance, we have to ensure that the restructuringdoes not interface the other developmentwhich, with dozens of changes occurring every day, makeit impossible to work on a shared development branch. The active development also occurs on almost five hundredbranches, making it extremely difficult to assesspotential interactions. To address these challenges we have designed andstarted an iterative procedure for implementing the datarestructuring and estimating both the effort it takes torestructure and the effort would save once the restructuringis implemented.","PeriodicalId":318588,"journal":{"name":"2017 IEEE/ACM 12th International Workshop on Software Engineering for Science (SE4Science)","volume":"99 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"WIP: Live Restructuring of Data Architecture\",\"authors\":\"W. Macey, Dali Wang, P. Thornton, A. Mockus\",\"doi\":\"10.1109/SE4Science.2017.14\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In large-scale Earth System simulation codes, such asthe Accelerated Climate Model for Energy (ACME), complex user derived data types (containing large numberof variables) are designed to represent the interactionsof atmosphere, ocean, land, ice, and biosphere toproject global climate under a wide variety of conditions. The following is our proposed approach to restructurethe data architecture of a land component within theACME project while the project is undergoing activedevelopment. The data architect for the land subsystemdefines the new datatype requirements that wouldgreatly simplify the implementation of terrestrial landsubmodels by converting more than 50 to just eight primarydata-types. Since the code is developed with thecommunity governance, we have to ensure that the restructuringdoes not interface the other developmentwhich, with dozens of changes occurring every day, makeit impossible to work on a shared development branch. The active development also occurs on almost five hundredbranches, making it extremely difficult to assesspotential interactions. To address these challenges we have designed andstarted an iterative procedure for implementing the datarestructuring and estimating both the effort it takes torestructure and the effort would save once the restructuringis implemented.\",\"PeriodicalId\":318588,\"journal\":{\"name\":\"2017 IEEE/ACM 12th International Workshop on Software Engineering for Science (SE4Science)\",\"volume\":\"99 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE/ACM 12th International Workshop on Software Engineering for Science (SE4Science)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SE4Science.2017.14\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE/ACM 12th International Workshop on Software Engineering for Science (SE4Science)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SE4Science.2017.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
In large-scale Earth System simulation codes, such asthe Accelerated Climate Model for Energy (ACME), complex user derived data types (containing large numberof variables) are designed to represent the interactionsof atmosphere, ocean, land, ice, and biosphere toproject global climate under a wide variety of conditions. The following is our proposed approach to restructurethe data architecture of a land component within theACME project while the project is undergoing activedevelopment. The data architect for the land subsystemdefines the new datatype requirements that wouldgreatly simplify the implementation of terrestrial landsubmodels by converting more than 50 to just eight primarydata-types. Since the code is developed with thecommunity governance, we have to ensure that the restructuringdoes not interface the other developmentwhich, with dozens of changes occurring every day, makeit impossible to work on a shared development branch. The active development also occurs on almost five hundredbranches, making it extremely difficult to assesspotential interactions. To address these challenges we have designed andstarted an iterative procedure for implementing the datarestructuring and estimating both the effort it takes torestructure and the effort would save once the restructuringis implemented.