{"title":"Datasheets for Energy Datasets: An Ethically-Minded Approach to Documentation","authors":"Ilana Heintz","doi":"10.1145/3599733.3600249","DOIUrl":null,"url":null,"abstract":"This work presents an argument for the use of specific documentation for the ethical development, use, and sharing of energy datasets, and an evaluation of current practice in the energy AI community. Drawing on a recently developed resource from the broader machine learning community and applying it to the specific context of energy AI research, opportunities for more transparent collection and distribution of energy datasets are revealed. To help elucidate the utility of the datasheets and the energy community’s current level of documentation, two publicly available energy datasets are chosen for analysis. One has published documentation covering 66% of the datasheet questionnaire, while the second covers 42% of the suggested information. Two additional questions are recommended for energy-relevant datasheets that will promote ethical AI practices in the energy domain. A new resource for exploring and aligning energy datasets with demographic data is provided.","PeriodicalId":114998,"journal":{"name":"Companion Proceedings of the 14th ACM International Conference on Future Energy Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Companion Proceedings of the 14th ACM International Conference on Future Energy Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3599733.3600249","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This work presents an argument for the use of specific documentation for the ethical development, use, and sharing of energy datasets, and an evaluation of current practice in the energy AI community. Drawing on a recently developed resource from the broader machine learning community and applying it to the specific context of energy AI research, opportunities for more transparent collection and distribution of energy datasets are revealed. To help elucidate the utility of the datasheets and the energy community’s current level of documentation, two publicly available energy datasets are chosen for analysis. One has published documentation covering 66% of the datasheet questionnaire, while the second covers 42% of the suggested information. Two additional questions are recommended for energy-relevant datasheets that will promote ethical AI practices in the energy domain. A new resource for exploring and aligning energy datasets with demographic data is provided.