{"title":"以太坊智能合约分布的拟合与回归","authors":"Maher Alharby, A. Moorsel","doi":"10.1109/BRAINS49436.2020.9223314","DOIUrl":null,"url":null,"abstract":"To simulate blockchain systems as close to reality as possible, we need accurate estimates of the probability distribution of various variables. In this paper we obtain distributions for Ethereum smart contract transactions, with respect to Gas Limit, Used Gas, Gas Price and CPU Time. To determine these distributions we use publicly available Ethereum smart contract information, augmented with experimental data for over 300,000 smart contracts obtained on a test bed. We conclude that Gaussian Mixture Models are appropriate for distributions of smart contracts with respect to Used Gas and Gas Price, and use a uniform distribution for the distribution with respect to the Gas Limit. A correlation analysis shows that the CPU Time is strongly correlated with Used Gas and we therefore apply regression techniques to estimate the CPU Time conditioned on Used Gas. We experiment with three ensemble regression methods, namely Random Forest, Gradient Boosting Machine and Adaptive Boosting and conclude that Random Forest is both fast and accurate.","PeriodicalId":315392,"journal":{"name":"2020 2nd Conference on Blockchain Research & Applications for Innovative Networks and Services (BRAINS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Fitting and Regression for Distributions of Ethereum Smart Contracts\",\"authors\":\"Maher Alharby, A. Moorsel\",\"doi\":\"10.1109/BRAINS49436.2020.9223314\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To simulate blockchain systems as close to reality as possible, we need accurate estimates of the probability distribution of various variables. In this paper we obtain distributions for Ethereum smart contract transactions, with respect to Gas Limit, Used Gas, Gas Price and CPU Time. To determine these distributions we use publicly available Ethereum smart contract information, augmented with experimental data for over 300,000 smart contracts obtained on a test bed. We conclude that Gaussian Mixture Models are appropriate for distributions of smart contracts with respect to Used Gas and Gas Price, and use a uniform distribution for the distribution with respect to the Gas Limit. A correlation analysis shows that the CPU Time is strongly correlated with Used Gas and we therefore apply regression techniques to estimate the CPU Time conditioned on Used Gas. We experiment with three ensemble regression methods, namely Random Forest, Gradient Boosting Machine and Adaptive Boosting and conclude that Random Forest is both fast and accurate.\",\"PeriodicalId\":315392,\"journal\":{\"name\":\"2020 2nd Conference on Blockchain Research & Applications for Innovative Networks and Services (BRAINS)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 2nd Conference on Blockchain Research & Applications for Innovative Networks and Services (BRAINS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BRAINS49436.2020.9223314\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 2nd Conference on Blockchain Research & Applications for Innovative Networks and Services (BRAINS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BRAINS49436.2020.9223314","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
为了尽可能地模拟区块链系统,我们需要准确估计各种变量的概率分布。在本文中,我们获得了以太坊智能合约交易的分布,包括Gas Limit, Used Gas, Gas Price和CPU Time。为了确定这些分布,我们使用了公开可用的以太坊智能合约信息,并在测试台上获得了超过30万个智能合约的实验数据。我们得出结论,高斯混合模型适用于智能合约中使用的天然气和天然气价格的分布,并使用均匀分布的天然气限制的分布。相关分析表明,CPU时间与使用过的气体有很强的相关性,因此我们应用回归技术来估计CPU时间以使用过的气体为条件。我们对随机森林、梯度增强机和自适应增强三种集成回归方法进行了实验,结果表明随机森林既快速又准确。
Fitting and Regression for Distributions of Ethereum Smart Contracts
To simulate blockchain systems as close to reality as possible, we need accurate estimates of the probability distribution of various variables. In this paper we obtain distributions for Ethereum smart contract transactions, with respect to Gas Limit, Used Gas, Gas Price and CPU Time. To determine these distributions we use publicly available Ethereum smart contract information, augmented with experimental data for over 300,000 smart contracts obtained on a test bed. We conclude that Gaussian Mixture Models are appropriate for distributions of smart contracts with respect to Used Gas and Gas Price, and use a uniform distribution for the distribution with respect to the Gas Limit. A correlation analysis shows that the CPU Time is strongly correlated with Used Gas and we therefore apply regression techniques to estimate the CPU Time conditioned on Used Gas. We experiment with three ensemble regression methods, namely Random Forest, Gradient Boosting Machine and Adaptive Boosting and conclude that Random Forest is both fast and accurate.