Python代码中AWS最佳实践的静态分析

European Conference on Object-Oriented Programming Pub Date : 2022-05-09 DOI:10.48550/arXiv.2205.04432

Rajdeep Mukherjee, Omer Tripp, B. Liblit, Michael Wilson

{"title":"Python代码中AWS最佳实践的静态分析","authors":"Rajdeep Mukherjee, Omer Tripp, B. Liblit, Michael Wilson","doi":"10.48550/arXiv.2205.04432","DOIUrl":null,"url":null,"abstract":"Amazon Web Services (AWS) is a comprehensive and broadly adopted cloud provider, offering over 200 fully featured services, including compute, database, storage, networking and content delivery, machine learning, Internet of Things and many others. AWS SDKs provide access to AWS services through API endpoints. However, incorrect use of these APIs can lead to code defects, crashes, performance issues, and other problems. This paper presents automated static analysis rules, developed in the context of a commercial service for detection of code defects and security vulnerabilities, to identify deviations from AWS best practices in Python applications that use the AWS SDK. Such applications use the AWS SDK for Python, called\"Boto3\", to access AWS cloud services. However, precise static analysis of Python applications that use cloud SDKs requires robust type inference for inferring the types of cloud service clients. The dynamic style of Boto3 APIs poses unique challenges for type resolution, as does the interprocedural style in which service clients are used in practice. In support of our best-practices goal, we present a layered strategy for type inference that combines multiple type-resolution and tracking strategies in a staged manner. From our experiments across>3,000 popular Python GitHub repos that make use of the AWS SDK, our layered type inference system achieves 85% precision and 100% recall in inferring Boto3 clients in Python client code. Additionally, we present a representative sample of eight AWS best-practice rules that detect a wide range of issues including pagination, polling, and batch operations. We have assessed the efficacy of these rules based on real-world developer feedback. Developers have accepted more than 85% of the recommendations made by five out of eight Python rules, and almost 83% of all recommendations.","PeriodicalId":172012,"journal":{"name":"European Conference on Object-Oriented Programming","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Static Analysis for AWS Best Practices in Python Code\",\"authors\":\"Rajdeep Mukherjee, Omer Tripp, B. Liblit, Michael Wilson\",\"doi\":\"10.48550/arXiv.2205.04432\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Amazon Web Services (AWS) is a comprehensive and broadly adopted cloud provider, offering over 200 fully featured services, including compute, database, storage, networking and content delivery, machine learning, Internet of Things and many others. AWS SDKs provide access to AWS services through API endpoints. However, incorrect use of these APIs can lead to code defects, crashes, performance issues, and other problems. This paper presents automated static analysis rules, developed in the context of a commercial service for detection of code defects and security vulnerabilities, to identify deviations from AWS best practices in Python applications that use the AWS SDK. Such applications use the AWS SDK for Python, called\\\"Boto3\\\", to access AWS cloud services. However, precise static analysis of Python applications that use cloud SDKs requires robust type inference for inferring the types of cloud service clients. The dynamic style of Boto3 APIs poses unique challenges for type resolution, as does the interprocedural style in which service clients are used in practice. In support of our best-practices goal, we present a layered strategy for type inference that combines multiple type-resolution and tracking strategies in a staged manner. From our experiments across>3,000 popular Python GitHub repos that make use of the AWS SDK, our layered type inference system achieves 85% precision and 100% recall in inferring Boto3 clients in Python client code. Additionally, we present a representative sample of eight AWS best-practice rules that detect a wide range of issues including pagination, polling, and batch operations. We have assessed the efficacy of these rules based on real-world developer feedback. Developers have accepted more than 85% of the recommendations made by five out of eight Python rules, and almost 83% of all recommendations.\",\"PeriodicalId\":172012,\"journal\":{\"name\":\"European Conference on Object-Oriented Programming\",\"volume\":\"62 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Conference on Object-Oriented Programming\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2205.04432\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Conference on Object-Oriented Programming","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2205.04432","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

摘要

亚马逊网络服务(AWS)是一家广泛采用的综合性云服务提供商，提供200多种功能齐全的服务，包括计算、数据库、存储、网络和内容交付、机器学习、物联网等。AWS sdk通过API端点提供对AWS服务的访问。但是，不正确地使用这些api可能会导致代码缺陷、崩溃、性能问题和其他问题。本文介绍了在商业服务环境中开发的自动静态分析规则，用于检测代码缺陷和安全漏洞，以识别使用AWS SDK的Python应用程序中与AWS最佳实践的偏差。这些应用程序使用AWS Python SDK(称为“Boto3”)来访问AWS云服务。然而，对使用云sdk的Python应用程序进行精确的静态分析需要可靠的类型推断来推断云服务客户端的类型。Boto3 api的动态风格对类型解析提出了独特的挑战，就像在实践中使用服务客户端的过程间风格一样。为了支持我们的最佳实践目标，我们提出了一种分层的类型推断策略，该策略以分阶段的方式结合了多种类型解析和跟踪策略。通过对使用AWS SDK的超过3000个流行Python GitHub版本的实验，我们的多层类型推断系统在推断Python客户端代码中的Boto3客户端时达到了85%的准确率和100%的召回率。此外，我们还提供了八个AWS最佳实践规则的代表性示例，这些规则可以检测各种问题，包括分页、轮询和批处理操作。我们已经根据现实世界开发者的反馈评估了这些规则的有效性。开发人员已经接受了超过85%的Python规则中5条的建议，几乎接受了所有建议的83%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Static Analysis for AWS Best Practices in Python Code

Amazon Web Services (AWS) is a comprehensive and broadly adopted cloud provider, offering over 200 fully featured services, including compute, database, storage, networking and content delivery, machine learning, Internet of Things and many others. AWS SDKs provide access to AWS services through API endpoints. However, incorrect use of these APIs can lead to code defects, crashes, performance issues, and other problems. This paper presents automated static analysis rules, developed in the context of a commercial service for detection of code defects and security vulnerabilities, to identify deviations from AWS best practices in Python applications that use the AWS SDK. Such applications use the AWS SDK for Python, called"Boto3", to access AWS cloud services. However, precise static analysis of Python applications that use cloud SDKs requires robust type inference for inferring the types of cloud service clients. The dynamic style of Boto3 APIs poses unique challenges for type resolution, as does the interprocedural style in which service clients are used in practice. In support of our best-practices goal, we present a layered strategy for type inference that combines multiple type-resolution and tracking strategies in a staged manner. From our experiments across>3,000 popular Python GitHub repos that make use of the AWS SDK, our layered type inference system achieves 85% precision and 100% recall in inferring Boto3 clients in Python client code. Additionally, we present a representative sample of eight AWS best-practice rules that detect a wide range of issues including pagination, polling, and batch operations. We have assessed the efficacy of these rules based on real-world developer feedback. Developers have accepted more than 85% of the recommendations made by five out of eight Python rules, and almost 83% of all recommendations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

European Conference on Object-Oriented Programming

自引率

0.00%

发文量