Effective, Platform-Independent GUI Testing via Image Embedding and Reinforcement Learning

IF 6.2 2区计算机科学 Q1 COMPUTER SCIENCE, SOFTWARE ENGINEERING

ACM Transactions on Software Engineering and Methodology Pub Date : 2024-06-21 DOI:10.1145/3674728

Shengcheng Yu, Chunrong Fang, Xin Li, Yuchen Ling, Zhenyu Chen, Zhendong Su

{"title":"Effective, Platform-Independent GUI Testing via Image Embedding and Reinforcement Learning","authors":"Shengcheng Yu, Chunrong Fang, Xin Li, Yuchen Ling, Zhenyu Chen, Zhendong Su","doi":"10.1145/3674728","DOIUrl":null,"url":null,"abstract":"Software applications (apps) have been playing an increasingly important role in various aspects of society. In particular, mobile apps and web apps are the most prevalent among all applications and are widely used in various industries as well as in people’s daily lives. To help ensure mobile and web app quality, many approaches have been introduced to improve app GUI testing via automated exploration, including random testing, model-based testing, learning-based testing, etc. Despite the extensive effort, existing approaches are still limited in reaching high code coverage, constructing high-quality models, and being generally applicable. Reinforcement learning-based approaches, as a group of representative and advanced approaches for automated GUI exploration testing, are faced with difficult challenges, including effective app state abstraction, reward function design, etc. Moreover, they heavily depend on the specific execution platforms (i.e., Android or Web), thus leading to poor generalizability and being unable to adapt to different platforms.This work specifically tackles these challenges based on the high-level observation that apps from distinct platforms share commonalities in GUI design. Indeed, we propose PIRLTEST, an effective platform-independent approach for app testing. Specifically, PIRLTEST utilizes computer vision and reinforcement learning techniques in a novel, synergistic manner for automated testing. It extracts the GUI widgets from GUI pages and characterizes the corresponding GUI layouts, embedding the GUI pages as states. The app GUI state combines the macroscopic perspective (app GUI layout) and the microscopic perspective (app GUI widget), and attaches the critical semantic information from GUI images. This enables PIRLTEST to be platform-independent and makes the testing approach generally applicable on different platforms. PIRLTEST explores apps with the guidance of a curiosity-driven strategy, which uses a Q-network to estimate the values of specific state-action pairs to encourage more exploration in uncovered pages without platform dependency. The exploration will be assigned with rewards for all actions, which are designed considering both the app GUI states and the concrete widgets, to help the framework explore more uncovered pages. We conduct an empirical study on 20 mobile apps and 5 web apps, and the results show that PIRLTEST is zero-cost when being adapted to different platforms, and can perform better than the baselines, covering 6.3–41.4% more code on mobile apps and 1.5–51.1% more code on web apps. PIRLTEST is capable of detecting 128 unique bugs on mobile and web apps, including 100 bugs that cannot be detected by the baselines.","PeriodicalId":50933,"journal":{"name":"ACM Transactions on Software Engineering and Methodology","volume":"51 1","pages":""},"PeriodicalIF":6.2000,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Software Engineering and Methodology","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3674728","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Software applications (apps) have been playing an increasingly important role in various aspects of society. In particular, mobile apps and web apps are the most prevalent among all applications and are widely used in various industries as well as in people’s daily lives. To help ensure mobile and web app quality, many approaches have been introduced to improve app GUI testing via automated exploration, including random testing, model-based testing, learning-based testing, etc. Despite the extensive effort, existing approaches are still limited in reaching high code coverage, constructing high-quality models, and being generally applicable. Reinforcement learning-based approaches, as a group of representative and advanced approaches for automated GUI exploration testing, are faced with difficult challenges, including effective app state abstraction, reward function design, etc. Moreover, they heavily depend on the specific execution platforms (i.e., Android or Web), thus leading to poor generalizability and being unable to adapt to different platforms.

This work specifically tackles these challenges based on the high-level observation that apps from distinct platforms share commonalities in GUI design. Indeed, we propose PIRLT_EST, an effective platform-independent approach for app testing. Specifically, PIRLT_EST utilizes computer vision and reinforcement learning techniques in a novel, synergistic manner for automated testing. It extracts the GUI widgets from GUI pages and characterizes the corresponding GUI layouts, embedding the GUI pages as states. The app GUI state combines the macroscopic perspective (app GUI layout) and the microscopic perspective (app GUI widget), and attaches the critical semantic information from GUI images. This enables PIRLT_EST to be platform-independent and makes the testing approach generally applicable on different platforms. PIRLT_EST explores apps with the guidance of a curiosity-driven strategy, which uses a Q-network to estimate the values of specific state-action pairs to encourage more exploration in uncovered pages without platform dependency. The exploration will be assigned with rewards for all actions, which are designed considering both the app GUI states and the concrete widgets, to help the framework explore more uncovered pages. We conduct an empirical study on 20 mobile apps and 5 web apps, and the results show that PIRLT_EST is zero-cost when being adapted to different platforms, and can perform better than the baselines, covering 6.3–41.4% more code on mobile apps and 1.5–51.1% more code on web apps. PIRLT_EST is capable of detecting 128 unique bugs on mobile and web apps, including 100 bugs that cannot be detected by the baselines.

查看原文本刊更多论文

通过图像嵌入和强化学习进行有效、独立于平台的图形用户界面测试

软件应用程序（应用程序）在社会的各个方面发挥着越来越重要的作用。其中，移动应用程序和网络应用程序在所有应用程序中最为普遍，被广泛应用于各行各业和人们的日常生活中。为了帮助确保移动应用程序和网络应用程序的质量，人们引入了许多方法来通过自动探索改进应用程序图形用户界面测试，包括随机测试、基于模型的测试、基于学习的测试等。尽管付出了大量努力，但现有方法在实现高代码覆盖率、构建高质量模型和普遍适用性方面仍有局限。基于强化学习的方法作为图形用户界面自动探索测试的一组具有代表性的先进方法，面临着有效的应用程序状态抽象、奖励函数设计等难题。此外，这些方法严重依赖于特定的执行平台（如 Android 或 Web），因此导致普适性差，无法适应不同的平台。这项工作基于对不同平台的应用程序在图形用户界面设计方面具有共性的高层次观察，专门应对这些挑战。事实上，我们提出的 PIRLTEST 是一种独立于平台的有效应用程序测试方法。具体来说，PIRLTEST 以一种新颖、协同的方式利用计算机视觉和强化学习技术进行自动测试。它从图形用户界面页面中提取图形用户界面部件，并描述相应的图形用户界面布局，将图形用户界面页面嵌入为状态。应用程序图形用户界面状态结合了宏观视角（应用程序图形用户界面布局）和微观视角（应用程序图形用户界面部件），并附加了图形用户界面图像的关键语义信息。这使得 PIRLTEST 与平台无关，并使测试方法普遍适用于不同平台。PIRLTEST 在好奇心驱动策略的指导下探索应用程序，该策略使用 Q 网络来估算特定状态-动作对的值，以鼓励在不依赖平台的情况下对未覆盖页面进行更多探索。探索过程中的所有操作都将获得奖励，奖励的设计同时考虑了应用程序图形用户界面的状态和具体的小部件，以帮助框架探索更多未覆盖的页面。我们在 20 个移动应用程序和 5 个网络应用程序上进行了实证研究，结果表明，PIRLTEST 在适应不同平台时是零成本的，而且性能比基线更好，在移动应用程序上覆盖的代码比基线多 6.3-41.4%，在网络应用程序上覆盖的代码比基线多 1.5-51.1%。PIRLTEST 能够在移动和网络应用程序中检测出 128 个独特的错误，其中包括 100 个基线无法检测到的错误。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ACM Transactions on Software Engineering and Methodology 工程技术-计算机：软件工程

CiteScore

6.30

自引率

4.50%

发文量

164

审稿时长

>12 weeks

期刊介绍： Designing and building a large, complex software system is a tremendous challenge. ACM Transactions on Software Engineering and Methodology (TOSEM) publishes papers on all aspects of that challenge: specification, design, development and maintenance. It covers tools and methodologies, languages, data structures, and algorithms. TOSEM also reports on successful efforts, noting practical lessons that can be scaled and transferred to other projects, and often looks at applications of innovative technologies. The tone is scholarly but readable; the content is worthy of study; the presentation is effective.