自动生成声明式用户界面代码，架起设计与开发的桥梁

arXiv - CS - Software Engineering Pub Date : 2024-09-18 DOI:arxiv-2409.11667

Ting Zhou, Yanjie Zhao, Xinyi Hou, Xiaoyu Sun, Kai Chen, Haoyu Wang

{"title":"自动生成声明式用户界面代码，架起设计与开发的桥梁","authors":"Ting Zhou, Yanjie Zhao, Xinyi Hou, Xiaoyu Sun, Kai Chen, Haoyu Wang","doi":"arxiv-2409.11667","DOIUrl":null,"url":null,"abstract":"Declarative UI frameworks have gained widespread adoption in mobile app\ndevelopment, offering benefits such as improved code readability and easier\nmaintenance. Despite these advantages, the process of translating UI designs\ninto functional code remains challenging and time-consuming. Recent\nadvancements in multimodal large language models (MLLMs) have shown promise in\ndirectly generating mobile app code from user interface (UI) designs. However,\nthe direct application of MLLMs to this task is limited by challenges in\naccurately recognizing UI components and comprehensively capturing interaction\nlogic. To address these challenges, we propose DeclarUI, an automated approach that\nsynergizes computer vision (CV), MLLMs, and iterative compiler-driven\noptimization to generate and refine declarative UI code from designs. DeclarUI\nenhances visual fidelity, functional completeness, and code quality through\nprecise component segmentation, Page Transition Graphs (PTGs) for modeling\ncomplex inter-page relationships, and iterative optimization. In our\nevaluation, DeclarUI outperforms baselines on React Native, a widely adopted\ndeclarative UI framework, achieving a 96.8% PTG coverage rate and a 98%\ncompilation success rate. Notably, DeclarUI demonstrates significant\nimprovements over state-of-the-art MLLMs, with a 123% increase in PTG coverage\nrate, up to 55% enhancement in visual similarity scores, and a 29% boost in\ncompilation success rate. We further demonstrate DeclarUI's generalizability\nthrough successful applications to Flutter and ArkUI frameworks.","PeriodicalId":501278,"journal":{"name":"arXiv - CS - Software Engineering","volume":"41 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Bridging Design and Development with Automated Declarative UI Code Generation\",\"authors\":\"Ting Zhou, Yanjie Zhao, Xinyi Hou, Xiaoyu Sun, Kai Chen, Haoyu Wang\",\"doi\":\"arxiv-2409.11667\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Declarative UI frameworks have gained widespread adoption in mobile app\\ndevelopment, offering benefits such as improved code readability and easier\\nmaintenance. Despite these advantages, the process of translating UI designs\\ninto functional code remains challenging and time-consuming. Recent\\nadvancements in multimodal large language models (MLLMs) have shown promise in\\ndirectly generating mobile app code from user interface (UI) designs. However,\\nthe direct application of MLLMs to this task is limited by challenges in\\naccurately recognizing UI components and comprehensively capturing interaction\\nlogic. To address these challenges, we propose DeclarUI, an automated approach that\\nsynergizes computer vision (CV), MLLMs, and iterative compiler-driven\\noptimization to generate and refine declarative UI code from designs. DeclarUI\\nenhances visual fidelity, functional completeness, and code quality through\\nprecise component segmentation, Page Transition Graphs (PTGs) for modeling\\ncomplex inter-page relationships, and iterative optimization. In our\\nevaluation, DeclarUI outperforms baselines on React Native, a widely adopted\\ndeclarative UI framework, achieving a 96.8% PTG coverage rate and a 98%\\ncompilation success rate. Notably, DeclarUI demonstrates significant\\nimprovements over state-of-the-art MLLMs, with a 123% increase in PTG coverage\\nrate, up to 55% enhancement in visual similarity scores, and a 29% boost in\\ncompilation success rate. We further demonstrate DeclarUI's generalizability\\nthrough successful applications to Flutter and ArkUI frameworks.\",\"PeriodicalId\":501278,\"journal\":{\"name\":\"arXiv - CS - Software Engineering\",\"volume\":\"41 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Software Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.11667\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11667","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

声明式用户界面框架在移动应用开发中得到了广泛采用，它具有提高代码可读性和便于维护等优点。尽管有这些优势，但将用户界面设计转化为功能代码的过程仍然充满挑战且耗费时间。多模态大语言模型（MLLMs）的最新进展表明，从用户界面（UI）设计间接生成移动应用代码是大有可为的。然而，由于无法准确识别用户界面组件和全面捕捉交互逻辑，MLLMs 在这项任务中的直接应用受到了限制。为了应对这些挑战，我们提出了一种自动方法--DeclarUI，它将计算机视觉（CV）、MLLMs 和迭代编译器驱动的优化结合起来，从设计中生成并完善声明式 UI 代码。DeclarUI 通过精确的组件分割、用于模拟复杂页面间关系的页面转换图 (PTG)，以及迭代优化，提高了视觉保真度、功能完整性和代码质量。在我们的评估中，DeclarUI 在 React Native（一种被广泛采用的分解式用户界面框架）上的表现优于基准，实现了 96.8% 的 PTG 覆盖率和 98% 的编译成功率。值得注意的是，与最先进的 MLLM 相比，DeclarUI 的性能有了显著提高，PTG 覆盖率提高了 123%，视觉相似性得分提高了 55%，编译成功率提高了 29%。通过在 Flutter 和 ArkUI 框架中的成功应用，我们进一步证明了 DeclarUI 的通用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Bridging Design and Development with Automated Declarative UI Code Generation

Declarative UI frameworks have gained widespread adoption in mobile app development, offering benefits such as improved code readability and easier maintenance. Despite these advantages, the process of translating UI designs into functional code remains challenging and time-consuming. Recent advancements in multimodal large language models (MLLMs) have shown promise in directly generating mobile app code from user interface (UI) designs. However, the direct application of MLLMs to this task is limited by challenges in accurately recognizing UI components and comprehensively capturing interaction logic. To address these challenges, we propose DeclarUI, an automated approach that synergizes computer vision (CV), MLLMs, and iterative compiler-driven optimization to generate and refine declarative UI code from designs. DeclarUI enhances visual fidelity, functional completeness, and code quality through precise component segmentation, Page Transition Graphs (PTGs) for modeling complex inter-page relationships, and iterative optimization. In our evaluation, DeclarUI outperforms baselines on React Native, a widely adopted declarative UI framework, achieving a 96.8% PTG coverage rate and a 98% compilation success rate. Notably, DeclarUI demonstrates significant improvements over state-of-the-art MLLMs, with a 123% increase in PTG coverage rate, up to 55% enhancement in visual similarity scores, and a 29% boost in compilation success rate. We further demonstrate DeclarUI's generalizability through successful applications to Flutter and ArkUI frameworks.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - CS - Software Engineering

自引率

0.00%

发文量