MAGAN-RT:用于低功耗边缘设备上实时卡通化的轻量级对抗性风格传输网络

IF 0.5 Q4 TELECOMMUNICATIONS
Peng Guo
{"title":"MAGAN-RT:用于低功耗边缘设备上实时卡通化的轻量级对抗性风格传输网络","authors":"Peng Guo","doi":"10.1002/itl2.70104","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Recent advances in neural style transfer (NST) and generative adversarial networks (GANs) have enabled photorealistic and artistic image stylization. However, deploying such models on resource-constrained edge devices remains challenging due to their high computational and memory demands. In this paper, we propose MAGAN-RT, a lightweight adversarial style transfer framework optimized for real-time cartoon-style transformation on low-power mobile and embedded platforms. MAGAN-RT integrates depthwise separable convolutions, inverted bottleneck residual blocks, and a multi-scale perceptual distillation strategy with auxiliary RGB supervision to enable efficient and expressive stylization. Furthermore, a real-image-based adversarial loss is employed to enhance realism while avoiding the artifacts commonly inherited from teacher models. Experimental results demonstrate that MAGAN-RT outperforms existing lightweight and mobile-compatible style transfer networks in both visual quality and runtime efficiency. It achieves state-of-the-art LPIPS, FID, and SSIM scores, while maintaining sub-10 ms inference latency on commercial smartphones, making it suitable for real-time applications such as mobile AR and video filters.</p>\n </div>","PeriodicalId":100725,"journal":{"name":"Internet Technology Letters","volume":"8 5","pages":""},"PeriodicalIF":0.5000,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MAGAN-RT: A Lightweight Adversarial Style Transfer Network for Real-Time Cartoonization on Low-Power Edge Devices\",\"authors\":\"Peng Guo\",\"doi\":\"10.1002/itl2.70104\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Recent advances in neural style transfer (NST) and generative adversarial networks (GANs) have enabled photorealistic and artistic image stylization. However, deploying such models on resource-constrained edge devices remains challenging due to their high computational and memory demands. In this paper, we propose MAGAN-RT, a lightweight adversarial style transfer framework optimized for real-time cartoon-style transformation on low-power mobile and embedded platforms. MAGAN-RT integrates depthwise separable convolutions, inverted bottleneck residual blocks, and a multi-scale perceptual distillation strategy with auxiliary RGB supervision to enable efficient and expressive stylization. Furthermore, a real-image-based adversarial loss is employed to enhance realism while avoiding the artifacts commonly inherited from teacher models. Experimental results demonstrate that MAGAN-RT outperforms existing lightweight and mobile-compatible style transfer networks in both visual quality and runtime efficiency. It achieves state-of-the-art LPIPS, FID, and SSIM scores, while maintaining sub-10 ms inference latency on commercial smartphones, making it suitable for real-time applications such as mobile AR and video filters.</p>\\n </div>\",\"PeriodicalId\":100725,\"journal\":{\"name\":\"Internet Technology Letters\",\"volume\":\"8 5\",\"pages\":\"\"},\"PeriodicalIF\":0.5000,\"publicationDate\":\"2025-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Internet Technology Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/itl2.70104\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"TELECOMMUNICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet Technology Letters","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/itl2.70104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

神经风格迁移(NST)和生成对抗网络(gan)的最新进展使照片真实感和艺术图像风格化成为可能。然而,在资源受限的边缘设备上部署这样的模型仍然具有挑战性,因为它们对计算和内存的要求很高。在本文中,我们提出了MAGAN-RT,这是一个轻量级的对抗性风格转换框架,针对低功耗移动和嵌入式平台上的实时卡通风格转换进行了优化。MAGAN-RT集成了深度可分离卷积、反向瓶颈残差块和多尺度感知蒸馏策略以及辅助RGB监督,以实现高效和富有表现力的风格化。此外,采用基于真实图像的对抗损失来增强真实感,同时避免了通常从教师模型继承的伪影。实验结果表明,MAGAN-RT在视觉质量和运行效率方面都优于现有的轻量级和移动兼容风格传递网络。它实现了最先进的LPIPS, FID和SSIM分数,同时在商用智能手机上保持低于10毫秒的推理延迟,使其适用于移动AR和视频过滤器等实时应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
MAGAN-RT: A Lightweight Adversarial Style Transfer Network for Real-Time Cartoonization on Low-Power Edge Devices

Recent advances in neural style transfer (NST) and generative adversarial networks (GANs) have enabled photorealistic and artistic image stylization. However, deploying such models on resource-constrained edge devices remains challenging due to their high computational and memory demands. In this paper, we propose MAGAN-RT, a lightweight adversarial style transfer framework optimized for real-time cartoon-style transformation on low-power mobile and embedded platforms. MAGAN-RT integrates depthwise separable convolutions, inverted bottleneck residual blocks, and a multi-scale perceptual distillation strategy with auxiliary RGB supervision to enable efficient and expressive stylization. Furthermore, a real-image-based adversarial loss is employed to enhance realism while avoiding the artifacts commonly inherited from teacher models. Experimental results demonstrate that MAGAN-RT outperforms existing lightweight and mobile-compatible style transfer networks in both visual quality and runtime efficiency. It achieves state-of-the-art LPIPS, FID, and SSIM scores, while maintaining sub-10 ms inference latency on commercial smartphones, making it suitable for real-time applications such as mobile AR and video filters.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
3.10
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信