{"title":"MAGAN-RT: A Lightweight Adversarial Style Transfer Network for Real-Time Cartoonization on Low-Power Edge Devices","authors":"Peng Guo","doi":"10.1002/itl2.70104","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Recent advances in neural style transfer (NST) and generative adversarial networks (GANs) have enabled photorealistic and artistic image stylization. However, deploying such models on resource-constrained edge devices remains challenging due to their high computational and memory demands. In this paper, we propose MAGAN-RT, a lightweight adversarial style transfer framework optimized for real-time cartoon-style transformation on low-power mobile and embedded platforms. MAGAN-RT integrates depthwise separable convolutions, inverted bottleneck residual blocks, and a multi-scale perceptual distillation strategy with auxiliary RGB supervision to enable efficient and expressive stylization. Furthermore, a real-image-based adversarial loss is employed to enhance realism while avoiding the artifacts commonly inherited from teacher models. Experimental results demonstrate that MAGAN-RT outperforms existing lightweight and mobile-compatible style transfer networks in both visual quality and runtime efficiency. It achieves state-of-the-art LPIPS, FID, and SSIM scores, while maintaining sub-10 ms inference latency on commercial smartphones, making it suitable for real-time applications such as mobile AR and video filters.</p>\n </div>","PeriodicalId":100725,"journal":{"name":"Internet Technology Letters","volume":"8 5","pages":""},"PeriodicalIF":0.5000,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet Technology Letters","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/itl2.70104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Recent advances in neural style transfer (NST) and generative adversarial networks (GANs) have enabled photorealistic and artistic image stylization. However, deploying such models on resource-constrained edge devices remains challenging due to their high computational and memory demands. In this paper, we propose MAGAN-RT, a lightweight adversarial style transfer framework optimized for real-time cartoon-style transformation on low-power mobile and embedded platforms. MAGAN-RT integrates depthwise separable convolutions, inverted bottleneck residual blocks, and a multi-scale perceptual distillation strategy with auxiliary RGB supervision to enable efficient and expressive stylization. Furthermore, a real-image-based adversarial loss is employed to enhance realism while avoiding the artifacts commonly inherited from teacher models. Experimental results demonstrate that MAGAN-RT outperforms existing lightweight and mobile-compatible style transfer networks in both visual quality and runtime efficiency. It achieves state-of-the-art LPIPS, FID, and SSIM scores, while maintaining sub-10 ms inference latency on commercial smartphones, making it suitable for real-time applications such as mobile AR and video filters.