{"title":"MAGAN-RT:用于低功耗边缘设备上实时卡通化的轻量级对抗性风格传输网络","authors":"Peng Guo","doi":"10.1002/itl2.70104","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Recent advances in neural style transfer (NST) and generative adversarial networks (GANs) have enabled photorealistic and artistic image stylization. However, deploying such models on resource-constrained edge devices remains challenging due to their high computational and memory demands. In this paper, we propose MAGAN-RT, a lightweight adversarial style transfer framework optimized for real-time cartoon-style transformation on low-power mobile and embedded platforms. MAGAN-RT integrates depthwise separable convolutions, inverted bottleneck residual blocks, and a multi-scale perceptual distillation strategy with auxiliary RGB supervision to enable efficient and expressive stylization. Furthermore, a real-image-based adversarial loss is employed to enhance realism while avoiding the artifacts commonly inherited from teacher models. Experimental results demonstrate that MAGAN-RT outperforms existing lightweight and mobile-compatible style transfer networks in both visual quality and runtime efficiency. It achieves state-of-the-art LPIPS, FID, and SSIM scores, while maintaining sub-10 ms inference latency on commercial smartphones, making it suitable for real-time applications such as mobile AR and video filters.</p>\n </div>","PeriodicalId":100725,"journal":{"name":"Internet Technology Letters","volume":"8 5","pages":""},"PeriodicalIF":0.5000,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MAGAN-RT: A Lightweight Adversarial Style Transfer Network for Real-Time Cartoonization on Low-Power Edge Devices\",\"authors\":\"Peng Guo\",\"doi\":\"10.1002/itl2.70104\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Recent advances in neural style transfer (NST) and generative adversarial networks (GANs) have enabled photorealistic and artistic image stylization. However, deploying such models on resource-constrained edge devices remains challenging due to their high computational and memory demands. In this paper, we propose MAGAN-RT, a lightweight adversarial style transfer framework optimized for real-time cartoon-style transformation on low-power mobile and embedded platforms. MAGAN-RT integrates depthwise separable convolutions, inverted bottleneck residual blocks, and a multi-scale perceptual distillation strategy with auxiliary RGB supervision to enable efficient and expressive stylization. Furthermore, a real-image-based adversarial loss is employed to enhance realism while avoiding the artifacts commonly inherited from teacher models. Experimental results demonstrate that MAGAN-RT outperforms existing lightweight and mobile-compatible style transfer networks in both visual quality and runtime efficiency. It achieves state-of-the-art LPIPS, FID, and SSIM scores, while maintaining sub-10 ms inference latency on commercial smartphones, making it suitable for real-time applications such as mobile AR and video filters.</p>\\n </div>\",\"PeriodicalId\":100725,\"journal\":{\"name\":\"Internet Technology Letters\",\"volume\":\"8 5\",\"pages\":\"\"},\"PeriodicalIF\":0.5000,\"publicationDate\":\"2025-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Internet Technology Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/itl2.70104\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"TELECOMMUNICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet Technology Letters","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/itl2.70104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
MAGAN-RT: A Lightweight Adversarial Style Transfer Network for Real-Time Cartoonization on Low-Power Edge Devices
Recent advances in neural style transfer (NST) and generative adversarial networks (GANs) have enabled photorealistic and artistic image stylization. However, deploying such models on resource-constrained edge devices remains challenging due to their high computational and memory demands. In this paper, we propose MAGAN-RT, a lightweight adversarial style transfer framework optimized for real-time cartoon-style transformation on low-power mobile and embedded platforms. MAGAN-RT integrates depthwise separable convolutions, inverted bottleneck residual blocks, and a multi-scale perceptual distillation strategy with auxiliary RGB supervision to enable efficient and expressive stylization. Furthermore, a real-image-based adversarial loss is employed to enhance realism while avoiding the artifacts commonly inherited from teacher models. Experimental results demonstrate that MAGAN-RT outperforms existing lightweight and mobile-compatible style transfer networks in both visual quality and runtime efficiency. It achieves state-of-the-art LPIPS, FID, and SSIM scores, while maintaining sub-10 ms inference latency on commercial smartphones, making it suitable for real-time applications such as mobile AR and video filters.