PatchHAR: A MLP-Like Architecture for Efficient Activity Recognition Using Wearables

IEEE transactions on biometrics, behavior, and identity science Pub Date : 2024-01-16 DOI:10.1109/TBIOM.2024.3354261

Shuoyuan Wang;Lei Zhang;Xing Wang;Wenbo Huang;Hao Wu;Aiguo Song

{"title":"PatchHAR: A MLP-Like Architecture for Efficient Activity Recognition Using Wearables","authors":"Shuoyuan Wang;Lei Zhang;Xing Wang;Wenbo Huang;Hao Wu;Aiguo Song","doi":"10.1109/TBIOM.2024.3354261","DOIUrl":null,"url":null,"abstract":"To date, convolutional neural networks have played a dominant role in sensor-based human activity recognition (HAR) scenarios. In 2021, researchers from four institutions almost simultaneously released their newest work to arXiv.org, where each of them independently presented new network architectures mainly consisting of linear layers. This arouses a heated debate whether the current research hotspot in deep learning architectures is returning to MLPs. Inspired by the recent success achieved by MLPs, in this paper, we first propose a lightweight network architecture called all-MLP for HAR, which is entirely built on MLP layers with a gating unit. By dividing multi-channel sensor time series into nonoverlapping patches, all linear layers directly process sensor patches to automatically extract local features, which is able to effectively reduce computational cost. Compared with convolutional architectures, it takes fewer FLOPs and parameters but achieves comparable classification score on WISDM, OPPORTUNITY, PAMAP2 and USC-HAD HAR benchmarks. The additional benefit is that all involved computations are matrix multiplication, which can be readily optimized with popular deep learning libraries. This advantage can promote practical HAR deployment in wearable devices. Finally, we evaluate the actual operation of all-MLP model on a Raspberry Pi platform for real-world human activity recognition simulation. We conclude that the new architecture is not a simple reuse of traditional MLPs in HAR scenario, but is a significant advance over them.","PeriodicalId":73307,"journal":{"name":"IEEE transactions on biometrics, behavior, and identity science","volume":"6 2","pages":"169-181"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on biometrics, behavior, and identity science","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10400955/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

To date, convolutional neural networks have played a dominant role in sensor-based human activity recognition (HAR) scenarios. In 2021, researchers from four institutions almost simultaneously released their newest work to arXiv.org, where each of them independently presented new network architectures mainly consisting of linear layers. This arouses a heated debate whether the current research hotspot in deep learning architectures is returning to MLPs. Inspired by the recent success achieved by MLPs, in this paper, we first propose a lightweight network architecture called all-MLP for HAR, which is entirely built on MLP layers with a gating unit. By dividing multi-channel sensor time series into nonoverlapping patches, all linear layers directly process sensor patches to automatically extract local features, which is able to effectively reduce computational cost. Compared with convolutional architectures, it takes fewer FLOPs and parameters but achieves comparable classification score on WISDM, OPPORTUNITY, PAMAP2 and USC-HAD HAR benchmarks. The additional benefit is that all involved computations are matrix multiplication, which can be readily optimized with popular deep learning libraries. This advantage can promote practical HAR deployment in wearable devices. Finally, we evaluate the actual operation of all-MLP model on a Raspberry Pi platform for real-world human activity recognition simulation. We conclude that the new architecture is not a simple reuse of traditional MLPs in HAR scenario, but is a significant advance over them.

查看原文本刊更多论文

PatchHAR：利用可穿戴设备高效识别活动的类 MLP 架构

迄今为止，卷积神经网络在基于传感器的人类活动识别（HAR）场景中一直发挥着主导作用。2021 年，来自四家机构的研究人员几乎同时在 arXiv.org 上发布了他们的最新研究成果，各自独立提出了主要由线性层组成的新网络架构。这引起了一场激烈的争论：当前深度学习架构的研究热点是否会回归到 MLPs？受 MLP 近年来取得的成功启发，本文首先提出了一种名为 all-MLP for HAR 的轻量级网络架构，该架构完全基于 MLP 层，并带有门控单元。通过将多通道传感器时间序列划分为不重叠的斑块，所有线性层直接处理传感器斑块，自动提取局部特征，从而有效降低了计算成本。与卷积架构相比，它所需的 FLOP 和参数更少，但在 WISDM、OPPORTUNITY、PAMAP2 和 USC-HAD HAR 基准测试中的分类得分却不相上下。它的另一个好处是，所有涉及的计算都是矩阵乘法，可以使用流行的深度学习库进行优化。这一优势可以促进 HAR 在可穿戴设备中的实际应用。最后，我们评估了全 MLP 模型在 Raspberry Pi 平台上的实际运行情况，以进行真实世界的人类活动识别模拟。我们得出的结论是，新架构并不是传统 MLP 在 HAR 场景中的简单重用，而是在此基础上的重大进步。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on biometrics, behavior, and identity science

CiteScore

10.90

自引率

0.00%

发文量