Anniek Eerdekens, M. Deruyck, Jaron Fontaine, L. Martens, E. D. Poorter, D. Plets, W. Joseph
{"title":"基于可穿戴传感器加速度计数据卷积神经网络的马行为分类重采样和数据增强","authors":"Anniek Eerdekens, M. Deruyck, Jaron Fontaine, L. Martens, E. D. Poorter, D. Plets, W. Joseph","doi":"10.1109/COINS49042.2020.9191639","DOIUrl":null,"url":null,"abstract":"Monitoring horses’ behaviors through sensors can yield important information about their health and welfare. Sampling frequency majorly affects the classification accuracy in activity recognition and energy needs for the sensor. The aim of this study was to evaluate the effect of sampling rate reduction of a tri-axial accelerometer on the recognition accuracy by resampling a 50 Hz experimental dataset to four lower sampling rates (5 Hz, 10 Hz, 12.5 Hz and 25 Hz). Also, in this work we investigate the ‘reality gap’ that incorporates changes in the data that are primarily characterized as sensor rotations or measurement noise through various data augmentation techniques such as rotation and jittering. Finally, another factor influencing activity recognition are the subjects themselves and therefore the model is evaluated on different horse types. A deep learning-based approach for activity detection of equines is proposed to automatically classify 2238 manually annotated 2 s samples tri-axial accelerometer leg data data of seven different activities performed by six different subjects. The raw data are preprocessed and fed into a convolutional neural network (CNN) from which features are extracted automatically by using strong computing capabilities. Furthermore, the neural network was intentionally designed to minimize running time, enabling us to imagine the future use of the built model in embedded constrained devices. The complexity of these automatic learning techniques can be decreased while achieving high accuracies using ten-fold-cross validation using a computationally less intensive received signal length data (99.32% at 5 Hz vs 99.74% at 25 Hz). This indicates that sampling at 5 Hz with a 2 s window will offer advantages for activity surveillance thanks to decreased energy requirements, since validation time decreases 16-fold (784 microseconds at 50 Hz to 48 microseconds at 5 Hz). Moreover, in this work we show that rotating the training or validation signal with 10 degrees over the X, Y and Z-axis increases the generalization capabilities of our model (99.61 % vs 99.93%) while adding small amounts of noise (smaller than 0.3 standard deviation (STD)) does not decrease the classification accuracy under 99%. Finally, the performance and ability of the model to generalize is validated on data from unseen horses at the cost of only 4.1% and 2.45% reduction in accuracy when validated on a pony and a lame horse, respectively.","PeriodicalId":350108,"journal":{"name":"2020 International Conference on Omni-layer Intelligent Systems (COINS)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Resampling and Data Augmentation For Equines’ Behaviour Classification Based on Wearable Sensor Accelerometer Data Using a Convolutional Neural Network\",\"authors\":\"Anniek Eerdekens, M. Deruyck, Jaron Fontaine, L. Martens, E. D. Poorter, D. Plets, W. Joseph\",\"doi\":\"10.1109/COINS49042.2020.9191639\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Monitoring horses’ behaviors through sensors can yield important information about their health and welfare. Sampling frequency majorly affects the classification accuracy in activity recognition and energy needs for the sensor. The aim of this study was to evaluate the effect of sampling rate reduction of a tri-axial accelerometer on the recognition accuracy by resampling a 50 Hz experimental dataset to four lower sampling rates (5 Hz, 10 Hz, 12.5 Hz and 25 Hz). Also, in this work we investigate the ‘reality gap’ that incorporates changes in the data that are primarily characterized as sensor rotations or measurement noise through various data augmentation techniques such as rotation and jittering. Finally, another factor influencing activity recognition are the subjects themselves and therefore the model is evaluated on different horse types. A deep learning-based approach for activity detection of equines is proposed to automatically classify 2238 manually annotated 2 s samples tri-axial accelerometer leg data data of seven different activities performed by six different subjects. The raw data are preprocessed and fed into a convolutional neural network (CNN) from which features are extracted automatically by using strong computing capabilities. Furthermore, the neural network was intentionally designed to minimize running time, enabling us to imagine the future use of the built model in embedded constrained devices. The complexity of these automatic learning techniques can be decreased while achieving high accuracies using ten-fold-cross validation using a computationally less intensive received signal length data (99.32% at 5 Hz vs 99.74% at 25 Hz). This indicates that sampling at 5 Hz with a 2 s window will offer advantages for activity surveillance thanks to decreased energy requirements, since validation time decreases 16-fold (784 microseconds at 50 Hz to 48 microseconds at 5 Hz). Moreover, in this work we show that rotating the training or validation signal with 10 degrees over the X, Y and Z-axis increases the generalization capabilities of our model (99.61 % vs 99.93%) while adding small amounts of noise (smaller than 0.3 standard deviation (STD)) does not decrease the classification accuracy under 99%. Finally, the performance and ability of the model to generalize is validated on data from unseen horses at the cost of only 4.1% and 2.45% reduction in accuracy when validated on a pony and a lame horse, respectively.\",\"PeriodicalId\":350108,\"journal\":{\"name\":\"2020 International Conference on Omni-layer Intelligent Systems (COINS)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Omni-layer Intelligent Systems (COINS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/COINS49042.2020.9191639\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Omni-layer Intelligent Systems (COINS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COINS49042.2020.9191639","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
摘要
通过传感器监测马的行为可以获得有关它们健康和福利的重要信息。采样频率主要影响活动识别的分类精度和传感器的能量需求。本研究的目的是通过将50 Hz的实验数据集重新采样到4个更低的采样率(5 Hz, 10 Hz, 12.5 Hz和25 Hz)来评估三轴加速度计采样率降低对识别精度的影响。此外,在这项工作中,我们研究了“现实差距”,它包含了数据中的变化,这些变化主要表现为传感器旋转或测量噪声,通过各种数据增强技术(如旋转和抖动)。最后,影响活动识别的另一个因素是受试者本身,因此该模型在不同的马类型上进行了评估。提出了一种基于深度学习的马活动检测方法,对2238个人工标注的2 s样本三轴加速度计腿部数据进行自动分类,这些数据来自6个不同受试者的7种不同活动。原始数据经过预处理后输入卷积神经网络(CNN),利用强大的计算能力自动提取特征。此外,神经网络被有意地设计为最小化运行时间,使我们能够想象未来在嵌入式受限设备中使用所构建的模型。这些自动学习技术的复杂性可以降低,同时使用10倍交叉验证实现高精度,使用计算强度较低的接收信号长度数据(5 Hz时99.32% vs 25 Hz时99.74%)。这表明,由于降低了能量需求,以5 Hz采样和2 s窗口将为活动监视提供优势,因为验证时间减少了16倍(50 Hz时为784微秒,5 Hz时为48微秒)。此外,在这项工作中,我们表明,将训练或验证信号在X、Y和z轴上旋转10度可以提高模型的泛化能力(99.61% vs 99.93%),而添加少量噪声(小于0.3标准偏差(STD))不会降低99%以下的分类精度。最后,在未见过的马的数据上验证了模型的性能和泛化能力,在小马和瘸腿马的数据上验证的准确率分别仅降低了4.1%和2.45%。
Resampling and Data Augmentation For Equines’ Behaviour Classification Based on Wearable Sensor Accelerometer Data Using a Convolutional Neural Network
Monitoring horses’ behaviors through sensors can yield important information about their health and welfare. Sampling frequency majorly affects the classification accuracy in activity recognition and energy needs for the sensor. The aim of this study was to evaluate the effect of sampling rate reduction of a tri-axial accelerometer on the recognition accuracy by resampling a 50 Hz experimental dataset to four lower sampling rates (5 Hz, 10 Hz, 12.5 Hz and 25 Hz). Also, in this work we investigate the ‘reality gap’ that incorporates changes in the data that are primarily characterized as sensor rotations or measurement noise through various data augmentation techniques such as rotation and jittering. Finally, another factor influencing activity recognition are the subjects themselves and therefore the model is evaluated on different horse types. A deep learning-based approach for activity detection of equines is proposed to automatically classify 2238 manually annotated 2 s samples tri-axial accelerometer leg data data of seven different activities performed by six different subjects. The raw data are preprocessed and fed into a convolutional neural network (CNN) from which features are extracted automatically by using strong computing capabilities. Furthermore, the neural network was intentionally designed to minimize running time, enabling us to imagine the future use of the built model in embedded constrained devices. The complexity of these automatic learning techniques can be decreased while achieving high accuracies using ten-fold-cross validation using a computationally less intensive received signal length data (99.32% at 5 Hz vs 99.74% at 25 Hz). This indicates that sampling at 5 Hz with a 2 s window will offer advantages for activity surveillance thanks to decreased energy requirements, since validation time decreases 16-fold (784 microseconds at 50 Hz to 48 microseconds at 5 Hz). Moreover, in this work we show that rotating the training or validation signal with 10 degrees over the X, Y and Z-axis increases the generalization capabilities of our model (99.61 % vs 99.93%) while adding small amounts of noise (smaller than 0.3 standard deviation (STD)) does not decrease the classification accuracy under 99%. Finally, the performance and ability of the model to generalize is validated on data from unseen horses at the cost of only 4.1% and 2.45% reduction in accuracy when validated on a pony and a lame horse, respectively.