Oshana Iddi Dissanayake , Sarah E. McPherson , Joseph Allyndrée , Emer Kennedy , Pádraig Cunningham , Lucile Riaboff
{"title":"ActBeCalf: Accelerometer-based multivariate time-series dataset for calf behavior classification","authors":"Oshana Iddi Dissanayake , Sarah E. McPherson , Joseph Allyndrée , Emer Kennedy , Pádraig Cunningham , Lucile Riaboff","doi":"10.1016/j.dib.2025.111462","DOIUrl":null,"url":null,"abstract":"<div><div>Getting new insights on pre-weaned calf behavioral adaptation to routine challenges (transport, group relocation, etc.) and diseases (respiratory diseases, diarrhea, etc.) is a promising way to improve calf welfare in dairy farms. A classic approach to automatically monitoring behavior is to equip animals with accelerometers attached to neck collars and to develop machine learning models from accelerometer time series. However, this accelerometer time series data must be aligned with labels describing behaviors (gold standard) to be used for model development. Obtaining these labels requires annotating behaviors from direct observations or videos, which is highly time-consuming and labor-intensive. In addition, accurate alignment between accelerometer data and behaviors is always challenging due to time synchronization issues. However, the procedure used for the timestamp alignment has not been described in other studies. We introduce <strong>ActBeCalf</strong>: <em>Accelerometer Time-Series Dataset for Calf Behavior classification</em> to address this challenge. Thirty pre-weaned dairy calves (Holstein Friesian and Jersey) housed in 4 group pens at Teagasc Moorepark Research Farm (Fermoy, Co. Cork, Ireland) were equipped with a 3D-accelerometer sensor (sampling rate: 25 Hz) attached to a neck-collar from one week of birth for 13 weeks. The calves were simultaneously filmed with a high-up camera in each pen. Every 15 days, accelerometers were removed from the collars to recharge the battery, collect the data, and reattached to the neck collar again. At the end of the trial, behaviors were manually annotated from the videos using the Behavioral Observation Research Interactive Software (BORIS) by three observers (Cohen's Kappa = 0.72 ± 0.01) using an ethogram with 23 pre-weaned dairy calves' behaviors. Videos were carefully selected to annotate calves’ behaviors. Observations were synchronized with the accelerometer timestamps using an external clock and aligned to the corresponding accelerometer time series. Thereby, ActBeCalf contains 27.4 h of accelerometer data from 30 pre-weaned calves (age 23.7 ± 10.7 days) aligned adequately with calf behaviors. The dataset includes the main behaviors of the calf time-budget, like lying, standing, walking, and running, as well as less prominent behaviors, such as sniffing, scratching, social interaction, and grooming. The reliability of ActBeCalf was validated by developing two machine learning models designed to classify behaviors into two and four classes, respectively. Good predictive performance was achieved for both models (balanced accuracy: 92 % and 84 %, respectively), thereby confirming ActBeCalf's reliability for model development in the field. The code utilized for the classification is publicly available in the dataset repository. ActBeCalf is a comprehensive, ready-to-use dataset, ideal for advancing research in two key areas: classifying pre-weaned calf behavior to support animal welfare initiatives and developing innovative time-series classification models in machine learning.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"60 ","pages":"Article 111462"},"PeriodicalIF":1.0000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340925001945","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Getting new insights on pre-weaned calf behavioral adaptation to routine challenges (transport, group relocation, etc.) and diseases (respiratory diseases, diarrhea, etc.) is a promising way to improve calf welfare in dairy farms. A classic approach to automatically monitoring behavior is to equip animals with accelerometers attached to neck collars and to develop machine learning models from accelerometer time series. However, this accelerometer time series data must be aligned with labels describing behaviors (gold standard) to be used for model development. Obtaining these labels requires annotating behaviors from direct observations or videos, which is highly time-consuming and labor-intensive. In addition, accurate alignment between accelerometer data and behaviors is always challenging due to time synchronization issues. However, the procedure used for the timestamp alignment has not been described in other studies. We introduce ActBeCalf: Accelerometer Time-Series Dataset for Calf Behavior classification to address this challenge. Thirty pre-weaned dairy calves (Holstein Friesian and Jersey) housed in 4 group pens at Teagasc Moorepark Research Farm (Fermoy, Co. Cork, Ireland) were equipped with a 3D-accelerometer sensor (sampling rate: 25 Hz) attached to a neck-collar from one week of birth for 13 weeks. The calves were simultaneously filmed with a high-up camera in each pen. Every 15 days, accelerometers were removed from the collars to recharge the battery, collect the data, and reattached to the neck collar again. At the end of the trial, behaviors were manually annotated from the videos using the Behavioral Observation Research Interactive Software (BORIS) by three observers (Cohen's Kappa = 0.72 ± 0.01) using an ethogram with 23 pre-weaned dairy calves' behaviors. Videos were carefully selected to annotate calves’ behaviors. Observations were synchronized with the accelerometer timestamps using an external clock and aligned to the corresponding accelerometer time series. Thereby, ActBeCalf contains 27.4 h of accelerometer data from 30 pre-weaned calves (age 23.7 ± 10.7 days) aligned adequately with calf behaviors. The dataset includes the main behaviors of the calf time-budget, like lying, standing, walking, and running, as well as less prominent behaviors, such as sniffing, scratching, social interaction, and grooming. The reliability of ActBeCalf was validated by developing two machine learning models designed to classify behaviors into two and four classes, respectively. Good predictive performance was achieved for both models (balanced accuracy: 92 % and 84 %, respectively), thereby confirming ActBeCalf's reliability for model development in the field. The code utilized for the classification is publicly available in the dataset repository. ActBeCalf is a comprehensive, ready-to-use dataset, ideal for advancing research in two key areas: classifying pre-weaned calf behavior to support animal welfare initiatives and developing innovative time-series classification models in machine learning.
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.