Paul R Hibbing, Samuel R LaMunion, Haileab Hilafu, Scott E Crouter
{"title":"Evaluating the Performance of Sensor-based Bout Detection Algorithms: The Transition Pairing Method.","authors":"Paul R Hibbing, Samuel R LaMunion, Haileab Hilafu, Scott E Crouter","doi":"10.1123/jmpb.2019-0039","DOIUrl":null,"url":null,"abstract":"<p><p>Bout detection algorithms are used to segment data from wearable sensors, but it is challenging to assess segmentation correctness.</p><p><strong>Purpose: </strong>To present and demonstrate the Transition Pairing Method (TPM), a new method for evaluating the performance of bout detection algorithms.</p><p><strong>Methods: </strong>The TPM compares predicted transitions to a criterion measure in terms of number and timing. A true positive is defined as a predicted transition that corresponds with one criterion transition in a mutually exclusive pair. The pairs are established using an extended Gale-Shapley algorithm, and the user specifies a maximum allowable within-pair time lag, above which pairs cannot be formed. Unpaired predictions and criteria are false positives and false negatives, respectively. The demonstration used raw acceleration data from 88 youth who wore ActiGraph GT9X monitors (right hip and non-dominant wrist) during simulated free-living. Youth Sojourn bout detection algorithms were applied (one for each attachment site), and the TPM was used to compare predicted bout transitions to the criterion measure (direct observation). Performance metrics were calculated for each participant, and hip-versus-wrist means were compared using paired T-tests (α = 0.05).</p><p><strong>Results: </strong>When the maximum allowable lag was 1-s, both algorithms had recall <20% (2.4% difference from one another, p<0.01) and precision <10% (1.4% difference from one another, p<0.001). That is, >80% of criterion transitions were undetected, and >90% of predicted transitions were false positives.</p><p><strong>Conclusion: </strong>The TPM improves on conventional analyses by providing specific information about bout detection in a standardized way that applies to any bout detection algorithm.</p>","PeriodicalId":73572,"journal":{"name":"Journal for the measurement of physical behaviour","volume":" ","pages":"219-227"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8274497/pdf/nihms-1599163.pdf","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal for the measurement of physical behaviour","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1123/jmpb.2019-0039","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/5/20 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Bout detection algorithms are used to segment data from wearable sensors, but it is challenging to assess segmentation correctness.
Purpose: To present and demonstrate the Transition Pairing Method (TPM), a new method for evaluating the performance of bout detection algorithms.
Methods: The TPM compares predicted transitions to a criterion measure in terms of number and timing. A true positive is defined as a predicted transition that corresponds with one criterion transition in a mutually exclusive pair. The pairs are established using an extended Gale-Shapley algorithm, and the user specifies a maximum allowable within-pair time lag, above which pairs cannot be formed. Unpaired predictions and criteria are false positives and false negatives, respectively. The demonstration used raw acceleration data from 88 youth who wore ActiGraph GT9X monitors (right hip and non-dominant wrist) during simulated free-living. Youth Sojourn bout detection algorithms were applied (one for each attachment site), and the TPM was used to compare predicted bout transitions to the criterion measure (direct observation). Performance metrics were calculated for each participant, and hip-versus-wrist means were compared using paired T-tests (α = 0.05).
Results: When the maximum allowable lag was 1-s, both algorithms had recall <20% (2.4% difference from one another, p<0.01) and precision <10% (1.4% difference from one another, p<0.001). That is, >80% of criterion transitions were undetected, and >90% of predicted transitions were false positives.
Conclusion: The TPM improves on conventional analyses by providing specific information about bout detection in a standardized way that applies to any bout detection algorithm.