Poster Abstract: 3D Activity Localization With Multiple Sensors
Xinyu Li, Yanyi Zhang, Jianyu Zhang, Shuhong Chen, Yue Gu, Richard A Farneth, Ivan Marsic, Randall S Burd
IPSN 2017 (Conference Proceedings), pp. 297-298, April 2017. DOI: 10.1145/3055031.3055057
We present a deep learning framework for fast 3D activity localization and tracking in dynamic, crowded real-world settings. Our training approach reverses the traditional activity-localization pipeline, which first estimates the possible locations of activities and then predicts their occurrence. Instead, we first trained a deep convolutional neural network for activity recognition using depth video and RFID data as input, and then used the network's activation maps to locate the recognized activity in 3D space. Our system achieved an average localization error of about 20 cm in a 4 m × 5 m room, comparable to the Kinect's body-skeleton tracking error (10-20 cm); unlike the Kinect, however, our system tracks activities rather than people's locations.
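As a concrete illustration of the activation-map idea, the sketch below shows how a recognized activity could be localized from the class activation map (CAM) of a small convolutional network over a depth frame. This is a minimal, hypothetical PyTorch example: the network (ActivityNet), its layer sizes, the single-frame depth input, and the helper activation_map_location are assumptions for illustration, not the paper's actual architecture, and the RFID fusion branch is omitted.

# Hypothetical sketch: localize an activity from a class activation map.
# Assumed names and architecture; not the paper's actual network.
import torch
import torch.nn as nn

class ActivityNet(nn.Module):
    """Toy CNN over depth frames; it ends in global average pooling so a
    class activation map can be read off the final convolutional layer."""
    def __init__(self, num_activities=10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 32, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),                          # downsamples H, W by 2
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
        )
        self.gap = nn.AdaptiveAvgPool2d(1)            # global average pooling
        self.fc = nn.Linear(64, num_activities)       # one weight vector per class

    def forward(self, x):
        fmap = self.features(x)                       # (B, 64, H/2, W/2)
        logits = self.fc(self.gap(fmap).flatten(1))   # (B, num_activities)
        return logits, fmap

def activation_map_location(model, depth_frame):
    """Recognize the activity, then localize it at the peak of its class
    activation map, using the depth value at that pixel as the third axis."""
    model.eval()
    with torch.no_grad():
        logits, fmap = model(depth_frame)             # depth_frame: (1, 1, H, W)
        cls = logits.argmax(dim=1).item()
        # CAM = class weights (64,) dotted with feature maps (64, H/2, W/2)
        cam = torch.einsum('c,chw->hw', model.fc.weight[cls], fmap[0])
        idx = cam.flatten().argmax().item()
        h, w = divmod(idx, cam.shape[1])
        y, x = h * 2, w * 2                           # undo the 2x downsampling
        z = depth_frame[0, 0, y, x].item()            # depth at the CAM peak
        return cls, (x, y, z)

if __name__ == "__main__":
    net = ActivityNet()
    frame = torch.rand(1, 1, 64, 64)                  # stand-in depth frame
    activity, (x, y, z) = activation_map_location(net, frame)
    print(f"activity {activity} near pixel ({x}, {y}) at depth {z:.2f}")

The global-average-pooling head is what makes the per-class activation map directly readable from the final convolutional layer, following the standard CAM construction; the peak of that map gives an image location for the recognized activity, and the depth channel supplies the third coordinate.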