{"title":"基于多命令队列和步进并行迭代粒子滤波的CPU-GPU加速实时三维球跟踪","authors":"Yilin Hou, Xina Cheng, T. Ikenaga","doi":"10.1109/ICMIP.2017.59","DOIUrl":null,"url":null,"abstract":"3D ball tracking is a critical function in manyapplications such as game and players behavior analysis, andreal time implementation has become increasingly importantfor it can be used for live broadcast and TV contents. To reacha high accuracy, algorithms usually are time consuming due toa large set of calculations which is challenging to meet realtime demanding. This paper proposes multiple commandqueues, tactical threads allocation and stepped iterativeaddition to empower such a capacity on the CPU-GPUplatform. Multiple command queues achieves a parallelismbetween tasks in the algorithm. Secondly, the tactical threadsallocation helps mapping the algorithm into GPU andenhances synchronism between threads. And this paperproposes stepped iterative addition to achieve partialparallelism in a sequential operation. This work implements inan Intel Core i7-6700 GPU and AMD Radeon R9 FURY GPU.Tracking speed of our work increases 37.8 times from original431ms to 11.7ms while the success rate of the algorithm retainsover 99%. This result fully meets the requirement of 16.6msper frame for 60fps video real-time tracking.","PeriodicalId":227455,"journal":{"name":"2017 2nd International Conference on Multimedia and Image Processing (ICMIP)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Real-Time 3D Ball Tracking with CPU-GPU Acceleration Using Particle Filter with Multi-command Queues and Stepped Parallelism Iteration\",\"authors\":\"Yilin Hou, Xina Cheng, T. Ikenaga\",\"doi\":\"10.1109/ICMIP.2017.59\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"3D ball tracking is a critical function in manyapplications such as game and players behavior analysis, andreal time implementation has become increasingly importantfor it can be used for live broadcast and TV contents. To reacha high accuracy, algorithms usually are time consuming due toa large set of calculations which is challenging to meet realtime demanding. This paper proposes multiple commandqueues, tactical threads allocation and stepped iterativeaddition to empower such a capacity on the CPU-GPUplatform. Multiple command queues achieves a parallelismbetween tasks in the algorithm. Secondly, the tactical threadsallocation helps mapping the algorithm into GPU andenhances synchronism between threads. And this paperproposes stepped iterative addition to achieve partialparallelism in a sequential operation. This work implements inan Intel Core i7-6700 GPU and AMD Radeon R9 FURY GPU.Tracking speed of our work increases 37.8 times from original431ms to 11.7ms while the success rate of the algorithm retainsover 99%. This result fully meets the requirement of 16.6msper frame for 60fps video real-time tracking.\",\"PeriodicalId\":227455,\"journal\":{\"name\":\"2017 2nd International Conference on Multimedia and Image Processing (ICMIP)\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 2nd International Conference on Multimedia and Image Processing (ICMIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMIP.2017.59\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 2nd International Conference on Multimedia and Image Processing (ICMIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMIP.2017.59","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Real-Time 3D Ball Tracking with CPU-GPU Acceleration Using Particle Filter with Multi-command Queues and Stepped Parallelism Iteration
3D ball tracking is a critical function in manyapplications such as game and players behavior analysis, andreal time implementation has become increasingly importantfor it can be used for live broadcast and TV contents. To reacha high accuracy, algorithms usually are time consuming due toa large set of calculations which is challenging to meet realtime demanding. This paper proposes multiple commandqueues, tactical threads allocation and stepped iterativeaddition to empower such a capacity on the CPU-GPUplatform. Multiple command queues achieves a parallelismbetween tasks in the algorithm. Secondly, the tactical threadsallocation helps mapping the algorithm into GPU andenhances synchronism between threads. And this paperproposes stepped iterative addition to achieve partialparallelism in a sequential operation. This work implements inan Intel Core i7-6700 GPU and AMD Radeon R9 FURY GPU.Tracking speed of our work increases 37.8 times from original431ms to 11.7ms while the success rate of the algorithm retainsover 99%. This result fully meets the requirement of 16.6msper frame for 60fps video real-time tracking.