William T. Tarimo, Moustafa M. Sabra, Shonan Hendre
{"title":"Real-Time Deep Learning-Based Object Detection Framework","authors":"William T. Tarimo, Moustafa M. Sabra, Shonan Hendre","doi":"10.1109/SSCI47803.2020.9308493","DOIUrl":null,"url":null,"abstract":"Recently real-time detection, and recognition of an object of interest are becoming vital tasks in visual data processing and computer vision. Various models have been deployed to implement object detection and tracking in multiple fields. However, conventional classifiers are often faced with challenging tasks that visual frames come distorted due to overlapping, camera motion blur, changing subject appearances, and environmental variations. Models using OpenCV-based HAAR feature-based cascade classifiers, without integrating any error minimizing object detection algorithm, were unable to accurately detect an object and track it in a changing environment. Therefore, developing an embedded powerful framework for realtime object detection and recognition becomes more of a vital need for future implementation in various fields. This study presents a powerful technique for a real-time detector that utilizes integrated Deep Learning Neural Networks (DNN) for optimal computational accuracy. Deploying such a framework will ensure the flexibility and reliability of the detector by eliminating the sources of distortion previously mentioned. The model relies on integrating the ImageAI deep learning libraries and You Only Look Once (YOLO-v3) object detection method with a DarkNet53 architecture. The algorithm was trained using the TensorFlow framework to ensure accurate data processing. This paper targets one vital component of our long-term project of developing a multi-agent system, as the proposed model is to be implemented in autonomous agents for the detection of landmines, ocean debris, and wildlife beside environmental scanning missions. In this study the performance of the model has been assessed through detecting and collecting tennis balls as a preliminary test for real-world applications. The model was able to approach the desirable result of surpassing the accuracy of conventional detectors.","PeriodicalId":413489,"journal":{"name":"2020 IEEE Symposium Series on Computational Intelligence (SSCI)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Symposium Series on Computational Intelligence (SSCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSCI47803.2020.9308493","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Recently real-time detection, and recognition of an object of interest are becoming vital tasks in visual data processing and computer vision. Various models have been deployed to implement object detection and tracking in multiple fields. However, conventional classifiers are often faced with challenging tasks that visual frames come distorted due to overlapping, camera motion blur, changing subject appearances, and environmental variations. Models using OpenCV-based HAAR feature-based cascade classifiers, without integrating any error minimizing object detection algorithm, were unable to accurately detect an object and track it in a changing environment. Therefore, developing an embedded powerful framework for realtime object detection and recognition becomes more of a vital need for future implementation in various fields. This study presents a powerful technique for a real-time detector that utilizes integrated Deep Learning Neural Networks (DNN) for optimal computational accuracy. Deploying such a framework will ensure the flexibility and reliability of the detector by eliminating the sources of distortion previously mentioned. The model relies on integrating the ImageAI deep learning libraries and You Only Look Once (YOLO-v3) object detection method with a DarkNet53 architecture. The algorithm was trained using the TensorFlow framework to ensure accurate data processing. This paper targets one vital component of our long-term project of developing a multi-agent system, as the proposed model is to be implemented in autonomous agents for the detection of landmines, ocean debris, and wildlife beside environmental scanning missions. In this study the performance of the model has been assessed through detecting and collecting tennis balls as a preliminary test for real-world applications. The model was able to approach the desirable result of surpassing the accuracy of conventional detectors.
近年来,实时检测和识别感兴趣的目标已成为视觉数据处理和计算机视觉的重要任务。已经部署了各种模型来实现多个领域的目标检测和跟踪。然而,传统的分类器往往面临着视觉帧由于重叠、相机运动模糊、主体外观变化和环境变化而失真的挑战。使用基于opencv的HAAR特征级联分类器的模型,没有集成任何最小化错误的目标检测算法,无法在不断变化的环境中准确地检测和跟踪目标。因此,开发一种嵌入式的、功能强大的实时目标检测和识别框架成为未来各个领域实现的迫切需要。本研究提出了一种强大的实时检测器技术,该技术利用集成深度学习神经网络(DNN)实现最佳计算精度。部署这样一个框架将通过消除前面提到的失真源来确保检测器的灵活性和可靠性。该模型依赖于集成ImageAI深度学习库和带有DarkNet53架构的You Only Look Once (YOLO-v3)对象检测方法。该算法使用TensorFlow框架进行训练,以确保数据处理的准确性。本文的目标是我们开发多智能体系统的长期项目的一个重要组成部分,因为所提出的模型将在自主智能体中实施,用于检测地雷,海洋碎片和野生动物以及环境扫描任务。在这项研究中,通过检测和收集网球来评估模型的性能,作为实际应用的初步测试。该模型能够接近超越传统探测器精度的理想结果。