Kiwon Sohn, Insup Choi, Seongwan Kim, Jaeho Lee, Jungyong Lee, Joonghang Kim
{"title":"A Strategy to Maximize the Utilization of AI Neural Processors on an Automotive Computing Platform","authors":"Kiwon Sohn, Insup Choi, Seongwan Kim, Jaeho Lee, Jungyong Lee, Joonghang Kim","doi":"10.1109/ICCE59016.2024.10444298","DOIUrl":null,"url":null,"abstract":"Advancements in AI are transforming the automotive industry, creating opportunities for AI-powered software and hardware. AI-driven features in automobiles are increasingly embraced due to their potential to significantly improve the driving experience. High-performance computing, particularly with NPUs, becomes crucial for executing the AI features. To maximize the efficiency and utilization of NPUs, DAIMO-NPU optimizes the inference sequence of the DNN models that form the backbones of the AI features. Not only does it organize and schedule the model inference tasks but also supports the tasks to be executed on heterogeneous NPU settings. Three main components are involved in the implementation of DAIMO-NPU. The schedule-table generator is responsible for creating a detailed plan for the model inference tasks, which is to be updated whenever an AI feature is added, removed, or upgraded. The onboard operator reads the schedule table and carries out the tasks accordingly. And, by dividing models into smaller segments, while not mandatory, the schedule table can be further optimized. In the subsequent developments, the integration of additional NPU hardware properties into DAIMO-NPU will be pursued.","PeriodicalId":518694,"journal":{"name":"2024 IEEE International Conference on Consumer Electronics (ICCE)","volume":"22 6","pages":"1-4"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2024 IEEE International Conference on Consumer Electronics (ICCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE59016.2024.10444298","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Advancements in AI are transforming the automotive industry, creating opportunities for AI-powered software and hardware. AI-driven features in automobiles are increasingly embraced due to their potential to significantly improve the driving experience. High-performance computing, particularly with NPUs, becomes crucial for executing the AI features. To maximize the efficiency and utilization of NPUs, DAIMO-NPU optimizes the inference sequence of the DNN models that form the backbones of the AI features. Not only does it organize and schedule the model inference tasks but also supports the tasks to be executed on heterogeneous NPU settings. Three main components are involved in the implementation of DAIMO-NPU. The schedule-table generator is responsible for creating a detailed plan for the model inference tasks, which is to be updated whenever an AI feature is added, removed, or upgraded. The onboard operator reads the schedule table and carries out the tasks accordingly. And, by dividing models into smaller segments, while not mandatory, the schedule table can be further optimized. In the subsequent developments, the integration of additional NPU hardware properties into DAIMO-NPU will be pursued.