{"title":"Cross-Granularity Fusion Network for Fine-Grained Image Classification","authors":"Wenjin Pang, Wei Song","doi":"10.1109/AINIT59027.2023.10212436","DOIUrl":"https://doi.org/10.1109/AINIT59027.2023.10212436","url":null,"abstract":"Fine-grained image classification (FGIC) aims to identify subtle visual differences among subcategories, which is challenging due to the small inter-class variances. Existing methods recognize subcategories mainly by locating discriminative parts which exists in the regions with high responses in deep feature maps. However, the regions with high responses in deep feature maps correspond to large receptive fields in the input image, leading to the result that subtle visual differences among subcategories cannot be captured precisely. In this paper we propose a novel Cross-Granularity Fusion Network (CGFN), which excavates subtle yet discriminative granularity features within each part and captures potential interactions among granularity features to build powerful part feature representations. The CGFN consists of two modules: First, the Multi-Granularity Proposal (MGP) module locates diverse and discriminative parts and focuses context-complementary granularities across different hierarchies within each part. Second, a Cross-Granularity Fusion (CGF) module is developed by fusing granularity features to acquire robust part features for the final classification. We conduct a series of experiments on publicly available datasets i.e., CUB-200-2011, Stanford Cars and FGVC-Aircraft datasets and experimental results demonstrate that the CGFN achieves state-of-the-art performance.","PeriodicalId":276778,"journal":{"name":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130821699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hongxia Xu, Liang Jiang, Zhongwen Cao, Xiangwen Bao
{"title":"Design study of a rotorless Unmanned Aerial Vehicle","authors":"Hongxia Xu, Liang Jiang, Zhongwen Cao, Xiangwen Bao","doi":"10.1109/AINIT59027.2023.10210723","DOIUrl":"https://doi.org/10.1109/AINIT59027.2023.10210723","url":null,"abstract":"The modeling software Fusion360 was used to design a compact and symmetric rotorless Unmanned Aerial Vehicle (UAV) that aims to address the safety hazards and high noise levels associated with rotor blades rotating at high speeds. The rotorless UAV utilizes four bladeless fans instead of external paddles and places the motors needed to provide power inside the fuselage. The locations of the rotorless UAV's air vents, air inlets, camera, and sled mount structures are analyzed, while the operating parameters of the main structural energy supply module (battery), power module, and control module are studied. The safety and noise problems caused by the rotor blades during the flight of the rotorless UAV and the damage of the motor easily caused by external factors are explored and solved.","PeriodicalId":276778,"journal":{"name":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"15 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114076021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Datian Tao, Xuedong Liu, Tingchao Shi, Yi Zhong, Yi Han
{"title":"Research on GIS-Based Open Simulation System for Visualization of Complex Underwater Environment","authors":"Datian Tao, Xuedong Liu, Tingchao Shi, Yi Zhong, Yi Han","doi":"10.1109/AINIT59027.2023.10212727","DOIUrl":"https://doi.org/10.1109/AINIT59027.2023.10212727","url":null,"abstract":"With the rapid development of science and technology, marine resources are receiving increasing attention. However, due to the special nature of the marine environment, it is difficult to achieve autonomous marine protection as well as efficient management by human resources alone, which seriously restricts the exploration and development of marine resources. To address this need, the paper proposes an open simulation system for visualizing complex underwater environments based on Geographic Information System (GIS). The system firstly realizes the 3D visualization of underwater topography; secondly, the rendering of hydrographic data is realized in the front-end interface. The results show that the simulation system for the visual representation of complex underwater environment is more suitable for realistic scenarios, and in the process of obtaining effective marine data, it can effectively avoid the influence of disturbing factors such as turbid water and insufficient light.","PeriodicalId":276778,"journal":{"name":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123375001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Double encryption of digital grid edge cloud data in complex dynamic networks","authors":"Xiang Huang, Yuxu Chen, Liming Wan, Xichi Zeng, Guopeng Liang","doi":"10.1109/AINIT59027.2023.10212441","DOIUrl":"https://doi.org/10.1109/AINIT59027.2023.10212441","url":null,"abstract":"The current conventional data encryption algorithms for digital grid edge cloud data mainly realize data encryption by generating dislocation matrix, which leads to poor encryption effect due to the weak access control performance of network data. In this regard, a double encryption algorithm for digital grid edge cloud data in complex dynamic networks is proposed. By constructing a complex dynamic network model, and constructing a privacy database according to the model characteristics. The binary tree algorithm is used to access control the digital grid edge cloud data, and the double encryption of the edge cloud data is realized by dynamic keys. In the experiments, the encryption performance of the designed encryption method is tested. The final results can prove that after the proposed method is used to encrypt the edge cloud data, the information integrity of the database is high and has a more excellent data encryption performance.","PeriodicalId":276778,"journal":{"name":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122589955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kai Zhao, Xiaolin Song, Y. Xu, Ruiheng Mao, Sheng Fan, Jia-bo Zhang
{"title":"Autonomous Path Planning Logistics Sorting Robot Experimental Teaching Platform Based on Aruco Marker","authors":"Kai Zhao, Xiaolin Song, Y. Xu, Ruiheng Mao, Sheng Fan, Jia-bo Zhang","doi":"10.1109/AINIT59027.2023.10212971","DOIUrl":"https://doi.org/10.1109/AINIT59027.2023.10212971","url":null,"abstract":"In order to optimize the teaching content of “Engineering Training II” for Automation Specialty, the curriculum teaching system has been reshaped with project-based teaching as the center based on the characteristics of the discipline. We have designed a logistics robot experimental teaching platform that combines intelligent vehicles and robotic arms based on ARUCO Marker positioning. The system uses STM32 as the main control chip, Quanzhi H6 as the computing power platform, L298N and DC deceleration motor as running drivers, and the Yuejiang robotic arm as the cargo grabbing device. At the same time, combining ARUCO Marker recognition, Dijkstra optimal path planning, and incremental PID as intelligent algorithms, it achieves functions such as navigation control, cargo handling, and fixed point stacking for the logistics sorting robot system. The experimental results show that the system has fast recognition speed, high positioning accuracy, low price, and is easy to expand. It can effectively improve students' ability to comprehensively apply basic theoretical knowledge to solve practical problems and enhance their engineering literacy.","PeriodicalId":276778,"journal":{"name":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124865994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Balanced Overall and Local: Improving Image Captioning with Enhanced Transformer Model","authors":"Haotian Xian, Baoyi Guo, Youyu Zhou","doi":"10.1109/AINIT59027.2023.10212804","DOIUrl":"https://doi.org/10.1109/AINIT59027.2023.10212804","url":null,"abstract":"This article proposes a Transformer-based image captioning model using computer vision and natural language processing techniques. The model is based on Multi-Featured Attention Module and Grid-Augmented Module and outperforms the original Transformer model on all evaluation metrics. Specifically, with a beam size of 7, the model achieves a BLEU-4 score of 0.409 and a CIDEr score of 1.008.","PeriodicalId":276778,"journal":{"name":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125144077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on positioning and navigation of medical robot based on RGB-D visual SLAM","authors":"Pei-fa Jia","doi":"10.1109/AINIT59027.2023.10212975","DOIUrl":"https://doi.org/10.1109/AINIT59027.2023.10212975","url":null,"abstract":"With the increasing demand for medical service robots and the development of RGB-D-based visual SLAM technology, there is a growing interest in research on medical service robots that can navigate autonomously. This paper uses mathematical language to introduce the camera imaging principle, image feature matching algorithms, and pose estimation algorithms in visual SLAM in detail. We have selected an experimental scene in a targeted manner and have combined the improved ORB feature points and feature mismatch elimination algorithm (RANSAC) to enhance the original algorithm. We analyze and demonstrate the advantages and disadvantages of the algorithm before and after the improvement, and provide insights into future developments in autonomous navigation for medical service robots.","PeriodicalId":276778,"journal":{"name":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127326207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Distracted Driver Detection with MobileVGG Network","authors":"Yueying Zhu","doi":"10.1109/AINIT59027.2023.10212841","DOIUrl":"https://doi.org/10.1109/AINIT59027.2023.10212841","url":null,"abstract":"The escalation of road traffic fatalities in recent years has highlighted the issue of distracted driving as a significant problem that warrants attention. This paper presents a CNN-based approach for identifying and categorizing distracted driving behavior, catering to the requirements of advanced driver assistance systems. The proposed algorithm demonstrates an optimal balance between accuracy and efficiency, with respect to memory consumption and processing speed. The architecture employed, termed mobile VGG, is founded on the principles of deeply separable convolution. The outcome of de-duplicating the American University in Cairo's (AUC) dataset for distracted driving detection reveals that the proposed mobile VGG architecture has just 2.2M parameters and achieves 95.50% accuracy on the AUC dataset with 38% less computing time than alternative methods.","PeriodicalId":276778,"journal":{"name":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"2010 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127350808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on reinforced corrosion monitoring system based on Multi-sensor data fusion and wireless sensor network","authors":"A. Yu, Ziyang Shang, Hongbing Sun, Hao Kuang","doi":"10.1109/AINIT59027.2023.10212580","DOIUrl":"https://doi.org/10.1109/AINIT59027.2023.10212580","url":null,"abstract":"This paper applies multi-sensor data fusion technology and wireless sensor network (WSN) to monitor and predict steel corrosion parameters in real-time. To overcome the difficulties and low accuracy in identifying rebar corrosion, this study selected five parameters for data fusion, including chloride ion concentration, pH value, rebar corrosion potential, and internal temperature and humidity of concrete. A three-level data fusion structure is designed with corresponding fusion algorithms chosen for each level. The primary fusion is completed through data cleaning and median average filtering methods, followed by using adaptive weighting algorithms to fuse sensor data of the same type to obtain parameter characteristics of the region. Finally, an improved PSO-BP neural network fuses the data from the previous level of fusion to achieve prediction of steel corrosion. Experimental results show that the steel corrosion monitoring system based on multi-sensor data fusion technology and WSN has higher reliability and accuracy compared to traditional corrosion monitoring methods.","PeriodicalId":276778,"journal":{"name":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114668347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Weisen Deng, Jizhuang Hui, Kai Ding, Haixin Zhang, Shaowei Zhi
{"title":"Research on the Robustness of Degree-Fitness-Distance Evolutionary Networks Based on Response Surface Methodology","authors":"Weisen Deng, Jizhuang Hui, Kai Ding, Haixin Zhang, Shaowei Zhi","doi":"10.1109/AINIT59027.2023.10212689","DOIUrl":"https://doi.org/10.1109/AINIT59027.2023.10212689","url":null,"abstract":"This paper introduces a Degree-Fitness-Distance (DFD) evolutionary network, utilizing Response Surface Methodology to investigate the impact of degree strength, fitness strength, distance strength, and their interactions on the robustness of the DFD network. The regression equation was determined, followed by a variance analysis of different factors affecting the target response. The results show that the influence of different factors on the size of the largest connected component and the overall efficiency of the network are in the following order: degree strength > fitness strength > distance strength. When degree strength is 1, fitness strength is 2, and distance strength is 3, the size of the largest connected component of the network and the overall efficiency reach their peak values, respectively at 56.57% and 10.15%. Multi-objective optimization was performed on the DFD network; when degree strength is 1, fitness strength is 1, and distance strength is 3, the predicted size of the largest connected component is 58.39%, and the overall efficiency is 10.37%. These figures deviate by approximately 5% from the actual values, which demonstrates that the predictive model possesses high accuracy and reliability.","PeriodicalId":276778,"journal":{"name":"2023 4th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT)","volume":"170 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114396576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}