{"title":"The Impact of Learning Rate Decay and Periodical Learning Rate Restart on Artificial Neural Network","authors":"Yimin Ding","doi":"10.1145/3460268.3460270","DOIUrl":"https://doi.org/10.1145/3460268.3460270","url":null,"abstract":"There is no denying that learning rate is one of the most important hyper-parameter for model training. In this paper, two typical strategies, namely learning rate decay and periodical learning rate restart are tested in artificial neural networks (ANN) and compared with the fixed learning rate. Experiments demonstrate that learning rate adjustment strategies surpass fixed learning rate in model training, including fast convergence, high validation accuracy and low training loss. Besides, periodical learning rate restart strategy tends to take fewer epochs than learning rate decay to get the same accuracy. Thus, increasing the learning rate appropriately will better fit the model and achieve excellent performance.","PeriodicalId":215905,"journal":{"name":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125337116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Learning-based Point Cloud Registration: A Short Review and Evaluation","authors":"Weixuan Tang, Danping Zou, Ping Li","doi":"10.1145/3460268.3460273","DOIUrl":"https://doi.org/10.1145/3460268.3460273","url":null,"abstract":"∗ Point cloud registration is an important task for range scan align-ment, pose estimation, and localization. Traditional point cloud registration methods rely on hand-craft descriptors, which are sometimes not so descriptive and make the pose solver easy to fail because of false matchings. Recently, many researchers seek to improve the traditional method by deep learning-based approach. In this paper, we summarize the main pipeline of point cloud registration in traditional and learning-based approaches. Then we review some of the recent start-of-art methods, mainly in the end-to-end learning approach. We also review the criteria used to evaluate the registration performance and give complete testing results, some of which are not provided by those papers.","PeriodicalId":215905,"journal":{"name":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133540295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on the application of artificial intelligence technology in the development of sports teaching video compression algorithm","authors":"Yibo Dai","doi":"10.1145/3460268.3460271","DOIUrl":"https://doi.org/10.1145/3460268.3460271","url":null,"abstract":"Abstract: in order to improve the performance of traditional sports teaching video compression algorithm, this paper selects spatial KL transform technology and GPCA segmentation technology as research tools, and proposes a new video compression algorithm. The diamond search method is introduced in the algorithm, and the resolution of video frame is completed efficiently. After XL transformation, the minimum error is determined. The test results show that the proposed algorithm not only has high SNR and more compressed frames, but also has high definition and compression efficiency.","PeriodicalId":215905,"journal":{"name":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127509174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DQ-DPS Data Partition Strategy Based on Distributed Machine Learning","authors":"Jiaming Wu, Zunhao Liu, Bowen Yang","doi":"10.1145/3460268.3460272","DOIUrl":"https://doi.org/10.1145/3460268.3460272","url":null,"abstract":"With the expansion of the data scale, machine learning develops from centralized to distributed. Generally, distributed machine learning uses parameter server architecture to train in synchronous mode. At this time, data samples are statically and symmetrically allocated to each computing node according to the batch size. Each worker trains synchronously and iterates until the model converges. However, due to the different number of resources at each compute node in a mixed-load scenario, the traditional data partition strategy is usually to statically configure batch size parameters or require manual setting of batch size parameters, which makes the computational efficiency of distributed machine learning model training operations inefficient, and the data adjustment for each node will have an impact on the accuracy of the model. To solve this problem, on the premise of ensuring the accuracy of the distributed machine learning model training task, this paper proposes an optimal configuration scheme for a batch size of distributed machine learning model training task data: a data partition strategy based on distributed machine learning (DQ-DPS). DQ-DPS solves the problem of low computational efficiency caused by static data partitioning, improves the computational efficiency of distributed machine learning tasks, and ensures the accuracy of distributed machine learning training model. Through a large number of experiments, we have proved the effectiveness of DQ-DPS. Compared with the traditional data partition strategy, DQ-DPS improves the computing efficiency of each training round by 38%.","PeriodicalId":215905,"journal":{"name":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122640222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An LSTM Approach for Predicting the Short-time Passenger Flow of Urban Bus","authors":"Yingying Xu, Kezhong Jin","doi":"10.1145/3460268.3460274","DOIUrl":"https://doi.org/10.1145/3460268.3460274","url":null,"abstract":"The advantages of LSTM as a deep learning neural network algorithm in time series prediction of passenger flow have gradually emerged. In addition to general passenger flow data that can be used for prediction, other context information can improve prediction performance. This paper proposed an LSTM approach for predicting the short-time passenger flow of urban bus, by using historical passenger data and other context information, e.g. weather type, holiday information and day of the week. The key parameters and structure of the long short-term memory (LSTM) neural network are deeply optimized. Adequate experiments with are conducted on the practical data of working day. The experimental result shows that the prediction of proposed LSTM outperforms the support vector regression (SVR) and k-nearest neighbor (KNN) algorithm. And the importing of weather data can improved performance of LSTM in the root mean squared error (RMSE) and the mean absolute percent error (MAPE).","PeriodicalId":215905,"journal":{"name":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","volume":"75 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131594458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Exploration for Generative Adversarial Networks Via Adding A Screening Model","authors":"Kangle Sun","doi":"10.1145/3460268.3460269","DOIUrl":"https://doi.org/10.1145/3460268.3460269","url":null,"abstract":"As an emerging deep learning model, generative adversarial networks has enough creativity and potential in the application and advanced studies. However, many problems should be tackled in its training process. Based on the exist studies of generative adversarial networks and relevent ideas of games theories, this article adds a new screening model for the framework of generative adversarial networks to solve vanishing gradient in the training process, which contains the function of filter and measurer. This model does not interfere training process, only find and delete discriminators which might cause vanishing gradient, and output a value to represent the progress of training process.","PeriodicalId":215905,"journal":{"name":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","volume":"2015 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121486288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Recursive relocation of lost intelligent agent by Aitkin acceleration method and extended Kalman filtering algorithm","authors":"Lei Sun, Xiao Yang, Guizhen Wang","doi":"10.1145/3460268.3460282","DOIUrl":"https://doi.org/10.1145/3460268.3460282","url":null,"abstract":"The state space model of signal and noise is used to find the missing position of multiple mobile intelligent agents. The estimation value of other mobile agents and the observed value at the present time are used to update the estimation of the state variable, and the estimation value of the missing position of mobile agents is obtained. In order to improve the convergence speed of the iterative method, the extended Kalman filtering algorithm and the aitkin accelerating convergence method are used to recurse the location of the mobile agent with all the lost positions.","PeriodicalId":215905,"journal":{"name":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129645657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"CPDD: A Cascaded-Parallel Defect Detector with Application to Intelligent Inspection in Substation","authors":"Han Sun, Jing Wang, Kunlun Li, Qingwei Zhang","doi":"10.1145/3460268.3460277","DOIUrl":"https://doi.org/10.1145/3460268.3460277","url":null,"abstract":"The intelligent inspection is a detection problem that aims to recognize abnormalities in substations. Defects acquired by various devices with small size, truncation, and similar appearance are easily confused, which biases the evaluation metrics. How to correctly explore the relationships between equipment and defects, and fully utilize results from different models is critical for this task. In this work, we propose a novel solution to these problems based on the cascaded-parallel defect detection (CPDD) algorithm. Specifically, it consists of two key components: (1) The cascaded model aims to mine the detailed relationships and filter out the illogical bounding boxes. This way can reduce the miss detection rate. (2) The parallel model is to fuse results from the mentioned cascaded model. It can utilize the information from these two-stage models and promote the detectable rate. Extensive empirical results on the dataset, acquired by our designed inspection system in different voltage-level substations, demonstrate the superiority of our proposed method. It can achieve state-of-the-art performance.","PeriodicalId":215905,"journal":{"name":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","volume":"386 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127465812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Novel UAV Aerial Vehicle Detection Method Based on Attention Mechanism and Multi-scale Feature Cross Fusion","authors":"Zhigang Hou, Jin Yan, Bo Yang, Zhiming Ding","doi":"10.1145/3460268.3460276","DOIUrl":"https://doi.org/10.1145/3460268.3460276","url":null,"abstract":"With the rapid development of artificial intelligence science, more and more researchers try to use deep learning to train neural networks and have achieved great success in object detection. Vehicle detection based on UAV image is a special field of object detection. Due to the low resolution of the vehicle object, complex background, and less image information, it is challenging to extract robust visual and spatial features from the depth network and accurately locate the object in complex scenes. In this paper, combining the characteristics of vehicles in aerial images, we design a novel feature pyramid network called channel-spatial attention fused feature pyramid network (CSF-FPN) with Faster R-CNN as the basic framework. In CSF-FPN, a hybrid attention mechanism and feature cross-fusion module are introduced, so that feature maps can be generated with enhanced spatial and channel interdependence to extract richer semantic information. After our CSF-FPN is integrated into the Faster R-CNN network, the detection performance of small objects is greatly improved. The experimental results based on the VEDIA Dataset showed that the proposed framework could effectively detect the vehicle in large scene azimuth. Compared with the existing advanced methods, mAP and F1-score are improved.","PeriodicalId":215905,"journal":{"name":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115586130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Intelligent Agent NLP-based Customer Service System","authors":"C. Huang","doi":"10.1145/3460268.3460275","DOIUrl":"https://doi.org/10.1145/3460268.3460275","url":null,"abstract":"In Nowadays, communication has been more and more important in our lives. It is difficult for us to complete many things without it. With the support of the developing technology, communication is able to become faster and easier. The human-to-machine communication is a creative application used in field of research and industry. To improve the interaction between human and machine, a communication system is specially designed. The technology of natural language processing is implemented in the system to handle the understanding and generation of the chatting language. For higher efficiency, the system is enhanced by designing in the multi-agent system. Which let agents deal with the detailed tasks in the process of NLP by interacting and integrating them together. The system is based on sending and receiving so that it is able to communicate asking question and responding answer. Provide a textbook to the Chatbot, it is capable to understanding the content of book. And then serve as a teaching assistant to help student solve their problems by answering their questions. In this thesis, we will reveal the design and implementation of different parts in the communication system. Which consists of natural language processing system, multi-agent system, user interface (Model-View-Controller Frame), and knowledge base. System is implemented by Java programme language.","PeriodicalId":215905,"journal":{"name":"Proceedings of the 2021 2nd International Conference on Artificial Intelligence in Electronics Engineering","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123397111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}