{"title":"Local Perception based Auxiliary Neural Network for Object Tracking","authors":"Qing Lei, Dongyun Xu, Yun Gao","doi":"10.1145/3461353.3461392","DOIUrl":"https://doi.org/10.1145/3461353.3461392","url":null,"abstract":"In recent years, tracking algorithms based on deep learning frameworks in the field of object tracking have attracted much attention. The focus of their research is to improve the performance of algorithms by designing deep network models and constructing training data sets. However, this type of algorithm usually uses an end-to-end method to output the predicted target position, and the predicted result always has a certain deviation from the real target position. This paper proposes a object tracking algorithm based on local perception-assisted neural network. The algorithm expands the training dataset. The expanded dataset adds training samples with partial target content and enhances the training label. Train an auxiliary neural network model with local perception capabilities, when the predicted result of the tracking algorithm deviates from the real position, the auxiliary neural network model is used to offset the tracking result to promote the accuracy of the tracking result. Experiments on the VOT2018 and VOT2016 benchmark datasets show that compared with the 6 mainstream algorithms in the object tracking field, the algorithm in this paper has better tracking performance under the three indicators of EAO, accuracy, robustness.","PeriodicalId":114871,"journal":{"name":"Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130543122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"HMA: An Efficient Training Method for NLP Models","authors":"Yuetong Yang, Zhiquan Lai, Lei Cai, Dongsheng Li","doi":"10.1145/3461353.3461384","DOIUrl":"https://doi.org/10.1145/3461353.3461384","url":null,"abstract":"Nowadays, deep learning has been widely used for solving natural language processing (NLP) problems. Embedding matrices are common-used in the NLP deep models for automatical feature learning. However, the sparsity of embedding matrices makes it challenging to efficiently train the NLP models in data parallelism. When training with synchronous optimization methods, the aggregation on sparse gradients brings high communication cost and low scalability for distributed training. In this paper, we combine Model Average (MA) and synchronous optimization methods together, and propose HMA, a hybrid training method for NLP deep models. Furthermore, we implement HMA method in Horovod+TensorFlow training framework and conduct experimental evaluation with representative NLP models. For NLP models with a large number of sparse parameters, HMA saves over 30% wall-clock time compared with the state-of-the-art distributed training framework, while maintaining the same final training loss.","PeriodicalId":114871,"journal":{"name":"Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132975858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reversible Data Hiding in Encrypted Images Based on Bit-plane Rearrangement and Huffman Coding","authors":"Mengqi Liu, Tiegang Gao","doi":"10.1145/3461353.3461372","DOIUrl":"https://doi.org/10.1145/3461353.3461372","url":null,"abstract":"In this paper, an improved reversible data hiding in encrypted image based on bit-plane rearrangement and Huffman coding is proposed. In the scheme, the eight bit-planes of the image are first rearranged using sixteen image-based rearrangement ways. After that, sixteen one-dimensional bit-streams are generated from each bit-plane, and then, the numbers of consecutive 0s or consecutive 1s in a bit-stream are used to form a sequence, the generated sequence is then encoded into a binary stream by Huffman coding, which is the compressed bit-stream. XOR-encryption and scrambling operation will be applied to the compressed image to ensure security. The experimental results show that our scheme can achieve large embedding capacity and high security.","PeriodicalId":114871,"journal":{"name":"Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134178632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"KABI: Class-Incremental Learning via knowledge Amalgamation and Batch Identification","authors":"Caixia Li, Wenhua Xu, Xizhu Si, Ping Song","doi":"10.1145/3461353.3461367","DOIUrl":"https://doi.org/10.1145/3461353.3461367","url":null,"abstract":"In class-incremental learning setting, classes are typically presented batch by batch over time. Incremental learning often suffers from catastrophic forgetting: the performance on previous classes abruptly degrades when adapting a model to new classes. We find that incremental models trained using knowledge distillation are skilled at discriminating classes within a batch, whereas they have confusion among classes in different batches. We propose a class-incremental learning approach with knowledge amalgamation and batch identification (KABI), which can effectively alleviate catastrophic forgetting. The idea is to first train an expert model for new classes at current state, and then train an amalgamation model by amalgamating knowledge from the expert model and the amalgamation model of previous state to discriminate different classes within a batch, and particularly train a batch identifier to discriminate different batches. We conduct extensive experiments on three datasets: MNIST, CIFAR-100, ILSVRC 2012, and show that KABI outperforms the second-best approach by 1.29%, 14.26% and 21.51% respectively. Surprisingly, classification accuracies of our approach are even sometimes higher than the oracle results which is obtained by training a model using all training samples from all classes.","PeriodicalId":114871,"journal":{"name":"Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122501898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jingyi Ge, Yi Wang, Jiayi Li, Huiwen Bai, Linsheng Liu, Shengfa Wang, Xinwei Xue, Fengqi Li
{"title":"A Reinforcement Learning-based Path Planning Method for Complex Thin-walled Structures in 3D Printing","authors":"Jingyi Ge, Yi Wang, Jiayi Li, Huiwen Bai, Linsheng Liu, Shengfa Wang, Xinwei Xue, Fengqi Li","doi":"10.1145/3461353.3461382","DOIUrl":"https://doi.org/10.1145/3461353.3461382","url":null,"abstract":"Path planning is an important part of the 3D printing process. The optimized path planning method can improve not only effect of the molding but also the efficiency of printing process. However, traditional path planning methods are not satisfactory in 3D printing, especially when printing the entities with complex thin-wall structures. We propose an intelligent path planning method named Q-Path, based on reinforcement learning for complex thin-walled structures. We first convert the path planning task to a full-path traversing problem. Then we use the Q-learning algorithm to find the optimal solution with the constraints of 3D printing, such as the minimum number of lifts and turns of the print head. Experimental results show that the proposed methods are superior to the traditional methods in printing complex thin-walled structures.","PeriodicalId":114871,"journal":{"name":"Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116536628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On the Innovation of Foreign Language Teaching in the Era of Artificial Intelligence","authors":"Jinxiang Xue","doi":"10.1145/3461353.3461355","DOIUrl":"https://doi.org/10.1145/3461353.3461355","url":null,"abstract":"As human society steps into the era of artificial intelligence, artificial intelligence has a profound impact on education, which provides necessary conditions and have new requirements for the innovation of foreign language teaching. The innovation of foreign language teaching in the era of artificial intelligence is mainly reflected in four aspects: the innovation of educational organization form, the innovation of education and teaching mode, the innovation of teachers’ skills and concepts, and the innovation of teaching resources selection and application. To improve the level of campus intelligence, to carry out highly customized learning, to create a smart classroom, and to establish a collaborative relationship between teachers and artificial intelligence are the ways and methods of foreign language teaching innovation. In this process, society, schools, teachers and students should cooperate well and work closely to create a better and new situation of foreign language teaching in the era of artificial intelligence.","PeriodicalId":114871,"journal":{"name":"Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130542090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chengjin Lyu, Patrick Heyer-Wollenberg, A. Munir, L. Platisa, C. Micheloni, B. Goossens, Wilfried Philips
{"title":"Visible-Thermal Pedestrian Detection via Unsupervised Transfer Learning","authors":"Chengjin Lyu, Patrick Heyer-Wollenberg, A. Munir, L. Platisa, C. Micheloni, B. Goossens, Wilfried Philips","doi":"10.1145/3461353.3461369","DOIUrl":"https://doi.org/10.1145/3461353.3461369","url":null,"abstract":"Recently, pedestrian detection using visible-thermal pairs plays a key role in around-the-clock applications, such as public surveillance and autonomous driving. However, the performance of a well-trained pedestrian detector may drop significantly when it is applied to a new scenario. Normally, to achieve a good performance on the new scenario, manual annotation of the dataset is necessary, while it is costly and unscalable. In this work, an unsupervised transfer learning framework is proposed for visible-thermal pedestrian detection tasks. Given well-trained detectors from a source dataset, the proposed framework utilizes an iterative process to generate and fuse training labels automatically, with the help of two auxiliary single-modality detectors (visible and thermal). To achieve label fusion, the knowledge of daytime and nighttime is adopted to assign priorities to labels according to their illumination, which improves the quality of generated training labels. After each iteration, the existing detectors are updated using new training labels. Experimental results demonstrate that the proposed method obtains state-of-the-art performance without any manual training labels on the target dataset.","PeriodicalId":114871,"journal":{"name":"Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127257431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Soft-Gated Self-Supervision Network for Action Reasoning: Soft-Gated Self-Supervision Network with Attention Mechanism and Joint Multi-Task Training Strategy for Action Reasoning","authors":"Shengli Wang, Lifang Wang, Liang Gu, Shichang He, Huijuan Hao, Meiling Yao","doi":"10.1145/3461353.3461377","DOIUrl":"https://doi.org/10.1145/3461353.3461377","url":null,"abstract":"Behavior decision-making in the dialogue domain is the key to the success of the dialogue system. Existing machine learning algorithm based on statistics and an deep learning algorithm based on pre-training are difficult to adapt to all tasks. Therefore, this paper proposes a multi-task classification model based on soft gating. Firstly, based on the existing models, a multi-task model suitable for specific fields is designed, which makes the model fully consider the characteristics of the data set itself. Secondly, soft gating is used to divide the model to reduce the interaction between model parameters. Then, we manually annotate and extend the existing session behavior data set. Finally, the trained model is used to recognize dialogue behavior. The experimental results show that the accuracy of the score is 89.1%. The experimental results show that the performance of SGSAM better than other models.","PeriodicalId":114871,"journal":{"name":"Proceedings of the 2021 5th International Conference on Innovation in Artificial Intelligence","volume":"108 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127328312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}