{"title":"Handwritten String Recognition Based on YOLOv4 and CRNN on Different Deep Learning Frameworks","authors":"X.-W. Yin, Zhongli Ma, Xu Chen, Qiaoling Zhou","doi":"10.1109/acait53529.2021.9731149","DOIUrl":"https://doi.org/10.1109/acait53529.2021.9731149","url":null,"abstract":"Character recognition is one of the most important contents of pattern recognition research. Among them, due to the instability and continuity of individual differences in handwritten strings, it is difficult to extract features in recognition. At present, the recognition of handwritten strings is based on single character segmentation. However, due to the difficulty of character segmentation of handwritten string, the accuracy of recognition based on segmentation method is relatively low. In this paper, we use the non segmentation method based on RNN for character recognition, combined with the time sequence characteristics to improve the recognition accuracy of continuous strings. Firstly, handwritten string detection methods are studied respectively, based on YOLOv4, YOLOv4-Tiny model and OpenCV, the detection methods are tested; then the handwritten string recognition methods are studied, the ‘CNN+RNN+CTC’ pattern is used to calculate loss function model, thus can identify undivided strings, and then through CTC decoding, string results with higher accuracy can be obtained; finally, after testing was carried out, the experimental result shows that the method proposed in this paper can solve the problem of stylus and variable length of handwritten strings, therefore improve the efficiency of converting handwritten manuscripts into electronic manuscripts.","PeriodicalId":173633,"journal":{"name":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130960679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jianhan Pan, Teng Cui, Mingjing Du, Qingyang Zhang, Bingbing Song, Qiaoli Qu
{"title":"Multiple Latent Spaces Learning for Cross-Domain Text Classification","authors":"Jianhan Pan, Teng Cui, Mingjing Du, Qingyang Zhang, Bingbing Song, Qiaoli Qu","doi":"10.1109/acait53529.2021.9730891","DOIUrl":"https://doi.org/10.1109/acait53529.2021.9730891","url":null,"abstract":"When the training data and test data are drawn from similar but different data distributions, transfer learning (TL) can be exploited to learn a consistent distribution for knowledge transfer. To reduce distribution differences, some recent transfer learning approaches typically build potential feature spaces to exploit the potential information and learn multiple high-level concepts to model a latent potential shared structure. However, only utilizing the potential information in one latent space will neglect some other potential information existing in different latent feature spaces. And this neglected potential information may also help model potential structures shared as bridges. In this paper, we propose Multiple Latent Spaces Learning (MLSL), a novel approach which mines a massive amount of potential information on multiple latent spaces to construct a shared bridge (or multiple bridges) across domains by learning different high-level concepts. Our strategy can dig out the latent information that exists in the latent space ignored by the previous methods to build a knowledge transfer bridge. Compared with the TL method that only learns a latent space, our strategy is more suitable for actual scenarios, and the use of data is also fuller. In addition, an iterative algorithm is developed to solve the optimization problem. Finally, the system test on benchmark data sets shows the superiority of the MLSL method.","PeriodicalId":173633,"journal":{"name":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"5 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130905188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Qiwei Ye, Linbo Qiao, Hongyi Chen, Q. Tao, Jingjing Xiao
{"title":"Automatic Cardiomyopathy Diagnosis with a Cost-sensitive Ensemble Classifier","authors":"Qiwei Ye, Linbo Qiao, Hongyi Chen, Q. Tao, Jingjing Xiao","doi":"10.1109/acait53529.2021.9731304","DOIUrl":"https://doi.org/10.1109/acait53529.2021.9731304","url":null,"abstract":"This paper proposed a cost-sensitive ensemble classifier for automatic cardiomyopathy diagnosis using features extracted from cardiac magnetic resonance images. However, with numerous features extracted from images, it is hard for a single classifier to achieve accurate prediction. In contrast, an ensemble classifier combines multiple weak classifiers which could benefit from each others and improve the performance. Therefore, we proposed a cost-sensitive ensemble classifier assembling five heterogeneous classifiers: logistic regression (LR), Gaussian naive bayes (GNB), support vector machine (SVM), multi-layer perception(MLP), and convolutional neural network(CNN). The weight of each classifier was determined according to the special cost-sensitive function. In the experiment, the proposed method was evaluated on a publicly available Automated Cardiac Diagnosis Challenge (ACDC) dataset [1], where the proposed ensemble classifier achieves a considerable improvement.","PeriodicalId":173633,"journal":{"name":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"38 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114011207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Types and Evolutionary Boundaries of Agents","authors":"Feng Liu, Ying Liu","doi":"10.1109/acait53529.2021.9731137","DOIUrl":"https://doi.org/10.1109/acait53529.2021.9731137","url":null,"abstract":"Through the analysis of the characteristics of natural intelligent systems, the three laws of intelligence are exploringly proposed. The most important point is that any agent is a system with knowledge input, output, (dynamic) storage and creation abilities Since an agent has four abilities, and each ability has three states, including zero, infinity, and finite (between zero and infinity), 81 types of agents can be formed. Therefore, it can be inferred that Alpha point (No. 1), where all the four abilities of an agent are zero, and Omega point (No. 81), where all four abilities are infinite are the boundary of the evolution of Agent. This paper designs experiments to detect these 81 agents, and finds that 11 of them can be confirmed by experiments at present. Through the discussion of the states of Agents No. 64 to No. 81, fresh thinking is proposed for the generation and realization of the paradox.","PeriodicalId":173633,"journal":{"name":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121920077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Alignment Analysis of English Chinese Bilingual Corpora based on CRFS Model","authors":"Miao Wu","doi":"10.1109/acait53529.2021.9731293","DOIUrl":"https://doi.org/10.1109/acait53529.2021.9731293","url":null,"abstract":"Bilingual corpus alignment analysis, as an important research method of machine translation, is of great value as a source of translation knowledge. In view of this, this study carries out the research on English Chinese bilingual corpus alignment based on CRFs model. That is, on the basis of chunk aligned corpus, combined with the respective language characteristics of English and Chinese, the word alignment between chunks is realized with the help of CRFs model. The results show that the accuracy and recall of maximum entropy are 45.77% and 45.58% respectively, the accuracy and recall of log linear model are 45.73% and 45.68% respectively, while the accuracy and recall of CRFs model are 47.99% and 47.94%. It indicates that CRFs model can effectively alleviate the asymmetric problem of English and Chinese database alignment, also the alignment error rate of CRFs model is reduced accordingly. Therefore, it has a good effect on the alignment of English and Chinese bilingual corpora.","PeriodicalId":173633,"journal":{"name":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115555862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xiang Chen, Mengxing Huang, Siling Feng, Yuan-hsin Chen, Wenquan Li
{"title":"Automatic Classification of User Requirements Information Based on Convolutional Neural Network","authors":"Xiang Chen, Mengxing Huang, Siling Feng, Yuan-hsin Chen, Wenquan Li","doi":"10.1109/acait53529.2021.9731258","DOIUrl":"https://doi.org/10.1109/acait53529.2021.9731258","url":null,"abstract":"User requirements information is semi-structured or unstructured data with sparse features and short lengths. Since manual labeling methods nowadays are time-consuming, which cannot meet increasing requirements for classifying massive user requirements information, an automatic classification method of user requirements information based on Convolutional Neural Networks (CNN) is proposed. Firstly, NLPIR word segmentation tool is used to preprocess word segmentation and remove stop words in user requirements information. Secondly the word vector is trained by word vector model (word2vec). Thirdly convolutional neural network is used to extract abstract features of text information. And finally these features are used as input,and the softmax classifier is used to achieve automatic classification of user requirements information. Compared with the traditional classification method based on probability and statistics, the classification accuracy of the automatic classification model of user requirements information based on CNN has been improved by 13.54%. The experimental results show that the CNN classification algorithm is more suitable for the automatic classification of the user requirements information.","PeriodicalId":173633,"journal":{"name":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123160092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on Extraction and Translation of English Public Signs in Tourist Attractions Based on Machine Vision","authors":"Rongjing Meng","doi":"10.1109/acait53529.2021.9731335","DOIUrl":"https://doi.org/10.1109/acait53529.2021.9731335","url":null,"abstract":"There are many English public signs in tourist attractions. Using digital imaging technology and computer technology to extract and translate these English public language is helpful for tourists to understand or warn the surrounding environment. This study constructs the recognition system based on machine vision technology and convolutional neural network (CNN) to extract, recognize and translate English public signs. The results show that the average recognition accuracy of the system is 98%; the average accuracy of translation is 96.5%. The above results show that the recognition system can effectively extract and translate English public signs, which is helpful for tourists to understand the information conveyed by scenic spots.","PeriodicalId":173633,"journal":{"name":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129844091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cross-Dataset Pose Estimation of Faces In The Wild","authors":"Mo Zhao, Ya Ma, Zhendong Li, Hao Liu","doi":"10.1109/acait53529.2021.9731187","DOIUrl":"https://doi.org/10.1109/acait53529.2021.9731187","url":null,"abstract":"In this paper, we propose a domain generalization method for cross-dataset pose estimation of faces captured in wild conditions. Conventional methods mainly devote efforts on extracting discriminative features to reason the three-dimension pose. Due to the large distribution discrepancies between widely-used synthetic training and real-world testing data, it is challenging to seek a domain-generalized feature space especially for the new test samples in real-world applications. To alleviate the influence of dataset bias, our model aims to learn the domain-invariant features across different domains. In detail, a carefully-designed domain discriminator is plugged to the features extracted from different domains, meanwhile the feature encoder is trained to enforce features from different domains confused by game-theorem iterations. With the adversarial manner, our model learns a generalized pose-relevant feature space shared across different domains. Extensive experimental results on the standard benchmark under the cross-dataset setting indicate the superiority of our method in comparisons with most state of the arts.","PeriodicalId":173633,"journal":{"name":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"115 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129153425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Guoqiang Li, Chong Chen, Zhenguo Song, Jun Wu, C. Deng, Zuoyi Chen
{"title":"Intelligent CNN-based Fault Diagnosis of Rotating Machinery with Small Fault Samples","authors":"Guoqiang Li, Chong Chen, Zhenguo Song, Jun Wu, C. Deng, Zuoyi Chen","doi":"10.1109/acait53529.2021.9731233","DOIUrl":"https://doi.org/10.1109/acait53529.2021.9731233","url":null,"abstract":"Up to now, convolution neural network (CNN) has been widely applied in fault diagnosis of rotating machinery. The CNN-based diagnostic methods often with the help of the fault data to implement the optimization. However, in real industrial applications, it is difficult and costly to obtain the fault data. In this paper, a ResNet-based diagnostic method combined a designed data transformation combination is proposed to achieve the fault diagnosis under small fault samples. Specifically, several data transformation of image are selected to deal with the small samples, where the parameter of the transformation is determined by the mutual information between the inputted sample and the corresponding transformed sample. Meanwhile, these obtained transformations are used as the input layer of the ResNet. Then, a fault diagnosis model is established by the constructed ResNet, and which is trained only by using small samples. Noting that the introduced data transformation is randomly used for the training samples to increase the complexity of the inputted samples in the training process for alleviating the overfitting risk. The bearing fault dataset is used to evaluated the effectiveness of the proposed method. From the experimental result, it is found the proposed has the capability to implement the effective fault diagnosis under small samples, and achieve a higher diagnostic performance than other existing methods.","PeriodicalId":173633,"journal":{"name":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115844684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Qian Liu, Zhiming Jiao, Fangbo Gong, Hong-chao Ji, Jie Chen
{"title":"Research on Transmission Line Fault Detection Method based on M-Apriori Algorithm","authors":"Qian Liu, Zhiming Jiao, Fangbo Gong, Hong-chao Ji, Jie Chen","doi":"10.1109/acait53529.2021.9731306","DOIUrl":"https://doi.org/10.1109/acait53529.2021.9731306","url":null,"abstract":"Ensuring the stability of transmission line is the key to ensure the normal operation of the whole power grid system. In order to realize intelligent detection of transmission line fault and big data analysis, a transmission line fault detection method based on improved M-Apriori optimization algorithm is proposed. Firstly, the transmission line fault detection index system is constructed, and the M-Apriori algorithm is optimized and improved based on the traditional Apriori algorithm. In order to verify the comprehensive performance of the algorithm, five common transmission line fault types are selected for simulation this time, and the execution time of Apriori and M-Apriori algorithms are compared and analyzed respectively when the number of things with the same support is different, the system degree is different, and the number of things is the same. The simulation results show that the improved M-apriori has better algorithm efficiency, better recognition rate than BP neural network algorithm, and can realize the automatic monitoring of transmission line fault.","PeriodicalId":173633,"journal":{"name":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125458314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}