{"title":"Sequence Recognition of Scene Text Based on CRNN and CTPN Models","authors":"Yiyi Liu","doi":"10.1145/3573428.3573462","DOIUrl":"https://doi.org/10.1145/3573428.3573462","url":null,"abstract":"Image-based sequence recognition has lately emerged as a prominent study subject in the science of computer vision, while text detection and identification in natural situations has emerged as an active research field. Based on scene text data, this paper addresses the theory of deep learning-based CRNN and CTPN models and the process of processing text. Using CRNN, text recognition can be turned into a time-dependent sequence learning issue, which is commonly employed for indeterminate-length text sequences. Contextual relationships between text images are learned using BLSTM and CTC, thus effectively improving text recognition accuracy and making the model more robust. It also excels in text recognition tests for wordless and lexical-based scenes, as it is not constrained by any predefined language. It produces a more efficient, but smaller, model that is more suited to real-world settings. CRNN recognition accuracy is lower for short texts with large morphological changes, such as artistic words, or texts with large changes in natural scenes. Because of the Anchor setting, CTPN can only detect horizontally distributed text, but a small improvement can detect vertical text by adding horizontal Anchor. As a result of the limitations of the framework, the irregularly inclined text can be detected very broadly.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115426595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xiaofan Liu, Zhiming Chen, Xiaoran Li, Xinghua Wang, Lei Zhang
{"title":"Design of a Time Detector with Adjustable Resolution","authors":"Xiaofan Liu, Zhiming Chen, Xiaoran Li, Xinghua Wang, Lei Zhang","doi":"10.1145/3573428.3573525","DOIUrl":"https://doi.org/10.1145/3573428.3573525","url":null,"abstract":"With the continuous improvement of integrated circuit technology, the time detector, especially one with adjustable resolution, has a good development prospect. In this paper, a multi-bit time detector with adjustable resolution and configurable measurement range is presented, whose timing link composed of identical timing units is similar to ring oscillator. To adjust the resolution, the current and link of the timing unit are controlled by control code. The proposed time detector is implemented in CMOS 180nm technology. Simulation results show that the measuring range can reach the microsecond level under a 1.8-V supply. Meanwhile, the resolution corresponding to the control code 00, 01, 10 and 11 are 13.3ns, 25.8ns, 39.7ns and 48.5ns, respectively. Therefore, there is a linear relationship between the resolution and the value of control code.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"5 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115504852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"3D reconstruction based on monocular image sequences","authors":"Shuo Dai, Changxin Nai, Peng Wang","doi":"10.1145/3573428.3573686","DOIUrl":"https://doi.org/10.1145/3573428.3573686","url":null,"abstract":"Aiming at the complex operation of active 3D reconstruction technology, which is easily affected by external environment and equipment, this paper adopts a series of monocular image sequences taken by cell phone cameras from different angles to realize 3D scene reconstruction. Firstly, Structure-From-Motion (SFM) algorithm is used for sparse point cloud reconstruction, and the similarities and differences between incremental and global SFM algorithms for sparse reconstruction are compared; secondly, Multi-View-Stereo (MVS) is used for dense point cloud reconstruction, and finally, the surface and texture information of the object is recovered. The surface and texture information of the objects are recovered to achieve the reconstruction of 3D scenes. The experimental results show that the method can achieve better 3D reconstruction of monocular image sequences.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"131 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114662291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Faster Time Series Data Prediction Method Based on LSTM","authors":"Xu Song","doi":"10.1145/3573428.3573447","DOIUrl":"https://doi.org/10.1145/3573428.3573447","url":null,"abstract":"The complex structure of LSTM increases the number of parameters and leads to an increase in training time. We propose an improved prediction method for time series data based on LSTM, which can significantly reduce the training time while ensuring a certain prediction accuracy. Our method first uses wavelet decomposition to decompose the data into low-frequency data and high-frequency data and then uses LSTM to learn the characteristics of low-frequency data, use Random Forest to learn the characteristics of high-frequency data, and finally uses wavelet reconstruction to reconstruct the predictions of LSTM and Random Forest for different frequency data into prediction data. Test results on datasets in three different domains show that our method can predict the overall trend of time series data well, but the prediction results for local details are slightly worse. Compared with using LSTM directly, our method increases the average mae by 15.52% and the average mse by 31.10% on the three datasets but reduces the average training time by 69.66%.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124739522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Neural Network Models Performance Analysis of Large-Scale Text Recognition∗","authors":"Yunchao Zou","doi":"10.1145/3573428.3573742","DOIUrl":"https://doi.org/10.1145/3573428.3573742","url":null,"abstract":"The continuous development of computer technology leads to booming image data and throws a tricky question to scholars about how to process these data intelligently. Luckily, it is a dream come true to the recognition of images with the help of progressive deep-learning technology. Nowadays, image recognition based on neural networks is widely used, and recognizing a large scale of text information is one of the critical applications. Therefore, this paper will first review the development history of image recognition technology and introduce the concept of the convolutional neural network model. After that, it will analyze the performance of multiple algorithms in recognizing a large amount of text information based on Reginal Convolutional Neural Network, Spatial Pyramid Pooling, Fast Region Convolutional Neural Network, and Faster Convolutional Neural Network. Last but not least, it also points out the prospect of the future development direction of the current image processing technology and its defections. Analysis shows that the biggest drawback of deep learning technology is its dependence on training data. More specifically, when the training data is incomplete, it will be hard for the network model to maintain its recognition accuracy, especially in large-scale text recognition. To further improve the image recognition technology, we should put the effort into constructing a deep neural network model, optimize the training data, reduce the model training parameters, and improve the model accuracy.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124840899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Liu, Xilong Pei, Jiali Wang, Jing-song Huang, Jian-Mei Wang, Ning Wang
{"title":"An Intelligent Cockpit System HMI Engine Based on COMO","authors":"S. Liu, Xilong Pei, Jiali Wang, Jing-song Huang, Jian-Mei Wang, Ning Wang","doi":"10.1145/3573428.3573528","DOIUrl":"https://doi.org/10.1145/3573428.3573528","url":null,"abstract":"ICS (Intelligent Cockpit System) is a Human-Machine Interface (HMI) technology that integrates In-Vehicle Infotainment (IVI), Head Up Display (HUD), and Navigation (NAVI). The HMI technology stack in ICS involves engineering human-machine interaction creativity, component-based graphics systems and vehicle-level hardware system, etc. The HMI engine we developed is a middleware that implements human-machine interaction computing in in-vehicle electronic equipment, and the engine is also a software stack for graphics computing, scheduling management of computing devices in the cockpit and software runtime functions, it operates the display hardware through OpenGL. This paper introduces a COMO-based ICS HMI technology with functional safety SOA architecture. With the support of COMO RPC, devices are abstracted as services and integrated together. Under the premise of ensuring functional safety, it has a variety of ICS-oriented 2D, 3D controls with running state and design state, suitable for creative people and automotive engineers to work together.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123791838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Personalized recommendation algorithm of books based on the diffusion of reader's interest","authors":"Lei Min","doi":"10.1145/3573428.3573733","DOIUrl":"https://doi.org/10.1145/3573428.3573733","url":null,"abstract":"The ever-growing books help readers acquire knowledge faster than ever before. But the huge scale of these resources also easily makes people fall into the dilemma of \"Information-Explosion\", which prevents the reader from easily locating the books that are really suitable for them. To alleviate this dilemma, we analyzes the principle of the \"Networks-Based-Inference\" algorithm (NBI), which is a classical heuristic algorithm for recommendation. We also proposes an improved algorithm—NBI algorithm using Interest Diffusion (NBI-ID), that derives from this classical algorithm. This improved algorithm inherits the advantages of NBI method in simplicity and effectiveness, and optimizes the allocation of initial information in the process of information diffusion with an interest related indicator. Thus increasing the efficiency of the recommendation results. Experiments on the GoodBooks dataset show that the proposed algorithm improves in accuracy, recall and diversity compared to the classic NBI, CF (Collaborative Filtering) and GRM (Global Ranking Method) algorithms.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125283548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on financial social public opinion communication model based on variation mechanism","authors":"Maojun Huang, Mei Hong, Lin Dong, Dayu Yuan","doi":"10.1145/3573428.3573440","DOIUrl":"https://doi.org/10.1145/3573428.3573440","url":null,"abstract":"In the context of rapid development of the Internet, the place of spreading financial public opinion is converted from traditional offline places to major online social platforms. Mastering the development mechanism of financial social opinion dissemination on online social media can effectively estimate the length of influence of public opinion and the scope of affected people, and provide effective guidance to relevant staff. Based on the Susceptible Infected Recovered Model, this paper divides the people involved in opinion diffusion into commenters and discussers, and introduces the variation mechanism to design the Susceptible-comment-discussion-removal model, then simulates the model to study the effects of different initial states and parameters on the model, and finally verifies the validity of the model by combining the real data of stock bars. The simulation experiments and validation show that the model can effectively describe the spread of public opinion among the user groups of financial social platforms when it occurs, and provide a valid reference for related workers. However, the content of public opinion, the personal influence of communicators, and the lag effect of communication all have an impact on communication, and these issues need to be addressed in future research.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"s1-10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125538854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Daoshun Xie, Zongyue Wang, Guorong Cai, Qiming Xia, Yidong Chen, S. Yang
{"title":"Monocular Camera Video Based Reconstruction of 3D human model","authors":"Daoshun Xie, Zongyue Wang, Guorong Cai, Qiming Xia, Yidong Chen, S. Yang","doi":"10.1145/3573428.3573670","DOIUrl":"https://doi.org/10.1145/3573428.3573670","url":null,"abstract":"This paper addresses a method to obtain an accurate 3D human body model and a photorealistic free-view image of an arbitrary person from a monocular camera video. Recent works has shown that it is possible to reconstruct a human model at a level of detail from a single image. However, inferring a complete 3D human model from a network model will be ill-posed if rely on a single photograph of a person. In order to reasonably infer the 3D human model, we propose method based on implicit field representation to integrate the information of video frames by a set of structured latent code. The core of our method is to construct the implicit field by relatively sparse structured latent code. Meanwhile, align the vertices of the parametric human model and structured latent code to the same coordinate system. Extensive experimental results on monocular datasets demonstrate the effectiveness of our approach in generating accurate 3D human models. Our method utilizes a monocular camera to obtain a 3D model which enables consumers create their personality digital model.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122387290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on a Simulation Algorithm for Display Effect of Rotating LED Device","authors":"Jianan Lin, Xinkai Weng","doi":"10.1145/3573428.3573509","DOIUrl":"https://doi.org/10.1145/3573428.3573509","url":null,"abstract":"Rotating LED device, because it can achieve the screen floating in the air display effect, is becoming more and more popular in everyday applications. It is necessary to simulate its display effect. The simulation algorithm of the display effect of the rotating LED device can obtain the predicted display effect in the software simulation after the hardware and software design of the rotating LED lamp is completed, without waiting for the completion of the hardware entity. After testing, the display effect predicted by the simulation algorithm is very close to the actual effect. So it has good practical value.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122536864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}