Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering最新文献

Sequence Recognition of Scene Text Based on CRNN and CTPN Models 基于CRNN和CTPN模型的场景文本序列识别

Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering Pub Date : 2022-10-21 DOI: 10.1145/3573428.3573462

Yiyi Liu

{"title":"Sequence Recognition of Scene Text Based on CRNN and CTPN Models","authors":"Yiyi Liu","doi":"10.1145/3573428.3573462","DOIUrl":"https://doi.org/10.1145/3573428.3573462","url":null,"abstract":"Image-based sequence recognition has lately emerged as a prominent study subject in the science of computer vision, while text detection and identification in natural situations has emerged as an active research field. Based on scene text data, this paper addresses the theory of deep learning-based CRNN and CTPN models and the process of processing text. Using CRNN, text recognition can be turned into a time-dependent sequence learning issue, which is commonly employed for indeterminate-length text sequences. Contextual relationships between text images are learned using BLSTM and CTC, thus effectively improving text recognition accuracy and making the model more robust. It also excels in text recognition tests for wordless and lexical-based scenes, as it is not constrained by any predefined language. It produces a more efficient, but smaller, model that is more suited to real-world settings. CRNN recognition accuracy is lower for short texts with large morphological changes, such as artistic words, or texts with large changes in natural scenes. Because of the Anchor setting, CTPN can only detect horizontally distributed text, but a small improvement can detect vertical text by adding horizontal Anchor. As a result of the limitations of the framework, the irregularly inclined text can be detected very broadly.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115426595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Design of a Time Detector with Adjustable Resolution 可调分辨率时间检测器的设计

Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering Pub Date : 2022-10-21 DOI: 10.1145/3573428.3573525

Xiaofan Liu, Zhiming Chen, Xiaoran Li, Xinghua Wang, Lei Zhang

引用次数: 0

3D reconstruction based on monocular image sequences 基于单眼图像序列的三维重建

Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering Pub Date : 2022-10-21 DOI: 10.1145/3573428.3573686

Shuo Dai, Changxin Nai, Peng Wang

引用次数: 0

A Faster Time Series Data Prediction Method Based on LSTM 基于LSTM的时间序列数据快速预测方法

Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering Pub Date : 2022-10-21 DOI: 10.1145/3573428.3573447

Xu Song

引用次数: 0

Neural Network Models Performance Analysis of Large-Scale Text Recognition∗ 大规模文本识别的神经网络模型性能分析*

Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering Pub Date : 2022-10-21 DOI: 10.1145/3573428.3573742

Yunchao Zou

{"title":"Neural Network Models Performance Analysis of Large-Scale Text Recognition∗","authors":"Yunchao Zou","doi":"10.1145/3573428.3573742","DOIUrl":"https://doi.org/10.1145/3573428.3573742","url":null,"abstract":"The continuous development of computer technology leads to booming image data and throws a tricky question to scholars about how to process these data intelligently. Luckily, it is a dream come true to the recognition of images with the help of progressive deep-learning technology. Nowadays, image recognition based on neural networks is widely used, and recognizing a large scale of text information is one of the critical applications. Therefore, this paper will first review the development history of image recognition technology and introduce the concept of the convolutional neural network model. After that, it will analyze the performance of multiple algorithms in recognizing a large amount of text information based on Reginal Convolutional Neural Network, Spatial Pyramid Pooling, Fast Region Convolutional Neural Network, and Faster Convolutional Neural Network. Last but not least, it also points out the prospect of the future development direction of the current image processing technology and its defections. Analysis shows that the biggest drawback of deep learning technology is its dependence on training data. More specifically, when the training data is incomplete, it will be hard for the network model to maintain its recognition accuracy, especially in large-scale text recognition. To further improve the image recognition technology, we should put the effort into constructing a deep neural network model, optimize the training data, reduce the model training parameters, and improve the model accuracy.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124840899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An Intelligent Cockpit System HMI Engine Based on COMO 基于COMO的智能座舱系统人机界面引擎

Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering Pub Date : 2022-10-21 DOI: 10.1145/3573428.3573528

S. Liu, Xilong Pei, Jiali Wang, Jing-song Huang, Jian-Mei Wang, Ning Wang

引用次数: 0

Personalized recommendation algorithm of books based on the diffusion of reader's interest 基于读者兴趣扩散的图书个性化推荐算法

Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering Pub Date : 2022-10-21 DOI: 10.1145/3573428.3573733

Lei Min

引用次数: 0

Research on financial social public opinion communication model based on variation mechanism 基于变异机制的金融社会舆情传播模型研究

Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering Pub Date : 2022-10-21 DOI: 10.1145/3573428.3573440

Maojun Huang, Mei Hong, Lin Dong, Dayu Yuan

{"title":"Research on financial social public opinion communication model based on variation mechanism","authors":"Maojun Huang, Mei Hong, Lin Dong, Dayu Yuan","doi":"10.1145/3573428.3573440","DOIUrl":"https://doi.org/10.1145/3573428.3573440","url":null,"abstract":"In the context of rapid development of the Internet, the place of spreading financial public opinion is converted from traditional offline places to major online social platforms. Mastering the development mechanism of financial social opinion dissemination on online social media can effectively estimate the length of influence of public opinion and the scope of affected people, and provide effective guidance to relevant staff. Based on the Susceptible Infected Recovered Model, this paper divides the people involved in opinion diffusion into commenters and discussers, and introduces the variation mechanism to design the Susceptible-comment-discussion-removal model, then simulates the model to study the effects of different initial states and parameters on the model, and finally verifies the validity of the model by combining the real data of stock bars. The simulation experiments and validation show that the model can effectively describe the spread of public opinion among the user groups of financial social platforms when it occurs, and provide a valid reference for related workers. However, the content of public opinion, the personal influence of communicators, and the lag effect of communication all have an impact on communication, and these issues need to be addressed in future research.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"s1-10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125538854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Monocular Camera Video Based Reconstruction of 3D human model 基于单目摄像机视频的三维人体模型重建

Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering Pub Date : 2022-10-21 DOI: 10.1145/3573428.3573670

Daoshun Xie, Zongyue Wang, Guorong Cai, Qiming Xia, Yidong Chen, S. Yang

{"title":"Monocular Camera Video Based Reconstruction of 3D human model","authors":"Daoshun Xie, Zongyue Wang, Guorong Cai, Qiming Xia, Yidong Chen, S. Yang","doi":"10.1145/3573428.3573670","DOIUrl":"https://doi.org/10.1145/3573428.3573670","url":null,"abstract":"This paper addresses a method to obtain an accurate 3D human body model and a photorealistic free-view image of an arbitrary person from a monocular camera video. Recent works has shown that it is possible to reconstruct a human model at a level of detail from a single image. However, inferring a complete 3D human model from a network model will be ill-posed if rely on a single photograph of a person. In order to reasonably infer the 3D human model, we propose method based on implicit field representation to integrate the information of video frames by a set of structured latent code. The core of our method is to construct the implicit field by relatively sparse structured latent code. Meanwhile, align the vertices of the parametric human model and structured latent code to the same coordinate system. Extensive experimental results on monocular datasets demonstrate the effectiveness of our approach in generating accurate 3D human models. Our method utilizes a monocular camera to obtain a 3D model which enables consumers create their personality digital model.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122387290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Research on a Simulation Algorithm for Display Effect of Rotating LED Device 旋转LED器件显示效果仿真算法研究

Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering Pub Date : 2022-10-21 DOI: 10.1145/3573428.3573509

Jianan Lin, Xinkai Weng

引用次数: 0