{"title":"Scene text detection with gradient guidance","authors":"Siyu Chen, Enqi Zhan, Manjie Zhang","doi":"10.1117/12.2682595","DOIUrl":"https://doi.org/10.1117/12.2682595","url":null,"abstract":"In the process of text detection, we frequently encounter numerous indistinct images, which can easily result in text omission and misdetection. Inspired by the SPSR model, we introduce gradient branching to guide the training of text detection models in order to address this problem. By preserving more image edge features, we expect to improve the text detection performance of fuzzy images and fuzzy regions. The experiment demonstrates that the gradient guidance-based text detection model can detect text in ambiguous images more accurately and reduce instances of missing and incorrect detection.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115157287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Prairie mouse hole target detection technology based on deep learning","authors":"C. Li, Xiaoling Luo","doi":"10.1117/12.2682261","DOIUrl":"https://doi.org/10.1117/12.2682261","url":null,"abstract":"Rat hole detection is a key work in the prevention of rat damage. The digital rat hole detection method of UAV combined with target detection is studied to replace the manual carpet rat hole detection, so as to improve the detection efficiency. In this paper, low-altitude remote sensing of unmanned aerial vehicle was used to collect rat hole images on the Edolechuan grassland in Hohhot, Inner Mongolia. Combined with deep learning models: Faster-Rcnn, Yolov3 and SSD, rat hole detection was compared and analyzed. Data was cut and labeled through image preprocessing method. Then, by comparing the three groups of target detection models, the results show that the SSD model has the best effect on rat hole detection, the accuracy rate can reach 91.8%, and the reasoning speed can reach 7.9ms.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126652876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Wenbo Zhang, Jun Xie, Xinxiu Liu, Langlang Zhang, Pan Geng
{"title":"CNN-BiLSTM sewage treatment dissolved oxygen concentration prediction model based on attention mechanism","authors":"Wenbo Zhang, Jun Xie, Xinxiu Liu, Langlang Zhang, Pan Geng","doi":"10.1117/12.2682282","DOIUrl":"https://doi.org/10.1117/12.2682282","url":null,"abstract":"Aiming at the characteristics of complex biochemical reaction, nonlinearity and difficult prediction of dissolved oxygen in sewage treatment process, this paper proposes a dissolved oxygen concentration prediction model based on CNN-BiLSTM hybrid artificial neural network. Firstly, the abnormal data is identified and eliminated by data preprocessing, and the missing data is filled by interpolation method. Then, the Pearson correlation coefficient is used to analyze the correlation between dissolved oxygen and other variables. Multiple variable data with good correlation are selected and input into the CNN-BiLSTM network model. The dissolved oxygen concentration is predicted by CNN convolution operation combined with bidirectional long-term and short-term memory neural network (Bi-LSTM), and the time attention mechanism is introduced to learn the weight distribution between different time steps, focusing on the time step that has the greatest impact on dissolved oxygen concentration, so as to improve the prediction accuracy of the model. Compared with LSTM, GRU, CNN-LSTM and CNN-GRU models, the simulation results show that the proposed model can predict the dissolved oxygen more accurately and has higher prediction accuracy.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122048426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zhenhai Zhang, L. Niu, Lei Wang, Hongbo Li, Xiaoyu He, Yanqiang Zhai
{"title":"Fault diagnosis of three-level active power filter based on hybrid logic model","authors":"Zhenhai Zhang, L. Niu, Lei Wang, Hongbo Li, Xiaoyu He, Yanqiang Zhai","doi":"10.1117/12.2682383","DOIUrl":"https://doi.org/10.1117/12.2682383","url":null,"abstract":"With the increase of voltage level and capacity of modern power system, three-level converter, as the main circuit of active power filter, meets the requirement of improving compensation capacity. However, the number of power devices of three-level converter is twice as large as that of two-level converter, and the probability of power device failure is greater, which affects the normal operation of the system. Aiming at the fault of three-level active power filter, this paper studies the fault characteristics of different power components of the main circuit T-type three-level converter when open-circuit fault occurs, applies the hybrid theory to the fault diagnosis of the converter, establishes the normal model and the fault model of different power components when open-circuit fault occurs, and realizes the location of the fault components according to the residual generated by the predicted output and the actual output of the model, the effectiveness of this method is verified by simulation.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128267909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"RGB-D SLAM in dynamic environments with deep learning","authors":"W. Chen, Deji Li","doi":"10.1117/12.2682598","DOIUrl":"https://doi.org/10.1117/12.2682598","url":null,"abstract":"Traditional visual Simultaneous Localization and Mapping (SLAM) is mostly based on the assumption of static environment, which is susceptible to receive dynamic targets in dynamic environment, leading to the degradation of localization accuracy. In this paper, we introduce the instance segmentation network SOLOv2, which combined with motion consistency detection can effectively eliminate the dynamic feature points in the environment and improve the visual SLAM accuracy with the depth map hole repair algorithm. Tested on the TUM dataset, the positional estimation accuracy in dynamic environments is significantly improved compared to ORB-SLAM2.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126665011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reinforced model-agnostic counterfactual explanations for recommender systems","authors":"Ao Chang, Qingxian Wang","doi":"10.1117/12.2682249","DOIUrl":"https://doi.org/10.1117/12.2682249","url":null,"abstract":"Explanation is an important requirement for transparent and trustworthy recommender systems. When the recommendation model itself is not explainable, an explanation must be generated post-hoc. In contrast to traditional post-hoc explanation methods, counterfactual methods can provide scrutable and actionable explanations with high fidelity. Existing counterfactual explanation methods for recommender systems are either not generalizable or face a huge search space. In this work, we propose a reinforcement learning counterfactual explanation method MACER (Model-Agnostic Counterfactual Explanations for Recommender Systems) which generates item-based explanations for recommender systems. We embed the discrete action space into a continuous space, making it possible to use the process of finding counterfactual explanations as a task of reinforcement learning. This method treats the recommender system as a black box (model-agnostic) and has no requirement on the type of recommender system, and thus is applicable to all recommendation systems.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124453103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analysis of seismic anomalies in northern state of New Zealand based on Schumann resonance time-frequency diagram","authors":"Mingyi Que, Yongming Huang, Yong Lu","doi":"10.1117/12.2682266","DOIUrl":"https://doi.org/10.1117/12.2682266","url":null,"abstract":"Schumann resonance is a phenomenon in the extremely low frequency electromagnetic field, which is generally presented in the form of spectrum diagram and time-frequency diagram in the form of data. In this paper, based on the analysis of the data of Schumann resonance 0~50Hz time-frequency diagram, the time difference matrix and frequency difference matrix of the first and second order of the Schumann resonance time-frequency diagram are extracted respectively, and the time-frequency diagram is encoded and decoded by convolution autoencoder. In the process of encoding and decoding, the mapping function is constructed by combining the constraints, and the time-frequency diagram is decomposed and fitted, so as to improve the ability of data to reflect changes. Then, according to the seven different index matrices obtained, the determinant, rank and trace of each index matrix are obtained, and the earthquakes with magnitude above six in the northern state of New Zealand from 2016 to 2019 are analyzed. The final experimental results show that there is a certain degree of feedback phenomenon in the selected analysis indicators in the research object of this paper for a period of time before and after the event, which shows that the method of time-frequency diagram analysis is feasible.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132460406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-feature extraction network cross-resolution person re-identification based on SR technology","authors":"Run-Lan Tian, Zongzong Wu, Qingwei Pang, Jian Zheng","doi":"10.1117/12.2682284","DOIUrl":"https://doi.org/10.1117/12.2682284","url":null,"abstract":"In the real world, the resolution of the image that is collected can vary depending on the camera's quality or the change in the distance from the pedestrian. Important information is lost from the low-resolution image. It can be difficult to match Low Resolution (LR) input photographs with High Resolution (HR) gallery images. Thus, we suggest that the super-resolution module and the multi-feature extraction module be improved in order to address the aforementioned issues. To be more precise, the resolution of the low-resolution query image is restored in the first step using an upgraded Super Resolution (SR) model (VDSR-NAM). A two-stream feature extraction network extracts and fuses the features of the LR and SR images in the second stage. The potential of our model has been shown in numerous tests on cross-resolution person re-id datasets. The efficacy of the loss function on our model is concurrently confirmed by ablation experiments on the dataset MLR-VIPER.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127579695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Two-stage attentional temporal convolution and LSTM model for financial data forecasting","authors":"Lifang Chen, Xiaowan Li, Zhenping Xie","doi":"10.1117/12.2682556","DOIUrl":"https://doi.org/10.1117/12.2682556","url":null,"abstract":"Financial time series usually consist of multiple time series, and financial time series data forecasting models use the historical data plays of multiple driving series to predict the future values of the target series. In recent years, attention-based Long and Short-Term Memory (LSTM) neural networks and Temporal Convolutional Networks (TCN) have been widely used in time series forecasting. In this paper, we propose a two-stage attention-based TCN and LSTM hybrid forecasting model, in order to better obtain the spatial correlation of driving sequences, we used causal self-attention to obtain the spatial attention weights of driving sequences, then use TCN to extract the short-term features of the series in the first stage, in the second stage, adding the temporal attention module computes the sequence adaptively assigning weights to the input sequence for the current and historical moments, and finally use LSTM to capture the long-term dependence of the time-series data. We used the NASDAQ 100 stock dataset and the financial time series of CSI 300 companies to measure the performance of the proposed model in financial data forecasting.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"12700 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131120594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Haolin Sun, Anni Wang, Zijian Gao, Juan Liu, Jia Cui, Jiazheng Tian, C. Li
{"title":"Optimal scheduling of virtual power plant considering demand side response","authors":"Haolin Sun, Anni Wang, Zijian Gao, Juan Liu, Jia Cui, Jiazheng Tian, C. Li","doi":"10.1117/12.2682490","DOIUrl":"https://doi.org/10.1117/12.2682490","url":null,"abstract":"In this paper, a virtual power plant model considering demand side response is proposed. Considering the influence of price demand-side response on virtual power plant, an optimal scheduling control method was proposed for virtual power plant, which included demand-side response, wind power, photovoltaic, gas turbine and energy storage equipment. Firstly, a mathematical model was built with the maximum net income of virtual power plant as the objective function. Secondly, a reasonable operation strategy was designed. The demand-side response strategy was optimized based on wind-landscape output prediction. Thirdly, the gas turbine and energy storage equipment coordinated the output allocation to compensate the deviation between the actual output and the predicted output of the scenery. Finally, through the simulation analysis of the example, the results show that the consideration of the demand-side response based on price in the virtual power plant can standardize the user's electricity consumption. And the load peaking and valley filling effect is obvious. The designed operation strategy has coordinated the power distribution of various energy sources and maximized the income of the virtual power plant.","PeriodicalId":177416,"journal":{"name":"Conference on Electronic Information Engineering and Data Processing","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133745802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}