{"title":"Pedestrian Attribute Recognition Based on Association Rules","authors":"Diwei Xie, Heqian Qiu, Linfeng Xu","doi":"10.1109/ccis57298.2022.10016404","DOIUrl":"https://doi.org/10.1109/ccis57298.2022.10016404","url":null,"abstract":"Over the past few years, deep learning has achieved impressive performance, and pedestrian attribute recognition has also been extensively widely studied. Pedestrian attribute recognition aims to predict a set of attributes from a predefined attributes list to describe the characteristics of the person. However, there are many different levels of attributes in the predefined attribute list, especially some high-level semantic information, so how to exploit the relationship between these attributes is an important challenge. We propose a flexible Association Rules Module(ARM), which can use association rules to express the relationship between attributes. Moreover, this module can work on different baselines. Extensive experiments show that the proposed method achieves excellent performance on two datasets and four baselines.","PeriodicalId":374660,"journal":{"name":"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114603955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Multimodal Fusion Scene Graph Generation Method Based on Semantic Description","authors":"Liwen Ma, Weifeng Liu, Yaning Wang","doi":"10.1109/CCIS57298.2022.10016416","DOIUrl":"https://doi.org/10.1109/CCIS57298.2022.10016416","url":null,"abstract":"For the scene graph generation task, a multimodal fusion scene graph generation method based on semantic description is proposed considering the problems of long-tail distribution and low frequency of high-level semantic interactions in the dataset. Firstly, target detection and relationship inference are performed on the image to construct an image scene graph. Second, the semantic descriptions are transformed into semantic graphs, which are fed into a pre-trained scene graph parser to construct semantic scene graphs. Finally, the two scene graphs are aligned for display and the information of nodes and edges are updated to obtain a fused scene graph with more comprehensive coverage and more accurate semantic interaction information.","PeriodicalId":374660,"journal":{"name":"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"191 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124247550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep Text Retrieval Models based on DNN, CNN, RNN and Transformer: A review","authors":"Jianping Liu, Xintao Chu, Yingfei Wang, Meng Wang","doi":"10.1109/ccis57298.2022.10016379","DOIUrl":"https://doi.org/10.1109/ccis57298.2022.10016379","url":null,"abstract":"The development of deep learning technology provides a new development direction for text retrieval. Researchers have applied deep learning techniques to different information retrieval objects and carried out rich studies on them, such as web pages, scientific literature, and scientific data. This paper selects 40 research papers on related topics in the past 10 years through a step-by-step selection and conducts a review on the dimensions of model input, model structure, and its performance. Firstly, according to the differences in methods, we divided the deep learning text retrieval model into four categories: DNN-based, CNN-based, RNN-based, and Transformer-based, and analyzed the classical model structure and retrieval effect of each category. Secondly, we analyzed and compared the application scenarios of different types of models, and summarized some classic retrieval datasets. Finally, we discussed the main challenges and future research trends of deep text retrieval. This review is expected to provide basic knowledge and effective research entry points for scholars engaged in deep learning text retrieval.","PeriodicalId":374660,"journal":{"name":"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125500701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SPNet: Utilizing Subspace Projection to Achieve Feature Interaction for Click-Through Rate","authors":"Xu Zhang, Z. Ou, Meina Song","doi":"10.1109/ccis57298.2022.10016313","DOIUrl":"https://doi.org/10.1109/ccis57298.2022.10016313","url":null,"abstract":"Click-through rate prediction is critical to online advertising services, wherein feature interaction is fundamental. There exist a number of schemes studying feature interaction, including factorization machine, xDeepFM, and other deep learning based models. Nevertheless, multiple features in one layer are not able to interact with each other or can not interact effectively. To resolve this problem, we propose a novel Subspace Projection Network (SPNet) in this paper. SPNet leverages subspace projection to make all features interact with each other in one subspace. Different subspaces employ different approaches to interact features in one layer. By stacking multiple layers, complex feature interactions can be implemented. To verify effectiveness of SPNet, we conduct experiments on two large-scale datasets. Experimental results demonstrate that SPNet not only outperforms the state-of-the-art shallow models, but also surpasses most deep learning based schemes.","PeriodicalId":374660,"journal":{"name":"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129414652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An optimization r-cnn method for Ovarian cyst detection","authors":"Jiade Li, Qili Chen","doi":"10.1109/CCIS57298.2022.10016426","DOIUrl":"https://doi.org/10.1109/CCIS57298.2022.10016426","url":null,"abstract":"With the continuous development and progress of medical imaging technology and computer technology, medical image analysis has become an indispensable tool and technical means in medical research, clinical disease diagnosis and treatment. This paper will introduce the ovarian cyst detector based on r-cnn and its improved optimization method, and make a comparative analysis..","PeriodicalId":374660,"journal":{"name":"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"232 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122028891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Magnetic target localization method based on mobile platform","authors":"Pengfei Jiang, X. Pang, Hui Yang","doi":"10.1109/ccis57298.2022.10016375","DOIUrl":"https://doi.org/10.1109/ccis57298.2022.10016375","url":null,"abstract":"Aiming at the problem that the calculation process of magnetic target localization is complex and greatly affected by the geomagnetic field, a single-point magnetic gradient tensor localization method is derived differentially, and a target localization method based on a third-order tensor of plane cross structure is proposed. The positioning method can locate the magnetic target in real time based on the mobile platform, theoretically can reduce the influence of the geomagnetic field on the target positioning, and the calculation process is simple, and there is no need to solve the nonlinear equation system. The effectiveness of the localization method is verified by simulation, and the influence of the baseline of the tensor measurement system on the localization results is verified.","PeriodicalId":374660,"journal":{"name":"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123062814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jie Chen, Luping Luo, Bojing Ji, Shu Zhao, Yanping Zhang
{"title":"A Joint Learning Sentiment Analysis Method Incorporating Emoji-Augmentation","authors":"Jie Chen, Luping Luo, Bojing Ji, Shu Zhao, Yanping Zhang","doi":"10.1109/CCIS57298.2022.10016405","DOIUrl":"https://doi.org/10.1109/CCIS57298.2022.10016405","url":null,"abstract":"Social media is the platform for most people to share their opinions, emojis are also widely used to express moods, emotions, and feelings on social media. There have been many researched on emojis and sentiment analysis. However, existing methods mainly face two limitations. First, since deep learning relies on large amounts of labeled data, the training samples of emoji are not enough to achieve the training effect. Second, they consider the sentiment of emojis and texts separately, not fully exploring the impact of emojis on the sentiment polarity of texts. In this paper, we propose a joint learning sentiment analysis method incorporating emoji-augmentation, and the method has two advantages compared with the existing work. First, We optimize the easy data augmentation method so that the newly generated sentences can also preserve the semantic information of emojis, which relieves the problem of insufficient training data with emojis. Second, it fuses emojis and text features to allow the model to better learn the mutual emotional semantics between text and emojis, jointly training emojis and words to obtain the sentence representations containing more semantic information of both emojis and text. Our experimental results show that the proposed method can significantly improve the performance compared with several baselines on two datasets.","PeriodicalId":374660,"journal":{"name":"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128286191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Attention based Long Short-Term Memory Network for Coastal Visibility Forecast","authors":"Rui Min, Ming Wu, Mengqiu Xu, Xun Zu, Xun Zhu","doi":"10.1109/CCIS57298.2022.10016374","DOIUrl":"https://doi.org/10.1109/CCIS57298.2022.10016374","url":null,"abstract":"Visibility prediction in coastal areas has always been an important issue affecting the safety of residents and the efficiency of urban transportation. The visibility prediction methods currently used by meteorological centers are mainly based on the statistical forecast with relatively low prediction accuracy and high computational complexity. These methods cannot work well with large amounts of data. However, with the rapid development of deep learning technology, the use of deep learning has become a primary trend. In this paper, we propose our visibility prediction model based on (Long Short-Term Memory) LSTM network and self-attention mechanism. The model takes Medium-range Forecasts Data from European Centre for Mediumrange Weather Forecasting (ECMWF) which we use EC data to refer it for simplicity and observatory visibility data as input to predict and uses the LSTM network as the backbone to extract time series information. We also use self-attention mechanism to process the input data before the data is input to the model to let the model better focus on the valuable information for prediction. Compared with the predicted visibility in EC data, our proposed method improved the 3-hour prediction accuracy by 20%, 1.5 times, and 8 times for high-range, medium-range, and low-range visibility, respectively. We also find the data imbalance will greatly affect the prediction accuracy for low-visibility data and use the weighted-loss and mix-up data augmentation strategy model in our model training. We improved the accuracy of low-visibility data by 1.2 times while the prediction results of high-visibility and medium-visibility data remained almost the same. In addition, we conduct several experiments to verify the effectiveness of our model design and the rationality of data augmentation.","PeriodicalId":374660,"journal":{"name":"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128484444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"High-precision visual navigation and localization method","authors":"Z. Peng, Dongsheng Li, Longtao Cai","doi":"10.1109/ccis57298.2022.10016389","DOIUrl":"https://doi.org/10.1109/ccis57298.2022.10016389","url":null,"abstract":"The paper studies high-precision visual navigation and localization technology which has avoided the impact of dynamic objects. The dynamic feature points in the scene are eliminated by the dynamic object detection algorithm based on the YOLACT instance segmentation and geometric epipolar constraint, and the algorithm is integrated into SLAM to achieve the improved three-dimensional location of the dynamic scene, based on the TUM RGB-D data set, the localization accuracy and the running speed of algorithm are tested respectively. The new algorithm is compared with traditional PL-SLAM and ORB-SLAM algorithms. The test results show that the new algorithm average running time on the three data set is less 30.229s than the traditional PL-SLAM algorithm. Compared with the traditional PL-SLAM algorithm and ORB-SLAM algorithm, the root mean square error of absolute trajectory is reduced by 0.084m and 0.130m respectively.","PeriodicalId":374660,"journal":{"name":"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"188 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121466193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Contrastive Semantic Similarity Learning for Multi-Hop Question Answering over Event-Centric Knowledge Graphs","authors":"Wei Tang, Qingchao Kong, W. Mao, Xiaofei Wu","doi":"10.1109/ccis57298.2022.10016335","DOIUrl":"https://doi.org/10.1109/ccis57298.2022.10016335","url":null,"abstract":"Question answering in natural languages provides an intuitive and efficient way to help people access the rich information stored in various kinds of knowledge graphs (KGs). One of the key challenges for question answering over knowledge graphs (KGQA) is to learn a semantic representation of the input question and candidate relation chains over KGs and accurately measure the similarity between them. However, existing methods often failed to capture the semantic similarity for complex question answering, e.g., multi-hop and temporal constrained situations. In addition, existing KGQA related research mostly concentrates on entities while often ignores the events which contain a large portion of the world knowledge. To solve this issue, we propose a Contrastive Semantic Similarity Learning (CSSL) method for multi-hop question answering over event-centric KGs. In this method, for candidate relation chains generation, the retrieval subgraph is first constructed by identifying the topic event or entity in the question. To better accommodate complex questions, we introduce the contrastive learning framework to learn a common semantic space, where the similarity score is finally calculated to select the final answer. The experimental results on the EventQA dataset show that the proposed method achieves superior performances compared to the state-of-the-art baselines.","PeriodicalId":374660,"journal":{"name":"2022 IEEE 8th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"167 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131513754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}