Computational Intelligence最新文献

筛选
英文 中文
A hybrid approach for portfolio construction: Combing two-stage ensemble forecasting model with portfolio optimization 构建投资组合的混合方法:将两阶段集合预测模型与投资组合优化相结合
IF 2.8 4区 计算机科学
Computational Intelligence Pub Date : 2023-12-15 DOI: 10.1111/coin.12617
Wei Chen, Zinuo Liu, Lifen Jia
{"title":"A hybrid approach for portfolio construction: Combing two-stage ensemble forecasting model with portfolio optimization","authors":"Wei Chen,&nbsp;Zinuo Liu,&nbsp;Lifen Jia","doi":"10.1111/coin.12617","DOIUrl":"10.1111/coin.12617","url":null,"abstract":"<p>Combining the stock prediction with portfolio optimization can improve the performance of the portfolio construction. In this article, we propose a novel portfolio construction approach by utilizing a two-stage ensemble model to forecast stock prices and combining the forecasting results with the portfolio optimization. To be specific, there are two phases in the approach: stock prediction and portfolio optimization. The stock prediction has two stages. In the first stage, three neural networks, that is, multilayer perceptron (MLP), gated recurrent unit (GRU), and long short-term memory (LSTM) are used to integrate the forecasting results of four individual models, that is, LSTM, GRU, deep multilayer perceptron (DMLP), and random forest (RF). In the second stage, the time-varying weight ordinary least square model (OLS) is utilized to combine the first-stage forecasting results to obtain the ultimate forecasting results, and then the stocks having a better potential return on investment are chosen. In the portfolio optimization, a diversified mean-variance with forecasting model named DMVF is proposed, in which an average predictive error term is considered to obtain excess returns, and a 2-norm cost function is introduced to diversify the portfolio. Using the historical data from the Shanghai stock exchange as the study sample, the results of the experiments indicate the DMVF model with two-stage ensemble prediction outperforms benchmarks in terms of return and return-risk characteristics.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":null,"pages":null},"PeriodicalIF":2.8,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138826112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cache-aided multiuser UAV-MEC networks for smart grid networks: A DDPG approach 用于智能电网网络的缓存辅助多用户 UAV-MEC 网络:DDPG 方法
IF 2.8 4区 计算机科学
Computational Intelligence Pub Date : 2023-12-12 DOI: 10.1111/coin.12616
Chun Yang, Zhe Wang, Binyu Xie
{"title":"Cache-aided multiuser UAV-MEC networks for smart grid networks: A DDPG approach","authors":"Chun Yang,&nbsp;Zhe Wang,&nbsp;Binyu Xie","doi":"10.1111/coin.12616","DOIUrl":"10.1111/coin.12616","url":null,"abstract":"<p>Mobile edge computing (MEC) is an important research topic in the field of wireless communication and mobile computing, as it can effectively decrease the latency and energy consumption due to the trade-off between the communication and computing, where some intensive computing tasks can be offloaded to computational access points (CAPs), especially when the wireless transmission channel is in good condition. This article studies how to intelligently allocate the computing capability and wireless bandwidth among users for a cache-aided multi-terminal multi-CAP MEC network with non-ideal channel estimation, where there are <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>N</mi>\u0000 </mrow>\u0000 <annotation>$$ N $$</annotation>\u0000 </semantics></math> mobile terminals and <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>M</mi>\u0000 </mrow>\u0000 <annotation>$$ M $$</annotation>\u0000 </semantics></math> CAPs in the network. Each terminal has some tasks that need to be computed in a fast and efficient way. For such a system, we first design the system by jointly considering the computing capability and wireless bandwidth allocation, where the computing and communication delay is used as the performance of metric. To optimize the system performance, we then employ deep deterministic policy gradient to learn an effective strategy on the allocation of computing capability and wireless bandwidth, in order to decrease the system delay as much as possible. Simulations are finally conducted to show the superiority of the proposed studies in this article, especially about the advantages from cache.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":null,"pages":null},"PeriodicalIF":2.8,"publicationDate":"2023-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138681118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved secure PCA and LDA algorithms for intelligent computing in IoT-to-cloud setting 改进安全PCA和LDA算法,用于物联网到云环境下的智能计算
IF 2.8 4区 计算机科学
Computational Intelligence Pub Date : 2023-12-04 DOI: 10.1111/coin.12613
Liu Jiasen, Wang Xu An, Li Guofeng, Yu Dan, Zhang Jindan
{"title":"Improved secure PCA and LDA algorithms for intelligent computing in IoT-to-cloud setting","authors":"Liu Jiasen,&nbsp;Wang Xu An,&nbsp;Li Guofeng,&nbsp;Yu Dan,&nbsp;Zhang Jindan","doi":"10.1111/coin.12613","DOIUrl":"10.1111/coin.12613","url":null,"abstract":"<p>The rapid development of new technologies such as artificial intelligence and big data analysis requires the simultaneous development of cloud computing technology. The application of IoT-to-cloud setting has been fully applied in various industry sectors, such as sensor-cloud system which is composed of wireless sensor network and cloud computing technology. With the increasing amount and types of collected data, companies need to reduce the dimension of massive data in cloud servers for obtaining data analysis reports rapidly. Due to frequent cloud server data leaks, companies must adequately protect the privacy of some confidential data. To this end, we designed a dimension reduction method for ciphertext data in the sensor-cloud system based on the CKKS encryption scheme, principal component analysis (PCA) and linear discriminant analysis (LDA) dimension reduction algorithm. As data cannot be directly calculated using traditional PCA and LDA algorithm after encryption, we add some interactive operations and iterative calculations to replace some steps in traditional algorithms. Finally, we select the classification dataset IRIS which is commonly used in machine learning, and screen out the best encryption and calculation parameters, and efficiently realize the dimension reduction method of ciphertext data through a large number of experiments.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":null,"pages":null},"PeriodicalIF":2.8,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138520442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Guest Editorial on the Special Issue on the Role of Fuzzy Systems on Biomedical Science in Healthcare 模糊系统在医疗保健领域生物医学科学中的作用特刊客座编辑
IF 2.8 4区 计算机科学
Computational Intelligence Pub Date : 2023-12-01 DOI: 10.1111/coin.12623
Davide Moroni, M. Trocan, B. U. Töreyin
{"title":"Guest Editorial on the Special Issue on the Role of Fuzzy Systems on Biomedical Science in Healthcare","authors":"Davide Moroni, M. Trocan, B. U. Töreyin","doi":"10.1111/coin.12623","DOIUrl":"https://doi.org/10.1111/coin.12623","url":null,"abstract":"Artificial neural networks (ANN) face challenges in the biomedical and health care sectors due to the elastic nature of biomedical data. This data requires a knowledge-centric approach rather than a purely data-centric one. Fuzzy systems efficiently handle the vagueness in medical big data, emulating human perception. These systems provide precise analysis for various medical situations, neutralizing uncertainties like varying disease patterns. They also support ranking populations based on health attributes, aiding in early prognosis and preventive medicine. This special issue is dedicated to focus on the recent advancements and applications of fuzzy systems within the area of healthcare data analysis. It has provided a platform for researchers to share innovative techniques and methodologies more effectively. Through this issue, we aspire to stimulate discussions, foster collaborations and inspire further innovations in leveraging fuzzy systems for more nuanced, human-like interpretations of complex biomedical datasets. As technology evolves, healthcare and diagnostics keeps changing continously. Taking a look at the array of innovative methods, we observe a clear inclination towards deep learning and computational intelligence in diagnostics. For instance, the application of Computational intelligence for analysing CT images for lung cancer detection and the XlmNet, which uses an Extreme Learning Machine Algorithm for classifying lung cancer from histopathological images, both focus on early-stage detection of lung diseases. Their reliance on intricate computational techniques demonstrates a move towards more precise and early diagnostic procedures. On the other hand, we have algorithms like the Residual neural network-assisted one-class classification, specifically tailored for melanoma recognition in imbalanced datasets. It’s evident that there’s a conscious effort to tackle class imbalance issues, which have long been a hurdle in medical image analysis. Mental health and wellbeing are not left behind either. The “Smart Analysis of Anxiety People and Their Activities” and the “Classification Analysis of Burnout People’s Brain Images” both emphasize the growing role of technology in understanding and diagnosing psychological health issues. Similarly, kidney diseases, retinal issues, skin lesions","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":null,"pages":null},"PeriodicalIF":2.8,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139189077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A multi-modal fusion YoLo network for traffic detection 用于流量检测的多模态融合YoLo网络
IF 2.8 4区 计算机科学
Computational Intelligence Pub Date : 2023-11-29 DOI: 10.1111/coin.12615
Xinwang Zheng, Wenjie Zheng, Chujie Xu
{"title":"A multi-modal fusion YoLo network for traffic detection","authors":"Xinwang Zheng,&nbsp;Wenjie Zheng,&nbsp;Chujie Xu","doi":"10.1111/coin.12615","DOIUrl":"10.1111/coin.12615","url":null,"abstract":"<p>Traffic detection (including lane detection and traffic sign detection) is one of the key technologies to realize driving assistance system and auto drive system. However, most of the existing detection methods are designed based on single-modal visible light data, when there are dramatic changes in lighting in the scene (such as insufficient lighting in night), it is difficult for these methods to obtain good detection results. In view of multi-modal data can provide complementary discriminative information, based on the YoLoV5 model, this paper proposes a multi-modal fusion YoLoV5 network, which consists of three key components: the dual stream feature extraction module, the correlation feature extraction module, and the self-attention fusion module. Specifically, the dual stream feature extraction module is used to extract the features of each of the two modalities. Secondly, input the features learned from the dual stream feature extraction module into the correlation feature extraction module to learn the features with maximum correlation. Then, the extracted maximum correlation features are used to achieve information exchange between modalities through a self-attention mechanism, and thus obtain fused features. Finally, the fused features are inputted into the detection layer to obtain the final detection result. Experimental results on different traffic detection tasks can demonstrate the superiority of the proposed method.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":null,"pages":null},"PeriodicalIF":2.8,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138520445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Few-shot learning for word-level scene text script identification 单词级场景文本脚本识别的少镜头学习
IF 2.8 4区 计算机科学
Computational Intelligence Pub Date : 2023-11-21 DOI: 10.1111/coin.12612
Veronica Naosekpam, Nilkanta Sahu
{"title":"Few-shot learning for word-level scene text script identification","authors":"Veronica Naosekpam,&nbsp;Nilkanta Sahu","doi":"10.1111/coin.12612","DOIUrl":"10.1111/coin.12612","url":null,"abstract":"<p>Script identification of text in scene images has attracted massive attention recently. However, the existing techniques primarily emphasize on scripts where data are available abundantly, such as English, European, or East Asian. Although these methods are robust in dealing with high-resource data, how these techniques will work on low-resource scripts has yet to be discovered. For example, in India, there is a disparity among the text scripts across the country's demographic. To bridge the research gap for resource-constraint script identification, we present a few-shot learning network called the TextScriptFSLNet. This network does not require huge training data while achieving state-of-the-art performance on benchmark datasets. Our proposed method acts in accordance with a <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>C</mi>\u0000 </mrow>\u0000 <annotation>$$ C $$</annotation>\u0000 </semantics></math>-way <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>K</mi>\u0000 </mrow>\u0000 <annotation>$$ K $$</annotation>\u0000 </semantics></math>-shot paradigm by splitting the train set as support and query set, respectively. The support set learns representative knowledge of each class and creates its prototypes. We use multi-kernel spatial attention fused 2-layer convolutional neural network and averaging technique to generate the prototype of each class. This spatial attention aids in grasping important information in an image and enriches the feature representation. To the best of our knowledge, the proposed work is the first of its kind in the scene text understanding domain. Additionally, we created a dataset called Indic-FSL2023 comprising 10 of the 22 officially recognized Indian scripts. The proposed method achieves the highest accuracy among the tested methods on the newly created Indic-FSL2023. Experiments are also conducted on MLe2e to demonstrate its versatility. Furthermore, we also showed how our proposed model performed concerning illumination changes and blur on scene text script images.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":null,"pages":null},"PeriodicalIF":2.8,"publicationDate":"2023-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138520454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced approach of multilabel learning for the Arabic aspect category detection of the hotel reviews 用于酒店评论阿拉伯语方面类别检测的多标签学习增强方法
IF 2.8 4区 计算机科学
Computational Intelligence Pub Date : 2023-11-14 DOI: 10.1111/coin.12609
Asma Ameur, Sana Hamdi, Sadok Ben Yahia
{"title":"Enhanced approach of multilabel learning for the Arabic aspect category detection of the hotel reviews","authors":"Asma Ameur,&nbsp;Sana Hamdi,&nbsp;Sadok Ben Yahia","doi":"10.1111/coin.12609","DOIUrl":"10.1111/coin.12609","url":null,"abstract":"<p>In many fields, like aspect category detection (ACD) in aspect-based sentiment analysis, it is necessary to label each instance with more than one label at the same time. This study tackles the multilabel classification problem in the ACD task for the Arabic language. For this purpose, we used Arabic hotel reviews from the SemEval-2016 dataset, comprising 13,113 annotated tuples provided for training (10,509) and testing (2,604). To extract valuable information, we first propose specific data preprocessing. Then, we suggest using the dynamic weighted loss function and a data augmentation method to fix the problem with this dataset's imbalance. Using two possible approaches, we develop new ways to find different categories of things in a review sentence. The first is based on classifier chains using machine learning models. The second is based on transfer learning using pretrained AraBERT fine-tuning for contextual representation. Our findings show that both approaches outperformed the related works for ACD on the Arabic SemEval-2016. Moreover, we observed that AraBERT fine-tuning performed much better and achieved a promising <math>\u0000 <semantics>\u0000 <mrow>\u0000 <msub>\u0000 <mrow>\u0000 <mi>F</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mn>1</mn>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 <annotation>$$ {F}_1 $$</annotation>\u0000 </semantics></math>-score of <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mn>68</mn>\u0000 <mo>.</mo>\u0000 <mn>02</mn>\u0000 <mo>%</mo>\u0000 </mrow>\u0000 <annotation>$$ 68.02% $$</annotation>\u0000 </semantics></math>.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":null,"pages":null},"PeriodicalIF":2.8,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134954000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ResNLS: An improved model for stock price forecasting ResNLS:改进的股票价格预测模型
IF 2.8 4区 计算机科学
Computational Intelligence Pub Date : 2023-11-12 DOI: 10.1111/coin.12608
Yuanzhe Jia, Ali Anaissi, Basem Suleiman
{"title":"ResNLS: An improved model for stock price forecasting","authors":"Yuanzhe Jia,&nbsp;Ali Anaissi,&nbsp;Basem Suleiman","doi":"10.1111/coin.12608","DOIUrl":"10.1111/coin.12608","url":null,"abstract":"<p>Stock prices forecasting has always been a challenging task. Although many research projects adopt machine learning and deep learning algorithms to address the problem, few of them pay attention to the varying degrees of dependencies between stock prices. In this paper we introduce a hybrid model that improves stock price prediction by emphasizing the dependencies between adjacent stock prices. The proposed model, ResNLS, is mainly composed of two neural architectures, ResNet and LSTM. ResNet serves as a feature extractor to identify dependencies between stock prices across time windows, while LSTM analyses the initial time-series data with the combination of dependencies which considered as residuals. In predicting the SSE Composite Index, our experiment reveals that when the closing price data for the previous five consecutive trading days is used as the input, the performance of the model (ResNLS-5) is optimal compared to those with other inputs. Furthermore, ResNLS-5 outperforms vanilla CNN, RNN, LSTM, and BiLSTM models in terms of prediction accuracy. It also demonstrates at least a 20% improvement over the current state-of-the-art baselines. To verify whether ResNLS-5 can help clients effectively avoid risks and earn profits in the stock market, we construct a quantitative trading framework for back testing. The experimental results show that the trading strategy based on predictions from ResNLS-5 can successfully mitigate losses during declining stock prices and generate profits in the periods of rising stock prices.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":null,"pages":null},"PeriodicalIF":2.8,"publicationDate":"2023-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/coin.12608","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135037642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cache-aided UAV-assisted relaying networks: Performance analysis and system optimization 缓存辅助无人机中继网络:性能分析与系统优化
IF 2.8 4区 计算机科学
Computational Intelligence Pub Date : 2023-11-06 DOI: 10.1111/coin.12610
Zhe Wang, Chun Yang, Binyu Xie
{"title":"Cache-aided UAV-assisted relaying networks: Performance analysis and system optimization","authors":"Zhe Wang,&nbsp;Chun Yang,&nbsp;Binyu Xie","doi":"10.1111/coin.12610","DOIUrl":"10.1111/coin.12610","url":null,"abstract":"<p>The utilization of distributed multi-agent unmanned aerial vehicles (UAVs) for computing tasks in remote areas has gained significant traction in recent years due to their adaptability and capability to access hard-to-reach regions that are inaccessible to ground-based methods. However, establishing wireless communication between UAVs and ground-based data sources in remote areas presents considerable challenges, particularly when UAVs are in motion. To tackle this challenge, this article investigates a cache-aided relaying system in the presence of UAVs, wherein a ground-based decode-and-forward relay equipped with cache space is deployed to facilitate wireless communication between UAVs and a central data source. Within the scope of this system, we first analyze the probability of transmission outage, providing an analytical expression for performance evaluation. We commence with the case of a single stationary UAV, subsequently expanding to multiple stationary UAVs, and ultimately incorporating multiple dynamic UAVs. Subsequently, we enhance the system performance by minimizing the outage probability through efficient power resource allocation among users. By means of mathematical modeling and simulations, this research examines the influence of various factors, including the cache size at the relay and the working mode of the UAV, on the system performance. Finally, simulations are conducted to validate the proposed analysis.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":null,"pages":null},"PeriodicalIF":2.8,"publicationDate":"2023-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135683867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A joint hierarchical cross-attention graph convolutional network for multi-modal facial expression recognition 用于多模态面部表情识别的联合分层交叉注意力图卷积网络
IF 2.8 4区 计算机科学
Computational Intelligence Pub Date : 2023-10-25 DOI: 10.1111/coin.12607
Chujie Xu, Yong Du, Jingzi Wang, Wenjie Zheng, Tiejun Li, Zhansheng Yuan
{"title":"A joint hierarchical cross-attention graph convolutional network for multi-modal facial expression recognition","authors":"Chujie Xu,&nbsp;Yong Du,&nbsp;Jingzi Wang,&nbsp;Wenjie Zheng,&nbsp;Tiejun Li,&nbsp;Zhansheng Yuan","doi":"10.1111/coin.12607","DOIUrl":"10.1111/coin.12607","url":null,"abstract":"<p>Emotional recognition in conversations (ERC) is increasingly being applied in various IoT devices. Deep learning-based multimodal ERC has achieved great success by leveraging diverse and complementary modalities. Although most existing methods try to adopt attention mechanisms to fuse different information, these methods ignore the complementarity between modalities. To this end, the joint cross-attention model is introduced to alleviate this issue. However, multi-scale feature information on different modalities is not utilized. Moreover, the context relationship plays an important role in feature extraction in the expression recognition task. In this paper, we propose a novel joint hierarchical graph convolution network (JHGCN) which exploits different layer features and context relationships for facial expression recognition based on audio-visual (A-V) information. Specifically, we adopt different deep networks to extract features from different modalities individually. For V modality, we construct V graph data based on patch embeddings which are extracted from the transformer encoder. Moreover, we embed the graph convolution which can leverage the intra-modality relationships with the transformer encoder. Then, the deep feature from different layers is fed to the hierarchical fusion module to enhance feature representation. At last, we use the joint cross-attention mechanism to exploit the complementary inter-modality relationships. To validate the proposed model, we have conducted various experiments on the AffWild2 and CMU-MOSI datasets. All results confirm that our proposed model achieves highly promising performance compared to the joint cross-attention model and other methods.</p>","PeriodicalId":55228,"journal":{"name":"Computational Intelligence","volume":null,"pages":null},"PeriodicalIF":2.8,"publicationDate":"2023-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135112687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信