PeerJ Computer Science最新文献

筛选
英文 中文
Comprehensive empirical evaluation of feature extractors in computer vision.
IF 3.5 4区 计算机科学
PeerJ Computer Science Pub Date : 2024-11-04 eCollection Date: 2024-01-01 DOI: 10.7717/peerj-cs.2415
Murat Isik
{"title":"Comprehensive empirical evaluation of feature extractors in computer vision.","authors":"Murat Isik","doi":"10.7717/peerj-cs.2415","DOIUrl":"10.7717/peerj-cs.2415","url":null,"abstract":"<p><p>Feature detection and matching are fundamental components in computer vision, underpinning a broad spectrum of applications. This study offers a comprehensive evaluation of traditional feature detections and descriptors, analyzing methods such as Scale Invariant Feature Transform (SIFT), Speeded-Up Robust Features (SURF), Binary Robust Independent Elementary Features (BRIEF), Oriented FAST and Rotated BRIEF (ORB), Binary Robust Invariant Scalable Keypoints (BRISK), KAZE, Accelerated KAZE (AKAZE), Fast Retina Keypoint (FREAK), Dense and Accurate Invariant Scalable descriptor for Yale (DAISY), Features from Accelerated Segment Test (FAST), and STAR. Each feature extractor was assessed based on its architectural design and complexity, focusing on how these factors influence computational efficiency and robustness under various transformations. Utilizing the Image Matching Challenge Photo Tourism 2020 dataset, which includes over 1.5 million images, the study identifies the FAST algorithm as the most efficient detector when paired with the ORB descriptor and Brute-Force (BF) matcher, offering the fastest feature extraction and matching process. ORB is notably effective on affine-transformed and brightened images, while AKAZE excels in conditions involving blurring, fisheye distortion, image rotation, and perspective distortions. Through more than 2 million comparisons, the study highlights the feature extractors that demonstrate superior resilience across various conditions, including rotation, scaling, blurring, brightening, affine transformations, perspective distortions, fisheye distortion, and salt-and-pepper noise.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2415"},"PeriodicalIF":3.5,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623105/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative named entity recognition framework for Chinese legal domain.
IF 3.5 4区 计算机科学
PeerJ Computer Science Pub Date : 2024-11-04 eCollection Date: 2024-01-01 DOI: 10.7717/peerj-cs.2428
Xingliang Mao, Jie Jiang, Yongzhe Zeng, Yinan Peng, Shichao Zhang, Fangfang Li
{"title":"Generative named entity recognition framework for Chinese legal domain.","authors":"Xingliang Mao, Jie Jiang, Yongzhe Zeng, Yinan Peng, Shichao Zhang, Fangfang Li","doi":"10.7717/peerj-cs.2428","DOIUrl":"10.7717/peerj-cs.2428","url":null,"abstract":"<p><p>Named entity recognition (NER) is a crucial task in natural language processing, particularly challenging in the legal domain due to the intricate and lengthy nature of legal entities. Existing methods often struggle with accurately identifying entity boundaries and types in legal texts. To address these challenges, we propose a novel sequence-to-sequence framework designed specifically for the legal domain. This framework features an entity-type-aware module that leverages contrastive learning to enhance the prediction of entity types. Additionally, we incorporate a decoder with a copy mechanism that accurately identifies complex legal entities without the need for explicit tagging schemas. Our extensive experiments on two legal datasets show that our framework significantly outperforms state-of-the-art methods, achieving notable improvements in precision, recall, and F1 score. This demonstrates the effectiveness of our approach in improving entity recognition in legal texts, offering a promising direction for future research in legal NER.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2428"},"PeriodicalIF":3.5,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11622873/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TCellR2Vec: efficient feature selection for TCR sequences for cancer classification.
IF 3.5 4区 计算机科学
PeerJ Computer Science Pub Date : 2024-11-04 eCollection Date: 2024-01-01 DOI: 10.7717/peerj-cs.2239
Zahra Tayebi, Sarwan Ali, Murray Patterson
{"title":"TCellR2Vec: efficient feature selection for TCR sequences for cancer classification.","authors":"Zahra Tayebi, Sarwan Ali, Murray Patterson","doi":"10.7717/peerj-cs.2239","DOIUrl":"10.7717/peerj-cs.2239","url":null,"abstract":"<p><p>Cancer remains one of the leading causes of death globally. New immunotherapies that harness the patient's immune system to fight cancer show promise, but their development requires analyzing the diversity of immune cells called T-cells. T-cells have receptors that recognize and bind to cancer cells. Sequencing these T-cell receptors allows to provide insights into their immune response, but extracting useful information is challenging. In this study, we propose a new computational method, TCellR2Vec, to select key features from T-cell receptor sequences for classifying different cancer types. We extracted features like amino acid composition, charge, and diversity measures and combined them with other sequence embedding techniques. For our experiments, we used a dataset of over 50,000 T-cell receptor sequences from five cancer types, which showed that TCellR2Vec improved classification accuracy and efficiency over baseline methods. These results demonstrate TCellR2Vec's ability to capture informative aspects of complex T-cell receptor sequences. By improving computational analysis of the immune response, TCellR2Vec could aid the development of personalized immunotherapies tailored to each patient's T-cells. This has important implications for creating more effective cancer treatments based on the individual's immune system.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2239"},"PeriodicalIF":3.5,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11622898/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic visual recognition for leaf disease based on enhanced attention mechanism.
IF 3.5 4区 计算机科学
PeerJ Computer Science Pub Date : 2024-11-04 eCollection Date: 2024-01-01 DOI: 10.7717/peerj-cs.2365
Yumeng Yao, Xiaodun Deng, Xu Zhang, Junming Li, Wenxuan Sun, Gechao Zhang
{"title":"Automatic visual recognition for leaf disease based on enhanced attention mechanism.","authors":"Yumeng Yao, Xiaodun Deng, Xu Zhang, Junming Li, Wenxuan Sun, Gechao Zhang","doi":"10.7717/peerj-cs.2365","DOIUrl":"10.7717/peerj-cs.2365","url":null,"abstract":"<p><p>Recognition methods have made significant strides across various domains, such as image classification, automatic segmentation, and autonomous driving. Efficient identification of leaf diseases through visual recognition is critical for mitigating economic losses. However, recognizing leaf diseases is challenging due to complex backgrounds and environmental factors. These challenges often result in confusion between lesions and backgrounds, limiting information extraction from small lesion targets. To tackle these challenges, this article proposes a visual leaf disease identification method based on an enhanced attention mechanism. By integrating multi-head attention mechanisms, this method accurately identifies small targets of tomato lesions and demonstrates robustness in complex conditions, such as varying illumination. Additionally, the method incorporates Focaler-SIoU to enhance learning capabilities for challenging classification samples. Experimental results showcase that the proposed algorithm enhances average detection accuracy by 10.3% compared to the baseline model, while maintaining a balanced identification speed. This method facilitates rapid and precise identification of tomato diseases, offering a valuable tool for disease prevention and economic loss reduction.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2365"},"PeriodicalIF":3.5,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623051/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Measurement of sulfur content in coal mining areas by using field-remote sensing data and an integrated deep learning model.
IF 3.5 4区 计算机科学
PeerJ Computer Science Pub Date : 2024-11-04 eCollection Date: 2024-01-01 DOI: 10.7717/peerj-cs.2458
Jingyi Liu, Ba Tuan Le
{"title":"Measurement of sulfur content in coal mining areas by using field-remote sensing data and an integrated deep learning model.","authors":"Jingyi Liu, Ba Tuan Le","doi":"10.7717/peerj-cs.2458","DOIUrl":"10.7717/peerj-cs.2458","url":null,"abstract":"<p><p>High-quality coal emits a smaller amount of harmful substances during the combustion process, which greatly reduces the environmental hazard. The sulfur content of coal is one of the important indicators that determine coal quality. The world's demand for high-quality coal is increasing. This is challenging for the coal mining industry. Therefore, how to quickly determine the sulfur content of coal in coal mining areas has always been a research difficulty. This study is the first to map the distribution of sulfur content in opencast coal mines using field-remote sensing data, and propose a novel method for evaluating coal mine composition. We collected remote sensing, field visible and near-infrared (Vis-NIR) spectroscopy data and built analytical models based on a tiny neural network based on the convolutional neural network. The experimental results show that the proposed method can effectively analyze the coal sulfur content. The coal recognition accuracy is 99.65%, the root-mean-square error is 0.073 and the R is 0.87, and is better than support vector machines and partial least squares methods. Compared with traditional methods, the proposed method shows many advantages and superior performance.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2458"},"PeriodicalIF":3.5,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11622992/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fixation patterns in pairs of facial expressions-preferences of self-critical individuals.
IF 3.5 4区 计算机科学
PeerJ Computer Science Pub Date : 2024-11-01 eCollection Date: 2024-01-01 DOI: 10.7717/peerj-cs.2413
Bronislava Šoková, Martina Baránková, Júlia Halamová
{"title":"Fixation patterns in pairs of facial expressions-preferences of self-critical individuals.","authors":"Bronislava Šoková, Martina Baránková, Júlia Halamová","doi":"10.7717/peerj-cs.2413","DOIUrl":"10.7717/peerj-cs.2413","url":null,"abstract":"<p><p>So far, studies have revealed some differences in how long self-critical individuals fixate on specific facial expressions and difficulties in recognising these expressions. However, the research has also indicated a need to distinguish between the different forms of self-criticism (inadequate or hated self), the key underlying factor in psychopathology. Therefore, the aim of the current research was to explore fixation patterns for all seven primary emotions (happiness, sadness, fear, disgust, contempt, anger, and surprise) and the neutral face expression in relation to level of self-criticism by presenting random facial stimuli in the right or left visual field. Based on the previous studies, two groups were defined, and the pattern of fixations and eye movements were compared (high and low inadequate and hated self). The research sample consisted of 120 adult participants, 60 women and 60 men. We used the Forms of Self-Criticizing and Self-Reassuring Scale to measure self-criticism. As stimuli for the eye-tracking task, we used facial expressions from the Umeå University Database of Facial Expressions database. Eye movements were recorded using the Tobii X2 eye tracker. Results showed that in highly self-critical participants with inadequate self, time to first fixation and duration of first fixation was shorter. Respondents with higher inadequate self also exhibited a sustained pattern in fixations (total fixation duration; total fixation duration ratio and average fixation duration)-fixation time increased as self-criticism increased, indicating heightened attention to facial expressions. On the other hand, individuals with high hated self showed increased total fixation duration and fixation count for emotions presented in the right visual field but did not differ in initial fixation metrics in comparison with high inadequate self group. These results suggest that the two forms of self-criticism - inadequate self and hated self, may function as distinct mechanisms in relation to emotional processing, with implications for their role as potential transdiagnostic markers of psychopathology based on the fixation eye-tracking metrics.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2413"},"PeriodicalIF":3.5,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623007/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A web scraping app for smart literature search of the keywords.
IF 3.5 4区 计算机科学
PeerJ Computer Science Pub Date : 2024-10-31 eCollection Date: 2024-01-01 DOI: 10.7717/peerj-cs.2384
Muhammed Ali Mutlu, Eyup Emre Ulku, Kazim Yildiz
{"title":"A web scraping app for smart literature search of the keywords.","authors":"Muhammed Ali Mutlu, Eyup Emre Ulku, Kazim Yildiz","doi":"10.7717/peerj-cs.2384","DOIUrl":"10.7717/peerj-cs.2384","url":null,"abstract":"<p><p>Detailed literature search and writing is very important for the success of long research projects, publications and theses. Search engines provide significant convenience in research processes. However, conducting a comprehensive and systematic research on the web requires a long working process. In order to make literature searches effective, simple and comprehensive, various libraries and development tools have been created and made available. By using these development tools, research processes that may take days can be reduced to hours or even minutes. Literature review is not only necessary for academic studies, but it is a process that should be used and performed in every field where new approaches are adopted. Literature review is a process that gives us important ideas about whether similar studies have been conducted before, which methods have been used before and what has not been addressed in previous studies. It is also of great importance in terms of preventing possible copyright problems in future studies. The main purpose of this study is to propose an application that will facilitate, speed up and increase the efficiency of literature searches. In existing systems, literature searches are performed by browsing search sites or various article sites one by one and using the search tools provided by these sites. It is simple to use, allows the entire World Wide Web environment to be searched, and provides the user with the search findings. In this study, we have implemented an application that allows the crawling of the entire World Wide Web environment, is very simple to use, and quickly presents the crawl findings to the user.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2384"},"PeriodicalIF":3.5,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623055/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A framework for generating recommendations based on trust in an informal e-learning environment.
IF 3.5 4区 计算机科学
PeerJ Computer Science Pub Date : 2024-10-31 eCollection Date: 2024-01-01 DOI: 10.7717/peerj-cs.2386
Amjad Rehman, Adeel Ahmed, Tahani Jaser Alahmadi, Abeer Rashad Mirdad, Bayan Al Ghofaily, Khalid Saleem
{"title":"A framework for generating recommendations based on trust in an informal e-learning environment.","authors":"Amjad Rehman, Adeel Ahmed, Tahani Jaser Alahmadi, Abeer Rashad Mirdad, Bayan Al Ghofaily, Khalid Saleem","doi":"10.7717/peerj-cs.2386","DOIUrl":"10.7717/peerj-cs.2386","url":null,"abstract":"<p><p>Rapid advancement in information technology promotes the growth of new online learning communities in an e-learning environment that overloads information and data sharing. When a new learner asks a question, how a system recommends the answer is the problem of the learner's cold start. In this article, our contributions are: (i) We proposed a Trust-aware Deep Neural Recommendation (TDNR) framework that addresses learner cold-start issues in informal e-learning by modeling complex nonlinear relationships. (ii) We utilized latent Dirichlet allocation for tag modeling, assigning tag categories to newly posted questions and ranking experts related to specific tags for active questioners based on hub and authority scores. (iii) We enhanced recommendation accuracy in the TDNR model by introducing a degree of trust between questioners and responders. (iv) We incorporated the questioner-responder relational graph, derived from structural preference information, into our proposed model. We evaluated the proposed model on the Stack Overflow dataset using mean absolute precision (MAP), root mean squared error (RMSE), and F-measure metrics. Our significant findings are that TDNR is a hybrid approach that provides more accurate recommendations compared to rating-based and social-trust-based approaches, the proposed model can facilitate the formation of informal e-learning communities, and experiments show that TDNR outperforms the competing methods by an improved margin. The model's robustness, demonstrated by superior MAE, RMSE, and F-measure metrics, makes it a reliable solution for addressing information overload and user sparsity in Stack Overflow. By accurately modeling complex relationships and incorporating trust degrees, TDNR provides more relevant and personalized recommendations, even in cold-start scenarios. This enhances user experience by facilitating the formation of supportive learning communities and ensuring new learners receive accurate recommendations.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2386"},"PeriodicalIF":3.5,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623215/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal biometric identification: leveraging convolutional neural network (CNN) architectures and fusion techniques with fingerprint and finger vein data.
IF 3.5 4区 计算机科学
PeerJ Computer Science Pub Date : 2024-10-31 eCollection Date: 2024-01-01 DOI: 10.7717/peerj-cs.2440
Amal Alshardan, Arun Kumar, Mohammed Alghamdi, Mashael Maashi, Saad Alahmari, Abeer A K Alharbi, Wafa Almukadi, Yazeed Alzahrani
{"title":"Multimodal biometric identification: leveraging convolutional neural network (CNN) architectures and fusion techniques with fingerprint and finger vein data.","authors":"Amal Alshardan, Arun Kumar, Mohammed Alghamdi, Mashael Maashi, Saad Alahmari, Abeer A K Alharbi, Wafa Almukadi, Yazeed Alzahrani","doi":"10.7717/peerj-cs.2440","DOIUrl":"10.7717/peerj-cs.2440","url":null,"abstract":"<p><p>Advancements in multimodal biometrics, which integrate multiple biometric traits, promise to enhance the accuracy and robustness of identification systems. This study focuses on improving multimodal biometric identification by using fingerprint and finger vein images as the primary traits. We utilized the \"NUPT-FPV\" dataset, which contains a substantial number of finger vein and fingerprint images, which significantly aided our research. Convolutional neural networks (CNNs), renowned for their efficacy in computer vision tasks, are used in our model to extract distinct discriminative features. Specifically, we incorporate three popular CNN architectures: ResNet, VGGNet, and DenseNet. We explore three fusion strategies used in security applications: early fusion, late fusion, and score-level fusion. Early fusion integrates raw images at the input layer of a single CNN, combining information at the initial stages. Late fusion, in contrast, merges features after individual learning from each CNN model. Score-level fusion employs weighted aggregation to combine scores from each modality, leveraging the complementary information they provide. We also use contrast limited adaptive histogram equalization (CLAHE) to enhance fingerprint contrast and vein pattern features, improving feature visibility and extraction. Our evaluation metrics include accuracy, equal error rate (EER), and ROC curves. The fusion of CNN architectures and enhancement methods shows promising performance in identifying multimodal biometrics, aiming to increase identification accuracy. The proposed model offers a reliable authentication system using multiple biometrics to verify identity.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2440"},"PeriodicalIF":3.5,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623012/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
User emotion recognition and indoor space interaction design: a CNN model optimized by multimodal weighted networks.
IF 3.5 4区 计算机科学
PeerJ Computer Science Pub Date : 2024-10-31 eCollection Date: 2024-01-01 DOI: 10.7717/peerj-cs.2450
Lingyu Zhang
{"title":"User emotion recognition and indoor space interaction design: a CNN model optimized by multimodal weighted networks.","authors":"Lingyu Zhang","doi":"10.7717/peerj-cs.2450","DOIUrl":"10.7717/peerj-cs.2450","url":null,"abstract":"<p><p>In interior interaction design, achieving intelligent user-interior interaction is contingent upon understanding the user's emotional responses. Precise identification of the user's visual emotions holds paramount importance. Current visual emotion recognition methods rely solely on singular features, predominantly facial expressions, resulting in inadequate coverage of visual characteristics and low recognition rates. This study introduces a deep learning-based multimodal weighting network model to address this challenge. The model initiates with a convolutional attention module, employing a self-attention mechanism within a convolutional neural network (CNN). As a result, the multimodal weighting network model is integrated to optimize weights during training. Finally, a weight network classifier is derived from these optimized weights to facilitate visual emotion recognition. Experimental outcomes reveal a 77.057% correctness rate and a 74.75% accuracy rate in visual emotion recognition. Comparative analysis against existing models demonstrates the superiority of the multimodal weight network model, showcasing its potential to enhance human-centric and intelligent indoor interaction design.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"10 ","pages":"e2450"},"PeriodicalIF":3.5,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11623009/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信