Advances in computational intelligence最新文献

筛选
英文 中文
Motivators and detractors to integration of the user AI experience 用户AI体验整合的激励因素和损害因素
Advances in computational intelligence Pub Date : 2026-03-27 DOI: 10.1007/s43674-026-00088-2
Thomas S. Mueller
{"title":"Motivators and detractors to integration of the user AI experience","authors":"Thomas S. Mueller","doi":"10.1007/s43674-026-00088-2","DOIUrl":"10.1007/s43674-026-00088-2","url":null,"abstract":"<div><p>Developers and marketers of artificial intelligence (AI) powered innovations have pushed technology to the forefront, well ahead of user experience and ethical implications. This exploratory study used qualitative semi-structured interviews (<i>N</i> = 21) and then quantitative analysis (<i>N</i> = 629) to predict user motivators and detractors to AI adaptation. Fifty-eight percent of respondents in this study did not agree that AI will benefit our society. There was a significant difference those identified as highly feminine and the highly masculine. Those politically conservative were less likely to support AI initiatives than those politically liberal. Independent variables were factored into latent themes “Ethical Outcomes”, Social Media Risk vs. Reward”, and “Governmental Constraint.” Ethical Outcomes was the key significant predictor of “trust AI with my personal and business activities.” The effect of Social Media Risk was marginal but significant. Logistic regression with the dependent variable “overall, artificial intelligence will benefit our society” once again captured Ethical Outcomes. The concern for social media implications was 19% more likely to be considered when pondering beneficence of AI in society. Governmental Constraint was 70% more likely to occur. When AI is considered as a future religious entity, users perceive a decrease in benefit to society. A hierarchical decision tree with the target variable “trust related to AI” illustrates users first consider ethical implications of AI in society, and then face dilemmas of copyright infringement and effect on political elections.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43674-026-00088-2.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147561761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI-enhanced ECG analysis for arrhythmia classification with edge device compatibility 具有边缘设备兼容性的人工智能增强心电分析心律失常分类
Advances in computational intelligence Pub Date : 2026-01-31 DOI: 10.1007/s43674-026-00087-3
Hermawan Nugroho, Ke Jun Peng, Anandan Shanmugam
{"title":"AI-enhanced ECG analysis for arrhythmia classification with edge device compatibility","authors":"Hermawan Nugroho,&nbsp;Ke Jun Peng,&nbsp;Anandan Shanmugam","doi":"10.1007/s43674-026-00087-3","DOIUrl":"10.1007/s43674-026-00087-3","url":null,"abstract":"<div><p>Cardiovascular diseases (CVDs) are the leading cause of global mortality, necessitating efficient and accessible diagnostic tools for early detection. While deep learning models have demonstrated high accuracy in arrhythmia classification, their deployment in resource-limited clinical settings remains challenging due to computational constraints. This study addresses this gap by developing and validating lightweight deep learning models specifically optimized for edge device deployment, enabling real-time ECG analysis in remote and under-resourced healthcare environments. Utilizing the MIT-BIH Arrhythmia Dataset, which includes 48 half-hour ECG recordings from 47 individuals, we trained and evaluated 2 models: a standard Convolutional Neural Network (CNN) and a computationally efficient Depth-Wise Separable Convolutional Neural Network (DWCNN). Preprocessing involved segmenting ECG recordings into individual heartbeats and addressing class imbalance through downsampling, resulting in a balanced dataset of 21,186 images across 6 arrhythmia types. The DWCNN achieved competitive diagnostic performance (precision: 0.97, recall: 0.97) while utilizing 75% fewer parameters than the standard CNN (8.45 M vs. 34.08 M) and 65% less memory (8.4 MB vs. 24.1 MB). Critically, we demonstrate successful deployment on the Intel Neural Compute Stick 2 (NCS2), a resource-constrained edge device, achieving inference times of 4.09 ms/241.91FPS on CPU and 9.05 ms/109.79FPS on the NCS2 platform. This practical demonstration of real-time arrhythmia classification on low-power edge devices represents a significant advancement toward accessible cardiac diagnostics in point-of-care settings, remote monitoring applications, and resource-limited healthcare facilities where centralized computing infrastructure or specialist expertise may be unavailable.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147342952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A combined voting mechanism in KNN and random forest algorithms to enhance the diabetic retinopathy eye disease detection 一种结合KNN和随机森林算法的投票机制来增强糖尿病视网膜病变眼病的检测
Advances in computational intelligence Pub Date : 2026-01-22 DOI: 10.1007/s43674-025-00086-w
J. Vijaya, B. Sai Vikas, J. Jaya Surya, K. Nitheesh Kumar, B. Shriya
{"title":"A combined voting mechanism in KNN and random forest algorithms to enhance the diabetic retinopathy eye disease detection","authors":"J. Vijaya,&nbsp;B. Sai Vikas,&nbsp;J. Jaya Surya,&nbsp;K. Nitheesh Kumar,&nbsp;B. Shriya","doi":"10.1007/s43674-025-00086-w","DOIUrl":"10.1007/s43674-025-00086-w","url":null,"abstract":"<div><p>Diabetic retinopathy (DR) stands out as one of the most significant causes of treatable visual impairment and blindness worldwide. Hence, early detection coupled with timely intervention is crucial to prevent the disease from further progression. However, manually detecting DR through the use of retinal images is highly time-consuming, subjective, and often inaccessible in resource-limited settings. This research introduces a groundbreaking automated system for DR detection and severity classification. First, we collected the corresponding data and applied advanced preprocessing techniques such as resizing image, normalization, Gaussian blur, contrast limited adaptive histogram equalization (CLAHE), and image blending to improve model performance. Furthermore, we employed powerful deep learning (DL) architectures such as VGG, EfficientNet, DenseNet, ResNet50, and transformer models such as vision transformer (ViT), and swin transformer for feature extraction. The resulting features were then classified using robust machine learning algorithms, ensemble models, CNN models, and transformer-based models, including decision tree (DT), K-nearest neighbor (KNN), logistic regression (LR), support vector machine (SVM), random forest (RF), AdaBoost, and XGBoost, VGG, EfficientNet, DenseNet, ResNet50, vision transformer, swin transformer, and proposed a new hybrid technique that integrates KNN with random forest (KNN-RF). We tested the system with the appropriate key metrics: accuracy, precision, recall, and F1-score. Our findings show that the hybrid KNN-RF technique outperforms the others. Results point to promising possibilities for our approach in providing precise, scalable, and cost-effective diabetic retinopathy diagnoses in resource-scarce settings. This study emphasizes the importance of artificial intelligence in revolutionizing healthcare diagnostic processes and tackling essential global health issues.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146016142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Construction of new transfer functions and their application in solving knapsack problems with discrete Kepler optimization algorithm 新的传递函数的构造及其在离散Kepler优化算法求解背包问题中的应用
Advances in computational intelligence Pub Date : 2025-11-15 DOI: 10.1007/s43674-025-00085-x
Yichao He, Guoxin Chen, Ju Chen, Manman Meng
{"title":"Construction of new transfer functions and their application in solving knapsack problems with discrete Kepler optimization algorithm","authors":"Yichao He,&nbsp;Guoxin Chen,&nbsp;Ju Chen,&nbsp;Manman Meng","doi":"10.1007/s43674-025-00085-x","DOIUrl":"10.1007/s43674-025-00085-x","url":null,"abstract":"<div><p>Transfer function plays a crucial role in discretizing metaheuristic algorithms to solve combinatorial optimization problems. However, existing transfer functions not only have few classes, but their design methods also rely too much on the curve shape. Kepler optimization algorithm (KOA) is a novel metaheuristic algorithm that performs well in solving optimization problems on continuous domains, but cannot be directly applied to solve combinatorial optimization problems on discrete domains. In order to design more transfer functions and solve combinatorial optimization problems by KOA, this paper first proposes a practical method to construct transfer functions. From this, two new classes of transfer functions are given: A-shaped transfer functions and B-shaped transfer functions. Then, based on the transfer function, the first discrete Kepler optimization algorithm (DKOA) suitable for binary optimization problems is proposed. To verify the practicality of the new transfer functions and the efficiency of DKOA, DKOA is used to solve 0–1 knapsack problem and knapsack problem with a single continuous variable, respectively. Comparison with existing transfer functions and the state-of-the-art metaheuristic algorithms for solving these two problems shows that DKOA using A-shaped and B-shaped transfer functions is more competitive in terms of the ability to obtain optimal solutions, average performance and stability. This shows that the proposed new transfer functions are very practical, and the DKOA based on them is an effective metaheuristic algorithm for solving binary optimization problems.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"5 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145561074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting China’s producer price index for production materials via Gaussian process regression within a Bayesian inference framework 基于贝叶斯推理框架的高斯过程回归预测中国生产资料生产者价格指数
Advances in computational intelligence Pub Date : 2025-11-06 DOI: 10.1007/s43674-025-00084-y
Bingzi Jin, Xiaojie Xu
{"title":"Forecasting China’s producer price index for production materials via Gaussian process regression within a Bayesian inference framework","authors":"Bingzi Jin,&nbsp;Xiaojie Xu","doi":"10.1007/s43674-025-00084-y","DOIUrl":"10.1007/s43674-025-00084-y","url":null,"abstract":"<div><p>Projecting China’s producer price index (PPI) for production materials yields early signals of inflationary pressures and cost dynamics influencing both national economic stability and international supply networks. Reliable PPI forecasts equip policymakers, market participants, and firms with the information needed to refine monetary policy, pricing decisions, and resource allocation. This study proposes an innovative forecasting architecture based on Gaussian process regression (GPR), whose hyperparameters are estimated via a Bayesian inference procedure, enabling the model to adapt in real time to latent market fluctuations and previously unobserved structural shifts. By integrating these evolving characteristics, our approach more accurately captures changes in China’s PPI trajectory. The empirical analysis relies on a monthly dataset spanning October 1996 to February 2025, covering multiple waves of regulatory reform, industrial evolution, and macroeconomic transformation. Validation is performed over an out-of-sample period from June 2019 through February 2025, producing a relative root mean square error of 0.1120%, a root mean square error of 0.1131, a mean absolute error of 0.0832, and a correlation coefficient of 0.99984. To the best of our knowledge, this represents the first application of a Bayesian-inference-parameterized GPR model to forecast China’s PPI for production materials. Beyond advancing the theoretical discourse on machine-learning-based price prediction, the methodology provides a flexible analytical framework applicable to analogous macroeconomic time-series forecasting challenges.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"5 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145456333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparative performance analysis of a novel fusion-based algorithm for AGV navigation 一种基于融合的AGV导航新算法性能对比分析
Advances in computational intelligence Pub Date : 2025-10-29 DOI: 10.1007/s43674-025-00083-z
Ata Jahangir Moshayedi, Dangling Xu, Maryam Sharifdoust, Amir Sohail Khan, Zeashan Hameed Khan, Mehran Emadi Andani
{"title":"Comparative performance analysis of a novel fusion-based algorithm for AGV navigation","authors":"Ata Jahangir Moshayedi,&nbsp;Dangling Xu,&nbsp;Maryam Sharifdoust,&nbsp;Amir Sohail Khan,&nbsp;Zeashan Hameed Khan,&nbsp;Mehran Emadi Andani","doi":"10.1007/s43674-025-00083-z","DOIUrl":"10.1007/s43674-025-00083-z","url":null,"abstract":"<div><p>Automated guided vehicles (AGVs) are intelligent robotic systems that play a crucial role in applications, such as transportation, food delivery, and medical emergencies. One of the primary challenges in AGV deployment is achieving precise navigation to ensure task reliability, safety, and efficient battery consumption along predetermined routes. Vision-based methods have gained significant attention for their high performance in robot navigation. However, selecting the most effective algorithm with minimal sensor use remains an active area of research. This study introduces a novel and efficient fusion-based navigation method, termed the Extended Fusion 1 Method (EFM1), which integrates data from camera and infrared (IR) sensors. The system leverages feature-based algorithms, such as SIFT, ORB, FAST, SURF, BRISK, and BRIEF, to improve path-tracking accuracy. The main objective of this fusion approach is to enhance navigational precision and identify the most suitable algorithm for robust AGV path-tracking. The EFM1 method is simulated and validated using the CoppeliaSim (V-REP) simulator, incorporating the real-world dimensions of the previously developed AGV model, Hongma, via Python API. The simulation evaluates six feature-based algorithms across five distinct path types: circular, elliptical, spiral, figure-eight, and custom path. Performance is assessed in terms of maximum achievable speed, minimal path-tracking error, body orientation accuracy, and simulation time. Statistical analysis, including inferential techniques and post hoc tests, is used to interpret the results. The experimental findings demonstrate that the proposed EFM1 algorithm outperforms traditional vision-only approaches in effectively tracking all five path types, confirming its potential for reliable and efficient AGV navigation. The proposed EFM1 algorithm integrates vision and IR sensors, enhancing AGV navigation accuracy by up to 84% while using minimal sensors. It outperforms previous methods by delivering up to 22.2% faster simulation times, significantly reduced error rates (for example, ORB error decreased by 93%), and increased speed across five test paths. The comparison between investigated methods shows that FAST excels on dynamic paths with notable speed improvements, while SURF performs better on complex trajectories as confirmed by statistical analysis. Although EFM1 improves most metrics, body orientation changes increased, indicating a trade-off between agility and movement stability.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"5 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145384942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An innovative approach to hesitant bipolar fuzzy soft sets in multi-criteria group decision-making 多准则群体决策中犹豫双极模糊软集的创新方法
Advances in computational intelligence Pub Date : 2025-06-24 DOI: 10.1007/s43674-025-00082-0
Ajoy Kanti Das, Suman Patra, Carlos Granados
{"title":"An innovative approach to hesitant bipolar fuzzy soft sets in multi-criteria group decision-making","authors":"Ajoy Kanti Das,&nbsp;Suman Patra,&nbsp;Carlos Granados","doi":"10.1007/s43674-025-00082-0","DOIUrl":"10.1007/s43674-025-00082-0","url":null,"abstract":"<div><p>This paper explores the integration of hesitant bipolar fuzzy soft sets (HBFSS) into multi-criteria group decision-making (MCGDM), aiming to enhance decision precision and resolve uncertainties in expert evaluations. We introduce a novel decision-making framework that combines the root mean square deviation (RMSD) method with a credibility score, capturing both the proximity to ideal solutions and the consistency of expert opinions. The process is applied to a sustainable energy project selection problem, showcasing its efficacy in ranking alternatives such as solar farm, wind park, and hydroelectric plant. A comparative analysis with the existing model highlights the limitations of traditional approaches, including the failure to differentiate alternatives with similar scores and neglecting expert consistency. Our results demonstrate that the proposed RMSD-Credibility approach offers a more nuanced, consistent, and precise ranking, improving decision quality in complex, uncertain environments. This paper contributes to advancing decision-making under fuzzy and uncertain conditions by providing an innovative aggregation mechanism tailored to the challenges of real-world multi-criteria problems.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"5 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145168696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A predictive contrivance for recognising traits in keystroke dynamics 一种识别击键动力学特征的预测装置
Advances in computational intelligence Pub Date : 2025-05-29 DOI: 10.1007/s43674-025-00081-1
Soumen Roy, Utpal Roy, Devadatta Sinha, Rajat Kumar Pal
{"title":"A predictive contrivance for recognising traits in keystroke dynamics","authors":"Soumen Roy,&nbsp;Utpal Roy,&nbsp;Devadatta Sinha,&nbsp;Rajat Kumar Pal","doi":"10.1007/s43674-025-00081-1","DOIUrl":"10.1007/s43674-025-00081-1","url":null,"abstract":"<div><p>Predicting personal traits, particularly age group, gender, handedness, and hand(s) used, in the form of digital identity for smartphone users by analysing keystroke dynamics (KD) attributes is a challenging area. However, it has a variety of applications in e-commerce, e-banking, e-teaching/learning, e-exams, forensics, and social networking. The main bottleneck of this problem is addressing the imbalanced nature of KD datasets using conventional machine learning (ML) approaches. By their inherent nature, KD datasets are often imbalanced from various perspectives due to the non-uniformity of diverse user traits and their varied usage patterns. This study proposes a predictive model for both fixed and free-text modes, considering the effect of attached smartphone sensors. We adopt a score-level fusion of eXtreme Gradient Boosting (XGBoost) models on several balanced bootstrapped training samples to address the limitations of conventional approaches. This ensemble approach utilizes multiple bootstrapped training sets, where the class distribution in each set is equally balanced for more accurate and robust performance. Furthermore, we observe the positive impact of incorporating these prediction scores and labels with primary biometric attributes in KD-based user authentication and identification, both in static/entry-point and continuous/active security designs—a previously unanswered challenges. The predictive mechanism and its adaptation in unique KD-based designs, based on datasets collected from a considerable number of volunteers with diverse age groups, genders, professions, and education levels through a smartphone in a web environment, demonstrate the novelty of our approach.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"5 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145170902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of a DNN-based operator on edge device for keyword spotting 一种基于dnn算子的关键字定位边缘设备设计
Advances in computational intelligence Pub Date : 2025-04-16 DOI: 10.1007/s43674-025-00080-2
Chan Kok Wei, Hermawan Nugroho
{"title":"Design of a DNN-based operator on edge device for keyword spotting","authors":"Chan Kok Wei,&nbsp;Hermawan Nugroho","doi":"10.1007/s43674-025-00080-2","DOIUrl":"10.1007/s43674-025-00080-2","url":null,"abstract":"<div><p>Keyword spotting (KWS) is a critical component of voice-driven smart-device applications, requiring high accuracy, sensitivity, and responsiveness to deliver optimal user experiences. Given the always-on nature of KWS systems, minimizing computational complexity and power consumption is essential, particularly for battery-powered edge devices with constrained resources. In this paper, we propose a compact and highly efficient convolutional neural network (CNN) for edge-based KWS tasks, using the Google Speech Commands (GSC) V2 dataset for training and evaluation. Our model employs modified MobileNetV2 architecture, optimized via knowledge distillation from an ensemble of high-performing CNN models. Experimental results demonstrate that the proposed model achieves 94.48% accuracy on clean test data and significantly outperforms existing state-of-the-art edge models on challenging noisy test sets, reaching 86.38% accuracy. The proposed CNN maintains this superior performance with only 73.8K parameters and 19.5M floating-point operations (FLOPs)—approximately three times fewer FLOPs and substantially fewer parameters than previously reported edge-focused KWS models. Moreover, when evaluated on a realistic and challenging external Kaggle test set, the proposed model shows excellent generalization with 88.38% accuracy, surpassing baseline depthwise separable CNN (DS-CNN) approaches. Upon practical deployment on a widely used embedded computing platform, our optimized model achieved fast inference times between 11 ms and 14 ms per sample, outperforming existing baseline methods and confirming its suitability for real-time applications. This study highlights the successful integration of model compression techniques, including ensemble learning and knowledge distillation, to achieve breakthrough performance improvements in accuracy, robustness to noise, computational efficiency, and inference speed, thereby advancing the practical deployment of high-performance KWS solutions on resource-constrained edge devices.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"5 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143840486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Do images really do the talking? 图像真的能说话吗?
Advances in computational intelligence Pub Date : 2025-03-01 DOI: 10.1007/s43674-025-00079-9
Siddhanth U. Hegde, Adeep Hande, Ruba Priyadharshini, Sajeetha Thavareesan, Ratnasingam Sakuntharaj, Sathiyaraj Thangasamy, B. Bharathi, Bharathi Raja Chakravarthi
{"title":"Do images really do the talking?","authors":"Siddhanth U. Hegde,&nbsp;Adeep Hande,&nbsp;Ruba Priyadharshini,&nbsp;Sajeetha Thavareesan,&nbsp;Ratnasingam Sakuntharaj,&nbsp;Sathiyaraj Thangasamy,&nbsp;B. Bharathi,&nbsp;Bharathi Raja Chakravarthi","doi":"10.1007/s43674-025-00079-9","DOIUrl":"10.1007/s43674-025-00079-9","url":null,"abstract":"<div><p>A meme is a part of media created to share an opinion or emotion across the internet. Due to their popularity, memes have become the new form of communication on social media. However, they are used in harmful ways such as trolling and cyberbullying progressively due to their nature. Various data modelling methods create different possibilities in feature extraction and turn them into beneficial information. The variety of modalities included in data plays a significant part in predicting the results. We try to explore the significance of visual features of images in classifying memes. Memes are a blend of both image and text, where the text is embedded into the picture. We consider a meme to be trolling if the meme in any way tries to troll a particular individual, group, or organisation. We try to incorporate the memes as a troll and non-trolling memes based on their images and text. We evaluate if there is any major significance of the visual features for identifying whether a meme is trolling or not. Our work illustrates different textual analysis methods and contrasting multimodal approaches ranging from simple merging to cross attention to utilising both worlds’—visual and textual features. The fine-tuned cross-lingual language model, XLM, performed the best in textual analysis, and the multimodal transformer performs the best in multimodal analysis.</p></div>","PeriodicalId":72089,"journal":{"name":"Advances in computational intelligence","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s43674-025-00079-9.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143527685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书