Intelligent Systems with Applications: Latest Articles

A sentiment analysis approach for understanding users’ perception of metaverse marketplace
Intelligent Systems with Applications Pub Date: 2024-03-19 DOI: 10.1016/j.iswa.2024.200362
Ahmed Al-Adaileh, Mousa Al-Kfairy, Mohammad Tubishat, Omar Alfandi
Volume 22, Article 200362
PDF: https://www.sciencedirect.com/science/article/pii/S2667305324000383/pdfft?md5=408db24ecd15b5edd94a070515a178eb&pid=1-s2.0-S2667305324000383-main.pdf
Abstract: This research explores user perceptions of the Metaverse Marketplace, analyzing a substantial dataset of over 860,000 Twitter posts through sentiment analysis and topic modeling techniques. The study aims to uncover the driving factors behind user engagement and sentiment in this novel digital trading space. Key findings highlight a predominantly positive user sentiment, with significant enthusiasm for the marketplace's revenue generation and entertainment potential, particularly within the gaming sector. Users express appreciation for the innovative opportunities the Metaverse Marketplace offers for artists, designers, and traders in handling and trading digital assets. This positive outlook is tempered by notable concerns regarding security and privacy within the Metaverse, pointing to a critical area for development and assurance. The study also reveals a substantial neutral sentiment, reflecting users’ cautious but interested stance, particularly regarding the marketplace's role in investment and passive income opportunities. This balanced view underscores the evolving nature of user perceptions in this emerging field. Theoretically, the research enriches the discourse on technology adoption, particularly in virtual environments, by highlighting perceived benefits and enjoyment as significant adoption drivers. These insights are invaluable for stakeholders in the Metaverse Marketplace, guiding the development of more secure, engaging, and user-friendly platforms. While providing a pioneering perspective on Metaverse user perceptions, the study acknowledges its limitation to Twitter data, suggesting the need for broader research methodologies for a more holistic understanding.
Citations: 0
DeLiVoTr: Deep and light-weight voxel transformer for 3D object detection
Intelligent Systems with Applications Pub Date: 2024-03-19 DOI: 10.1016/j.iswa.2024.200361
Gopi Krishna Erabati, Helder Araujo
Volume 22, Article 200361
PDF: https://www.sciencedirect.com/science/article/pii/S2667305324000371/pdfft?md5=a6e557978ff347c6423116d4ba2f6a20&pid=1-s2.0-S2667305324000371-main.pdf
Abstract: Image-based backbone (feature extraction) networks downsample the feature maps not only to increase the receptive field but also to efficiently detect objects of various scales. The existing feature extraction networks in LiDAR-based 3D object detection tasks follow the same feature map downsampling as image-based feature extraction networks to increase the receptive field. However, such downsampling of LiDAR feature maps in large-scale autonomous driving scenarios hinders the detection of small objects, such as pedestrians. To solve this issue, we design an architecture that maintains not only the scale of the feature maps but also the receptive field in the feature extraction network, to aid efficient detection of small objects. We resort to an attention mechanism to build a sufficient receptive field, and we propose a Deep and Light-weight Voxel Transformer (DeLiVoTr) network with voxel intra- and inter-region transformer modules to extract voxel local and global features, respectively. We introduce the DeLiVoTr block, which uses transformations with an expand-and-reduce strategy to vary the width and depth of the network efficiently. This facilitates learning wider and deeper voxel representations and enables the use of not only a smaller dimension for the attention mechanism but also a light-weight feed-forward network, reducing parameters and operations. In addition to model scaling, we employ layer-level scaling of DeLiVoTr encoder layers for efficient parameter allocation in each encoder layer, instead of a fixed number of parameters as in existing approaches. Leveraging layer-level depth and width scaling, we formulate three variants of the DeLiVoTr network. We conduct extensive experiments and analysis on the large-scale Waymo and KITTI datasets. Our network surpasses state-of-the-art methods for detection of small objects (pedestrians) with an inference speed of 20.5 FPS.
Citations: 0
Review of ambiguity problem in text summarization using hybrid ACA and SLR
Intelligent Systems with Applications Pub Date: 2024-03-19 DOI: 10.1016/j.iswa.2024.200360
Sutriawan Sutriawan, Supriadi Rustad, Guruh Fajar Shidik, Pujiono Pujiono, Muljono Muljono
Volume 22, Article 200360
PDF: https://www.sciencedirect.com/science/article/pii/S266730532400036X/pdfft?md5=3c2870d3b3f87a6ef8f6576559396413&pid=1-s2.0-S266730532400036X-main.pdf
Abstract: Text summarization is the process of creating a summary that contains the important information from a text document. In recent years, significant progress has been made in text summarization research, along with the challenges that drive research progress in the field at large. The growth of textual data has sparked great interest in text summarization research, which is thoroughly reviewed in this survey study. Improvements in text summarization continue to be made with various approaches, such as abstractive and extractive. The abstractive approach uses an intermediate representation of the input document to produce a summary that may differ from the original text. The extractive approach extracts key sentences from the source document and combines them to form a summary. Despite the various methodologies and approaches recommended, the summaries produced still contain ambiguities that can be interpreted with different meanings, resulting in errors in defining ambiguities, uncertainty in measuring the quality of summaries, difficulty in modeling linguistic context, difficulty in representing semantic meanings, and difficulty in specifying types of ambiguities. This survey offers a comprehensive exploration of text summarization research, covering challenges, classifications, approaches, preprocessing methods, features, techniques, and evaluation methods, to meet future research needs. The results provide an overview of the state of the art of recent research on ambiguity resolution in text summarization, such as trends in research topics and the approaches or techniques used to address ambiguity problems in text summarization.
Citations: 0
MEFF – A model ensemble feature fusion approach for tackling adversarial attacks in medical imaging
Intelligent Systems with Applications Pub Date: 2024-03-16 DOI: 10.1016/j.iswa.2024.200355
Laith Alzubaidi, Khamael AL–Dulaimi, Huda Abdul-Hussain Obeed, Ahmed Saihood, Mohammed A. Fadhel, Sabah Abdulazeez Jebur, Yubo Chen, A.S. Albahri, Jose Santamaría, Ashish Gupta, Yuantong Gu
Volume 22, Article 200355
PDF: https://www.sciencedirect.com/science/article/pii/S2667305324000310/pdfft?md5=5fa2dc401268f3c29a24c198fa07f620&pid=1-s2.0-S2667305324000310-main.pdf
Abstract: Adversarial attacks pose a significant threat to deep learning (DL) models, particularly in medical imaging, as they can mislead models into making inaccurate predictions by introducing subtle distortions to the input data that are often imperceptible to humans. Although adversarial training is a common technique used to mitigate these attacks on medical images, it lacks the flexibility to address new attack methods and effectively improve feature representation. This paper introduces a novel Model Ensemble Feature Fusion (MEFF) approach designed to combat adversarial attacks in medical image applications. The proposed model employs feature fusion by combining features extracted from different DL models and then trains Machine Learning classifiers using the fused features. It uses a concatenation method to merge the extracted features, forming a more comprehensive representation and enhancing the model's ability to classify classes accurately. Our experimental study performs a comprehensive evaluation of MEFF, considering several challenging scenarios, including 2D and 3D images, greyscale and colour images, binary classification, and multi-label classification. The reported results demonstrate the robustness of MEFF against different types of adversarial attacks across six distinct medical image applications. A key advantage of MEFF is its capability to incorporate a wide range of adversarial attacks without the need to train from scratch. Therefore, it contributes to developing a more diverse and robust defence strategy. More importantly, by leveraging feature fusion and ensemble modelling, MEFF enhances the resilience of DL models in the face of adversarial attacks, paving the way for improved robustness and reliability in medical image analysis.
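The fusion step named in the MEFF abstract (concatenating features from several models, then training a classifier on the fused vector) might look roughly like the sketch below. This is only an illustration under loud assumptions: the two extractor functions are hypothetical stand-ins for pretrained DL backbones, and the nearest-centroid classifier is a placeholder for whichever Machine Learning classifiers the paper actually trains. None of these names come from the paper.

```python
from typing import Callable, Dict, List, Sequence

# Hypothetical type: any function mapping a raw sample to a feature vector.
FeatureExtractor = Callable[[Sequence[float]], List[float]]


def mean_and_max(pixels: Sequence[float]) -> List[float]:
    """Stand-in for one backbone's features (assumption, not from the paper)."""
    return [sum(pixels) / len(pixels), max(pixels)]


def min_and_range(pixels: Sequence[float]) -> List[float]:
    """Stand-in for a second backbone's features (assumption)."""
    return [min(pixels), max(pixels) - min(pixels)]


def fuse_features(sample: Sequence[float],
                  extractors: Sequence[FeatureExtractor]) -> List[float]:
    """Concatenate the feature vectors of all extractors into one vector."""
    fused: List[float] = []
    for extract in extractors:
        fused.extend(extract(sample))
    return fused


def train_centroids(rows: Sequence[Sequence[float]],
                    labels: Sequence[int]) -> Dict[int, List[float]]:
    """Placeholder classifier: one mean vector (centroid) per class."""
    sums: Dict[int, List[float]] = {}
    counts: Dict[int, int] = {}
    for row, label in zip(rows, labels):
        acc = sums.setdefault(label, [0.0] * len(row))
        for i, value in enumerate(row):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [v / counts[label] for v in acc] for label, acc in sums.items()}


def predict(centroids: Dict[int, List[float]], fused: Sequence[float]) -> int:
    """Return the label of the centroid closest to the fused vector."""
    def sq_dist(label: int) -> float:
        return sum((a - b) ** 2 for a, b in zip(centroids[label], fused))
    return min(centroids, key=sq_dist)


extractors = [mean_and_max, min_and_range]
train_x = [[0.1, 0.2, 0.3], [0.8, 0.9, 1.0]]   # toy "images"
train_y = [0, 1]
model = train_centroids([fuse_features(x, extractors) for x in train_x], train_y)
print(predict(model, fuse_features([0.7, 0.9, 0.95], extractors)))
```

One consequence of this design is that a new extractor can be appended to `extractors` without retraining the existing ones; only the lightweight classifier on top is refit.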
Citations: 0
IoT data sharing technology based on blockchain and federated learning algorithms
Intelligent Systems with Applications Pub Date: 2024-03-16 DOI: 10.1016/j.iswa.2024.200359
Zhiqiang Feng
Volume 22, Article 200359
PDF: https://www.sciencedirect.com/science/article/pii/S2667305324000358/pdfft?md5=eadc7c0c02f671c3d2bfcdcae178083b&pid=1-s2.0-S2667305324000358-main.pdf
Abstract: To share data on Internet of Things devices more securely, accurately, and efficiently, this study designs a layered sharing architecture based on blockchain and federated learning. This architecture achieves efficient and secure Internet of Things data sharing through client node clustering and blockchain consensus processes. In addition, to address the imbalanced distribution of data labels across system devices, a device clustering federated learning algorithm based on label similarity is designed to improve the accuracy and stability of the model. The experimental results showed that under both independent and identically distributed (IID) and non-IID data, the proposed algorithm achieved 95% accuracy after 30 iterations, with relatively low communication cost. When testing algorithm stability under non-IID data, accuracy increased with the number of label categories. When the number of label categories M = 12, the accuracy reached 96.0%. In the medical sharing system of a hospital, the proposed system took about 42.9% less time to extract information than the original system, and the accuracy was maintained above 98%. This method can effectively solve the problem of uneven distribution of device data labels and improve the data transmission efficiency and accuracy of Internet of Things data sharing systems. Moreover, it can also reduce the impact of malicious nodes on the global model, providing technical support for data transmission and security protection in other fields.
Citations: 0
Cluster-based oversampling with area extraction from representative points for class imbalance learning
Intelligent Systems with Applications Pub Date: 2024-03-16 DOI: 10.1016/j.iswa.2024.200357
Zakarya Farou, Yizhi Wang, Tomáš Horváth
Volume 22, Article 200357
PDF: https://www.sciencedirect.com/science/article/pii/S2667305324000334/pdfft?md5=a11f2bb04866bb8768451b4018887e0e&pid=1-s2.0-S2667305324000334-main.pdf
Abstract: Class imbalance learning is challenging in various domains where training datasets exhibit disproportionate samples in a specific class. Resampling methods have been used to adjust the class distribution, but they often have limitations for small disjunct minority subsets. This paper introduces AROSS, an adaptive cluster-based oversampling approach that addresses these limitations. AROSS utilizes an optimized agglomerative clustering algorithm with the Cophenetic Correlation Coefficient and the Bayesian Information Criterion to identify representative areas of the minority class. Safe and half-safe areas are obtained using an incremental k-Nearest Neighbor strategy, and oversampling is performed with a truncated hyperspherical Gaussian distribution. Experimental evaluations on 70 binary datasets demonstrate the effectiveness of AROSS in improving class imbalance learning performance, making it a promising solution for mitigating class imbalance challenges, especially for small disjunct minority subsets.
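To make the phrase "truncated hyperspherical Gaussian distribution" from the AROSS abstract more concrete, here is a minimal sketch of one way to draw synthetic points from an isotropic Gaussian that is cut off at a fixed radius around a representative point. The abstract does not give the parameterization, so `sigma`, `radius`, and the use of rejection sampling are all assumptions made for illustration and should not be read as the authors' procedure.

```python
import random
from typing import List, Sequence


def sample_truncated_gaussian(center: Sequence[float],
                              sigma: float,
                              radius: float,
                              rng: random.Random,
                              max_tries: int = 10_000) -> List[float]:
    """Draw one point from an isotropic Gaussian around `center`,
    rejecting draws that fall outside the hypersphere of `radius`."""
    for _ in range(max_tries):
        # Independent Gaussian offset in every dimension (isotropic).
        offset = [rng.gauss(0.0, sigma) for _ in center]
        # Accept only offsets inside the hypersphere (the truncation).
        if sum(o * o for o in offset) <= radius * radius:
            return [c + o for c, o in zip(center, offset)]
    raise RuntimeError("no sample accepted; radius may be too small for sigma")


rng = random.Random(0)          # fixed seed so runs are repeatable
center = [2.0, -1.0]            # hypothetical representative minority point
synthetic = [sample_truncated_gaussian(center, sigma=0.5, radius=1.0, rng=rng)
             for _ in range(5)]
for point in synthetic:
    print(point)
```

Rejection sampling is simple but becomes slow when `radius` is small relative to `sigma` or when the dimension is high, which is why the sketch caps the number of attempts.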
Citations: 0
Speed meets accuracy: Advanced deep learning for efficient Orientia tsutsugamushi bacteria assessment in RNAi screening
Intelligent Systems with Applications Pub Date: 2024-03-16 DOI: 10.1016/j.iswa.2024.200356
Potjanee Kanchanapiboon, Chuenchat Songsaksuppachok, Porncheera Chusorn, Panrasee Ritthipravat
Volume 22, Article 200356
PDF: https://www.sciencedirect.com/science/article/pii/S2667305324000322/pdfft?md5=2d06cfac57033fbe4635f13bd56c5c03&pid=1-s2.0-S2667305324000322-main.pdf
Abstract: This study investigates the use of advanced computer vision techniques for assessing the severity of Orientia tsutsugamushi bacterial infectivity. It uses fluorescent scrub typhus images obtained from molecular screening, and addresses challenges posed by a complex and extensive image dataset, with limited computational resources. Our methodology integrates three key strategies within a deep learning framework: transitioning from instance segmentation (IS) models to an object detection model; reducing the model's backbone size; and employing lower-precision floating-point calculations. These approaches were systematically evaluated to strike an optimal balance between model accuracy and inference speed, crucial for effective bacterial infectivity assessment. A significant outcome is that the implementation of the Faster R-CNN architecture, with a shallow backbone and reduced precision, notably improves accuracy and reduces inference time in cell counting and infectivity assessment. This innovative approach successfully addresses the limitations of image processing techniques and IS models, effectively bridging the gap between sophisticated computational methods and modern molecular biology applications. The findings underscore the potential of this integrated approach to enhance the accuracy and efficiency of bacterial infectivity evaluations in molecular research.
Citations: 0
Exploration of advancements in handwritten document recognition techniques
Intelligent Systems with Applications Pub Date: 2024-03-15 DOI: 10.1016/j.iswa.2024.200358
Vanita Agrawal, Jayant Jagtap, M.V.V. Prasad Kantipudi
Volume 22, Article 200358
PDF: https://www.sciencedirect.com/science/article/pii/S2667305324000346/pdfft?md5=008f9ee0edb201f02c7d97e969505812&pid=1-s2.0-S2667305324000346-main.pdf
Abstract: Handwritten document recognition and classification are among the many computer-related problems being studied for digitizing handwritten data. A handwritten document comprises text, diagrams, mathematical expressions, numerals, and tables. Due to the variety of writing styles and the intricacy of written language, recognizing handwritten material has proven difficult. As a result, numerous handwritten document recognition systems have been developed, each with unique benefits and drawbacks. The paper reviews the evolution of handwritten document recognition both qualitatively and quantitatively. First, a bibliometric survey of handwritten document recognition in the Scopus database is presented, based on the number of articles, citations, countries, authors, etc. Then, the learning techniques used for handwritten documents are surveyed: text recognition, digit recognition, mathematical expression recognition, table recognition, and diagram recognition. The paper also presents directions for future research in handwritten document recognition.
Citations: 0
Tree boosting methods for balanced and imbalanced classification and their robustness over time in risk assessment
Intelligent Systems with Applications Pub Date: 2024-03-12 DOI: 10.1016/j.iswa.2024.200354
Gissel Velarde, Michael Weichert, Anuj Deshmunkh, Sanjay Deshmane, Anindya Sudhir, Khushboo Sharma, Vaibhav Joshi
Volume 22, Article 200354
PDF: https://www.sciencedirect.com/science/article/pii/S2667305324000309/pdfft?md5=be6e208c32a749998c8ea1ee56dcab8e&pid=1-s2.0-S2667305324000309-main.pdf
Abstract: Most real-world classification problems deal with imbalanced datasets, posing a challenge for Artificial Intelligence (AI), i.e., machine learning algorithms, because the minority class, which is of extreme interest, often proves difficult to detect. This paper empirically evaluates the performance of tree boosting methods given different dataset sizes and class distributions, from perfectly balanced to highly imbalanced. For tabular data, tree-based methods such as XGBoost stand out in several benchmarks due to detection performance and speed. Therefore, XGBoost and Imbalance-XGBoost are evaluated. After introducing the motivation to address risk assessment with machine learning, the paper reviews evaluation metrics for detection systems or binary classifiers. It proposes a method for data preparation followed by tree boosting methods including hyper-parameter optimization. The method is evaluated on private datasets of 1K, 10K, and 100K samples with distributions of 50, 45, 25, and 5 percent positive samples. As expected, the developed method increases its recognition performance as more data is given for training, and the F1 score decreases as the data distribution becomes more imbalanced, but it is still significantly superior to the precision-recall baseline determined by the ratio of positives divided by positives and negatives. Sampling to balance the training set does not provide consistent improvement and deteriorates detection. In contrast, classifier hyper-parameter optimization improves recognition, but should be applied carefully depending on data volume and distribution. Finally, the developed method is robust to data variation over time up to some point. Retraining can be used when performance starts deteriorating.
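The baseline mentioned in this abstract, positives divided by positives plus negatives, is simply the prevalence of the positive class, which is the precision a no-skill classifier would be expected to reach. The short sketch below computes that baseline for the four class distributions the abstract lists and shows how an F1 score is derived from confusion-matrix counts. The 10K dataset size is taken from the abstract; the function names and the example counts are illustrative assumptions, not the paper's code or results.

```python
def prevalence_baseline(positives: int, negatives: int) -> float:
    """Precision-recall baseline: positives / (positives + negatives)."""
    total = positives + negatives
    if total == 0:
        raise ValueError("dataset is empty")
    return positives / total


def f1_score(tp: int, fp: int, fn: int) -> float:
    """Harmonic mean of precision and recall from raw counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Baselines for a 10K-sample dataset at the abstract's class distributions.
n_samples = 10_000
for pct_positive in (50, 45, 25, 5):
    positives = n_samples * pct_positive // 100
    baseline = prevalence_baseline(positives, n_samples - positives)
    print(f"{pct_positive:>2}% positive -> baseline {baseline:.2f}")

# Hypothetical confusion counts, only to show how F1 is computed.
print(f"F1 = {f1_score(tp=8, fp=2, fn=2):.2f}")
```

Comparing a model's F1 score against this prevalence baseline, rather than against 0.5, is what makes results on the 5 percent distribution interpretable.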
Citations: 0
In-depth investigation of speech emotion recognition studies from past to present –The importance of emotion recognition from speech signal for AI–
Intelligent Systems with Applications Pub Date: 2024-03-11 DOI: 10.1016/j.iswa.2024.200351
Yeşim ÜLGEN SÖNMEZ, Asaf VAROL
Volume 22, Article 200351
PDF: https://www.sciencedirect.com/science/article/pii/S2667305324000279/pdfft?md5=1617124db6cea95a53e38e62a54e8824&pid=1-s2.0-S2667305324000279-main.pdf
Abstract: In the super smart society (Society 5.0), new and rapid methods are needed in speech recognition, emotion recognition, and speech emotion recognition to maximize human-machine or human-computer interaction and collaboration. A speech signal contains much information about the speaker, such as age, sex, ethnicity, health condition, emotion, and thoughts. The field of study that analyzes a person's mood from speech is called speech emotion recognition (SER). Classifying emotions from speech data is a complicated problem for artificial intelligence and its sub-discipline, machine learning, because it is hard to analyze a speech signal that contains various frequencies and characteristics. Speech data are digitized with signal processing methods and speech features are obtained. These features vary depending on emotions such as sadness, fear, anger, happiness, boredom, and confusion. Even though different methods have been developed for determining audio properties and recognizing emotion, the success rate varies depending on the languages, cultures, emotions, and datasets. Speech emotion recognition needs new methods that can be applied to datasets of different sizes, increase classification success, yield the best features, and are affordable. The success rates are affected by many factors, such as the methods used, the lack of speech emotion datasets, the homogeneity of the database, the difficulty of the language (linguistic differences), the noise in the audio data, and the length of the audio data. Within the scope of this study, studies on emotion recognition from speech signals from past to present are analyzed in detail. Classification studies based on a discrete emotion model are examined that use speech data from the Berlin emotional database (EMO-DB), the Italian emotional speech database (EMOVO), the Surrey audio-visual expressed emotion database (SAVEE), and the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS), which are mostly independent of speaker and content. The results of both classical classifiers and deep learning methods are compared. Deep learning results are more successful, but classical classification is more important in determining the defining features of speech, song, or voice, and so it improves the feature extraction stage. This study can contribute to the literature and help researchers in the SER field.
Citations: 0