2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)最新文献

筛选
英文 中文
Enhancing Human Activity Recognition Through Sensor Fusion And Hybrid Deep Learning Model 通过传感器融合和混合深度学习模型增强人体活动识别
A. Tarekegn, M. Ullah, F. A. Cheikh, Muhammad Sajjad
{"title":"Enhancing Human Activity Recognition Through Sensor Fusion And Hybrid Deep Learning Model","authors":"A. Tarekegn, M. Ullah, F. A. Cheikh, Muhammad Sajjad","doi":"10.1109/ICASSPW59220.2023.10193698","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193698","url":null,"abstract":"Wearable-based human activity recognition (HAR) is essential for several applications, such as health monitoring, physical training, and rehabilitation. However, most HAR systems presently depend on a single sensor, typically a smartphone, due to its widespread use. To improve performance and adapt to various scenarios, this study focuses on a smart belt equipped with acceleration and gyroscope sensors for detecting activities of daily living (ADLs). The collected data was pre-processed, fused and used to train a hybrid deep learning model incorporating a CNN and BiLSTM network. We evaluated the effect of window length on recognition accuracy and conducted a performance analysis of the proposed model. Our framework achieved an overall accuracy of 96% at a window length of 5 seconds, demonstrating its effectiveness in recognizing ADLs. The results show that belt sensor fusion for HAR provides valuable insights into human behaviour and could enhance applications such as healthcare, fitness, and sports training.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"129 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124246913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Attention-Based Convolutional Neural Network for CT Scan COVID-19 Detection 基于注意力的卷积神经网络CT扫描COVID-19检测
Alessia Rondinella, Francesco Guarnera, O. Giudice, A. Ortis, F. Rundo, S. Battiato
{"title":"Attention-Based Convolutional Neural Network for CT Scan COVID-19 Detection","authors":"Alessia Rondinella, Francesco Guarnera, O. Giudice, A. Ortis, F. Rundo, S. Battiato","doi":"10.1109/ICASSPW59220.2023.10193471","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193471","url":null,"abstract":"The accurate detection of Covid-19 from chest Computed Tomography (CT) images can assist in early diagnosis and management of the disease. This paper presents a solution for Covid-19 detection, presented in the challenge of 3rd Covid-19 competition, inside the “AI-enabled Medical Image Analysis Workshop” organized by IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) 2023. In this work, the application of deep learning models for chest CT image analysis was investigated, focusing on the use of a ResNet as a backbone network augmented with attention mechanisms. The ResNet provides an effective feature extractor for the classification task, while the attention mechanisms improve the model’s ability to focus on important regions of interest within the images. We conducted extensive experiments on a provided dataset and achieved a macro F1 score of 0.78 on the test set, demonstrating the potential to assist the diagnosis of Covid-19. Our proposed approach leverages the power of deep learning with attention mechanisms to address the challenges of Covid-19 detection in the early detection and management of the disease. In both test and validation set, the proposed method outperformed the baseline of the challenge, ranking fifth in the competition.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115605306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Generative Models For Large-Scale Simulations Of Connectome Development 大规模连接体发育模拟的生成模型
Skylar J. Brooks, C. Stamoulis
{"title":"Generative Models For Large-Scale Simulations Of Connectome Development","authors":"Skylar J. Brooks, C. Stamoulis","doi":"10.1109/ICASSPW59220.2023.10193544","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193544","url":null,"abstract":"Functional interactions and anatomic connections between brain regions form the connectome. Its mathematical representation in terms of a graph reflects the inherent neuroanatomical organization into structures and regions (nodes) that are interconnected through neural fiber tracts and/or interact functionally (edges). Without knowledge of the ground truth topology of the connectome, functional (directional or nondirectional) graphs represent estimates of signal correlations, from which underlying mechanisms and processes, such as development and aging, or neuropathologies, are difficult to unravel. Biologically meaningful simulations using synthetic graphs with controllable parameters can complement real data analyses and provide critical insights into mechanisms underlying the organization of the connectome. Generative models can be highly valuable tools for creating large datasets of synthetic graphs with known topological characteristics. However, for these graphs to be meaningful, the variation of model parameters needs to be driven by real data. This paper presents a novel, data-driven approach for tuning the parameters of the generative LancichinettiFortunato-Radicchi (LFR) model, using a large dataset of connectomes (n = 5566) estimated from resting-state fMRI from early adolescents in the historically large Adolescent Brain Cognitive Development Study (ABCD). It also presents an application, i.e., simulations using the LFR, to generate large datasets of synthetic graphs representing brains at different stages of neural maturation, and gain insights into developmental changes in their topological organization.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122619217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Querying A Sign Language Dictionary with Videos Using Dense Vector Search 使用密集向量搜索查询视频手语词典
Mathieu De Coster, J. Dambre
{"title":"Querying A Sign Language Dictionary with Videos Using Dense Vector Search","authors":"Mathieu De Coster, J. Dambre","doi":"10.1109/ICASSPW59220.2023.10193531","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193531","url":null,"abstract":"To search for an unknown sign in a sign language dictionary, users typically indicate parameters of the query, e.g., hand shape and signing location. Recent advances in sign language recognition enable video-based sign language dictionary search. In such a system, users can record an unknown sign and retrieve a list of signs that look similar, preferably including the queried sign as one of the top results. We have realized such a system by interpreting it as a dense vector search task. First, we learn a mapping (embedding) from sign videos to a vector space. The dictionary can then be searched by looking for the vectors in this space that are closest to the vector corresponding to the query. We present a proof of concept on a subset of the Flemish Sign Language dictionary. Further research is required to scale up our method to the large vocabularies of entire dictionaries.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129744080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Easier Notation – a Proposal for a Gloss-Based Scripting Language for Sign Language Generation Based on Lexical Data 更容易的符号-基于词汇数据的手语生成基于术语表的脚本语言的建议
Thomas Hanke, Lutz König, Reiner Konrad, Maria Kopf, Marc Schulder, Rosalee J. Wolfe
{"title":"Easier Notation – a Proposal for a Gloss-Based Scripting Language for Sign Language Generation Based on Lexical Data","authors":"Thomas Hanke, Lutz König, Reiner Konrad, Maria Kopf, Marc Schulder, Rosalee J. Wolfe","doi":"10.1109/ICASSPW59220.2023.10192997","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10192997","url":null,"abstract":"We introduce EASIER Notation, a gloss-based scripting language to describe sign language content to be signed by an avatar and describe the functionality a lexical database for a sign language needs to provide in order to fully support the notation approach. In addition, we present the prototype of a text editor supporting EASIER Notation for human post-editing of machine translation output as well as pre-scribing signed utterances from scratch.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128764509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
On Hybrid Free-Space Optic-Radio Systems as Enablers of 6G Services Over Non-Terrestrial Networks 在非地面网络上实现6G业务的混合自由空间光无线电系统
M. Amay, J. Bas
{"title":"On Hybrid Free-Space Optic-Radio Systems as Enablers of 6G Services Over Non-Terrestrial Networks","authors":"M. Amay, J. Bas","doi":"10.1109/ICASSPW59220.2023.10193454","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193454","url":null,"abstract":"6G envisions new services such as holographic communications, virtual reality, digital twins, fiber on the sky, augmented reality to name a few of them. These new services will require a large capacity and so, they will be allocated in very high frequency bands, e.g., mmWave, TeraHertz, as well as the optical ones (i.e., fiber and optical wireless). Thus, it is foreseen that 6G will combine free-space optic (FSO) and radio frequency (RF) bands to offer more capacity, resilience to channel impairments and security (e.g. quantum and post-quantum-based security). This paper provides analysis and results on the throughput, and outage probability for the capacity, resilient, and security architectures of hybrid optical-radio systems. For more accuracy, this paper assumes that the optical and radio links have atmospheric impairments–in the optical link, there is a strong turbulence modelled using Gamma-Gamma distribution, whereas in the radio link, there is a fading modelled using Nakagami-m distribution.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116260776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Role of Audio In Video Summarization 音频在视频总结中的作用
Ibrahim Shoer, Berkay Köprü, E. Erzin
{"title":"Role of Audio In Video Summarization","authors":"Ibrahim Shoer, Berkay Köprü, E. Erzin","doi":"10.1109/ICASSPW59220.2023.10192578","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10192578","url":null,"abstract":"Video summarization attracts attention for efficient video representation, retrieval, and browsing to ease volume and traffic surge problems. Although video summarization mostly uses the visual channel for compaction, the benefits of audio-visual modeling appeared in recent literature. The information coming from the audio channel can be a result of audio-visual correlation in the video content. In this study, we propose a new audio-visual video summarization framework integrating four ways of audio-visual information fusion with GRU-based and attention-based networks. Furthermore, we investigate a new explainability methodology using audio-visual canonical correlation analysis (CCA) to better understand and explain the role of audio in the video summarization task. Experimental evaluations on the TVSum dataset attain F1 score and Kendall-tau score improvements for the audio-visual video summarization. Furthermore, splitting video content on TVSum and COGNIMUSE datasets based on audio-visual CCA as positively and negatively correlated videos yields a strong performance improvement over the positively correlated videos for audio-only and audio-visual video summarization.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125975376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of Driver’s Stress Affection in Simulated Autonomous Driving Scenarios 模拟自动驾驶场景下驾驶员应力影响预测
Valerio De Caro, Herbert Danzinger, C. Gallicchio, Clemens Könczöl, Vincenzo Lomonaco, Mina Marmpena, S. Politi, O. Veledar, D. Bacciu
{"title":"Prediction of Driver’s Stress Affection in Simulated Autonomous Driving Scenarios","authors":"Valerio De Caro, Herbert Danzinger, C. Gallicchio, Clemens Könczöl, Vincenzo Lomonaco, Mina Marmpena, S. Politi, O. Veledar, D. Bacciu","doi":"10.1109/ICASSPW59220.2023.10193353","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193353","url":null,"abstract":"We investigate the task of predicting stress affection from physiological data of users experiencing simulations of autonomous driving. We approach this task on two levels of granularity, depending on whether the prediction is performed at the end of the simulation, or along the simulation. In the former, denoted as coarse-grained prediction, we employed Decision Trees. In the latter, denoted as fine-grained prediction, we employed Echo State Networks, a Recurrent Neural Network that allows efficient learning from temporal data and hence is suitable for pervasive environments. We conduct experiments on a private dataset of physiological data from people participating in multiple driving scenarios simulating different stress-inducing events. The results show that the proposed model is capable of detecting event-related stress reactions, proving the existence of a correlation between stress-inducing events and the physiological data.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127472436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Improved Autofocus Algorithm With Bayesian Tracking of Residual Motion For Automotive MIMO-SAR Imaging 基于贝叶斯残馀运动跟踪的汽车MIMO-SAR自动对焦改进算法
Gabriele Balducci, M. Manzoni, S. Tebaldini, A. M. Guarnieri, C. Prati, Ivan Russo
{"title":"An Improved Autofocus Algorithm With Bayesian Tracking of Residual Motion For Automotive MIMO-SAR Imaging","authors":"Gabriele Balducci, M. Manzoni, S. Tebaldini, A. M. Guarnieri, C. Prati, Ivan Russo","doi":"10.1109/ICASSPW59220.2023.10193027","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193027","url":null,"abstract":"Automotive Synthetic Aperture Radar (SAR) is a promising technology for autonomous driving, where reliable perception of the environment is needed. Though, SAR focusing needs precise vehicle’s trajectory knowledge, not compatible with automotive-grade navigation systems. Current autofocus algorithms refine navigation-based trajectory with radar data but do not exploit vehicle’s dynamic in the residual motion estimation. This paper investigates the injection of a-priori knowledge into residual motion estimation to achieve improved and physically consistent SAR imaging. An autoregressive model of the residual velocities and Bayesian tracking via Kalman Filter are proposed and deeply studied upon application on real data acquired in an open road campaign. A new metric is introduced to quantitatively compare the outcomes: the variance of Hough lines angular coefficients. Experimental results confirm that the metric is informative, and the presence of memory in the residual motion estimation is effective in better estimating residual velocity and, consequently, improved SAR imaging.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130332512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Source Localization and Functional Connectivity Estimation With State-Space Models: Preliminary Feasibility Analysis 基于状态空间模型的动态源定位和功能连通性估计:初步可行性分析
J. Bornot, R. Sotero, D. Coyle
{"title":"Dynamic Source Localization and Functional Connectivity Estimation With State-Space Models: Preliminary Feasibility Analysis","authors":"J. Bornot, R. Sotero, D. Coyle","doi":"10.1109/ICASSPW59220.2023.10193527","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193527","url":null,"abstract":"Dynamic imaging of source and functional connectivity (FC) using electroencephalographic (EEG) signals is essential for understanding the brain and cognition with sufficiently affordable technology to be widely applicable for studying changes associated with healthy ageing and the progression of neuropathology. We present an application for group analysis of recently developed state-space models and algorithms for simultaneously estimating the large-scale EEG inverse and FC problems. This approach reduces estimation bias and facilitates a detailed exploration and investigation of neuronal dynamics compared to current techniques. We present feasibility analyses for simulated and real EEG event-related data. The latter analysis uses a sixteen subjects EEG (Wakeman and Henson’s) database, with signals recorded during a face-processing task. We implement a state-space methodology efficiently using an alternating least squares (ALS) algorithm. This application to neuroimaging analysis may be critical to reliably capture the brain dynamics despite interindividual variability, as demonstrated by the results presented.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130453331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信