Loukia Avramelou, P. Nousi, N. Passalis, S. Doropoulos, A. Tefas
{"title":"Cryptosentiment: A Dataset and Baseline for Sentiment-Aware Deep Reinforcement Learning for Financial Trading","authors":"Loukia Avramelou, P. Nousi, N. Passalis, S. Doropoulos, A. Tefas","doi":"10.1109/ICASSPW59220.2023.10193330","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193330","url":null,"abstract":"Deep Learning (DL) models have been applied in several studies to solve financial trading problems. Most approaches handle these problems as classification or reinforcement learning problems with the objective of developing profitable strategies. Recent works have demonstrated that supplying financial trading agents with sentiment information can lead to improved performance. However, most of these works focus on collecting sentiment in a coarse-grain manner, which is not always appropriate for making fine-grained trading decisions, e.g., on a minute basis. In this paper, we introduce a fine-grained cryptocurrency sentiment dataset, called CryptoSentiment, which contains 235,907 sentiment scores for 14 cryptocurrency assets, gathered by various online sources. Moreover, we provide Deep Reinforcement Learning (DRL) baselines using the collected dataset, investigating the impact of multi-modal features on cryptocurrency trading.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131984813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
E. Lopez, Eleonora Chiarantano, Eleonora Grassucci, D. Comminiello
{"title":"Hypercomplex Multimodal Emotion Recognition from EEG and Peripheral Physiological Signals","authors":"E. Lopez, Eleonora Chiarantano, Eleonora Grassucci, D. Comminiello","doi":"10.1109/ICASSPW59220.2023.10193329","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193329","url":null,"abstract":"Multimodal emotion recognition from physiological signals is receiving an increasing amount of attention due to the impossibility to control them at will unlike behavioral reactions, thus providing more reliable information. Existing deep learning-based methods still rely on extracted handcrafted features, not taking full advantage of the learning ability of neural networks, and often adopt a single-modality approach, while human emotions are inherently expressed in a multimodal way. In this paper, we propose a hypercomplex multimodal network equipped with a novel fusion module comprising parameterized hypercomplex multiplications. Indeed, by operating in a hypercomplex domain the operations follow algebraic rules which allow to model latent relations among learned feature dimensions for a more effective fusion step. We perform classification of valence and arousal from electroencephalogram (EEG) and peripheral physiological signals, employing the publicly available database MAHNOB-HCI surpassing a multimodal state-of-the-art network. The code of our work is freely available at https://github.com/ispamm/MHyEEG.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128534249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jasper Kirton-Wingate, Shafique Ahmed, M. Gogate, Yu-sheng Tsao, Amir Hussain
{"title":"Towards Individualised Speech Enhancement: An SNR Preference Learning System for Multi-Modal Hearing Aids","authors":"Jasper Kirton-Wingate, Shafique Ahmed, M. Gogate, Yu-sheng Tsao, Amir Hussain","doi":"10.1109/ICASSPW59220.2023.10193122","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193122","url":null,"abstract":"Since the advent of deep learning (DL), speech enhancement (SE) models have performed well under a variety of noise conditions. However, such systems may still introduce sonic artefacts, sound unnatural, and restrict the ability for a user to hear ambient sound which may be of importance. Hearing Aid (HA) users may wish to customise their SE systems to suit their personal preferences and day-to-day lifestyle. In this paper, we introduce a preference learning based SE (PLSE) model for future multi-modal HAs that can contextually exploit audio and visual information to improve listening comfort (LC). The proposed system estimates the Signal-to-noise ratio (SNR) as a basic objective speech quality measure which quantifies the relative amount of background noise present in speech, and directly correlates to the intelligibility of the signal. This is used alongside a preference elicitation framework which learns a predictive function to determine the target SNR. The system is novel, scaling the output of an AudioVisual (AV) DL-based SE model to provide HA users with individualised SE. Preliminary results support the hypothesis of improving the overall subjective LC, without significantly impeding the speech intelligibility.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129338536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
N. Babu, Kimon Karathanasopoulos, G. Vardoulias, C. Papadias
{"title":"Energy-Efficient UAV Trajectories: Simulation vs Emulation","authors":"N. Babu, Kimon Karathanasopoulos, G. Vardoulias, C. Papadias","doi":"10.1109/ICASSPW59220.2023.10193652","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193652","url":null,"abstract":"This paper uses an emulator to verify an energy-efficient trajectory for an unmanned aerial vehicle (UAV) acting as a portable access point (PAP) to serve a set of users. Specifically, we use the Common Open Research Emulator (CORE), and Extendable Mobile Ad-hoc Network Emulator (EMANE), which allow us to take theoretical assumptions regarding data transfer rates and transmission characteristics and test them in the virtualized wireless networking setting the two tools provide us. The optimal fly-hover-communicate trajectory that maximizes the system’s energy efficiency is obtained using a circle-packing algorithm. The CORE-EMANE emulator results match the simulated results, thereby verifying the practicality of the obtained trajectory solution.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126109679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Improved Calibration Method For CML Humidity Retrievals Over Complex Terrain","authors":"Y. Rubin, P. Alpert","doi":"10.1109/ICASSPW59220.2023.10193335","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193335","url":null,"abstract":"The use of commercial microwave links (CMLs) as a tool for measuring atmospheric humidity has emerged as a promising new technique for environmental monitoring. CML-based humidity retrieval utilizes the attenuation of microwave signals caused by water vapor to estimate humidity, offering several advantages over traditional sensing techniques such as low cost, high spatial and temporal resolution, and coverage over a wide range of areas. This article discusses the calibration method for determining the base line for CML humidity observations, based on assumptions from the atmospheric sciences field, enabling the use of CMLs over a wide and complex terrain area. With continued development, CMLs are likely to become an increasingly valuable tool for weather forecasting, climate monitoring, and other environmental studies.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124915018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Learning Multi-Rate Vector Quantization for Remote Deep Inference","authors":"M. Malka, Shai Ginzach, Nir Shlezinger","doi":"10.1109/ICASSPW59220.2023.10193526","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193526","url":null,"abstract":"Remote inference accommodates a broad range of scenarios, where inference is carried out using data acquired at a remote user. When the sensing and inferring users communicate over rate limited channels, compression of the data reduces latency, and deep learning enables to jointly learn the compression encoding along with the inference rule. However, because the data is compressed into a fixed number of bits, the resolution cannot be adapted to changes in channel conditions. In this work we propose a multi-rate remote deep inference scheme, which trains a single encoder-decoder model that uses learned vector quantizers while supporting different quantization levels. Our scheme is based on designing nested codebooks along with a learning algorithm based on progressive learning. Numerical results demonstrate that the proposed scheme yields remote deep inference that operates with multiple rates while approaching the performance of fixed-rate models.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125114977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ayush Kanyal, S. Kandula, Vince D. Calhoun, Dong Hye Ye
{"title":"Multi-Modal Deep Learning on Imaging Genetics for Schizophrenia Classification","authors":"Ayush Kanyal, S. Kandula, Vince D. Calhoun, Dong Hye Ye","doi":"10.1109/ICASSPW59220.2023.10193352","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193352","url":null,"abstract":"Schizophrenia (SZ) is a severe, chronic mental condition that impacts one’s capacity to think, act, and interact with others. It has been established that SZ patients have morphological changes in their brains, along with decreased hippocampal and thalamic volume. Also, it is known that patients with SZ have irregular functional brain connectivity. Furthermore, because SZ is a genetic illness, genetic markers such as single nucleotide polymorphisms (SNP) can be useful to characterize SZ patients. We propose an automatic method to detect changes in SZ patients’ brains considering its heterogeneous multi-modal nature. We present a novel deep-learning method to classify SZ subjects with morphological features from structural MRI (sMRI), brain connectivity features from functional MRI (fMRI), and genetic features from SNPs. For sMRI, we used a pre-trained DenseNet to extract convolutional features which encode the morphological changes induced by SZ. For fMRI, we choose the important connections in functional network connection (FNC) matrix by applying layer-wise relevance propagation (LRP). We also detect SZ-linked SNPs using LRP on a pre-trained 1-dimensional convolutional neural network. Combined features from these three modalities are then fed to an extreme gradient boosting (XGBoost) tree classifier for SZ diagnosis. The experiments using the clinical dataset have shown that our multi-modal approach significantly improved SZ classification accuracy compared with uni-modal deep learning methods.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130163404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Z. Huang, Akila Pemasiri, S. Denman, C. Fookes, Terrence Martin
{"title":"Multi-Task Learning For Radar Signal Characterisation","authors":"Z. Huang, Akila Pemasiri, S. Denman, C. Fookes, Terrence Martin","doi":"10.1109/ICASSPW59220.2023.10193318","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193318","url":null,"abstract":"Radio signal recognition is a crucial task in both civilian and military applications, as accurate and timely identification of unknown signals is an essential part of spectrum management and electronic warfare. The majority of research in this field has focused on applying deep learning for modulation classification, leaving the task of signal characterisation as an understudied area. This paper addresses this gap by presenting an approach for tackling radar signal classification and characterisation as a multi-task learning (MTL) problem. We propose the IQ Signal Transformer (IQST) among several reference architectures that allow for simultaneous optimisation of multiple regression and classification tasks. We demonstrate the performance of our proposed MTL model on a synthetic radar dataset, while also providing a first-of-its-kind benchmark for radar signal characterisation.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"49 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129277930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Weitong Zhai, Xiangrong Wang, Xianghua Wang, M. Amin, T. Shan
{"title":"Optimal Sparse MIMO Transceiver Design for Joint Automotive Sensing and Communications","authors":"Weitong Zhai, Xiangrong Wang, Xianghua Wang, M. Amin, T. Shan","doi":"10.1109/ICASSPW59220.2023.10193486","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193486","url":null,"abstract":"Joint automotive sensing and communication assisted by optimal sparse MIMO transceiver design is a promising technology for autonomous driving as it reduces hardware cost while preserving high angular resolution. In this paper, we propose to co-design a shared sparse MIMO transceiver within the paradigm of joint sensing and communication (JSAC). Antenna selection is performed to minimize the Cramer–Rao bound (CRB) for accurate tracking with enhanced direction of arrival (DOA) estimation. Meanwhile, the spatial precoding matrix for communications, which exhibits the same sparsity structure with the shared transmitter for automotive sensing, is optimized to deliver a desired quality of service. A solution of this problem requires the application of a series of convex relaxation strategies to transform the resultant non-convex co-design problem into a convex form. The fractional inequality is transformed into the denominator inequality with a constrained numerator and reweighted l1-norm minimization is utilized to promote binary sparsity. Simulations are provided to demonstrate the effectiveness of the optimal sparse MIMO transceiver obtained by the proposed method.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127664674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Haoyu Wang, Bei Liu, Yifei Wu, Zhengyang Chen, Y. Qian
{"title":"Lowbit Neural Network Quantization for Speaker Verification","authors":"Haoyu Wang, Bei Liu, Yifei Wu, Zhengyang Chen, Y. Qian","doi":"10.1109/ICASSPW59220.2023.10193337","DOIUrl":"https://doi.org/10.1109/ICASSPW59220.2023.10193337","url":null,"abstract":"With the continuous development of deep neural networks (DNN) in recent years, the performance of speaker verification systems has been significantly improved with the application of Deeper ResNet architectures. However, these deeper models occupy more storage space in application. In this paper, we adopt Alternate Direction Methods of Multipliers (ADMM) to realize low-bit quantization on the original ResNets. Our goal is to explore the maximal quantization compression without evident degradation in model performance. We implement different uniform quantization for each convolution layer to achieve mixed precision quantization of the entire model. Moreover, the impact of batch normalization layers in ADMM training and layer sensibility to quantization are explored. In our experiments, the 8 bit quantized ResNetl52 achieved comparable results to the full-precision one on Voxceleb 1, with only 45% of original model size. Besides, we find that shallow convolution layers are more sensitive to quantization. In addition, experimental results indicate that the model performance will be severely degraded if batch normalization layers are integrated into the convolution layer before the quantization training starts.","PeriodicalId":158726,"journal":{"name":"2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130476752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}