Lachlan I. Birnie, P. Samarasinghe, T. Abhayapala, Daniel Grixti-Cheng
{"title":"Noise RETF Estimation and Removal for Low SNR Speech Enhancement","authors":"Lachlan I. Birnie, P. Samarasinghe, T. Abhayapala, Daniel Grixti-Cheng","doi":"10.1109/mlsp52302.2021.9596209","DOIUrl":"https://doi.org/10.1109/mlsp52302.2021.9596209","url":null,"abstract":"A method for offline two-microphone speech enhancement in highly adverse noisy environments with signal-to-noise (SNR) ratios of −10 to −20 dB is proposed. While the topic of speech enhancement is well researched, there are very few methods developed to address such significant noise conditions. Specifically, we are interested in removing noise from unintelligible recordings such that the resulting denoised speech content is understandable to human listeners. We propose exploiting the Relative Transfer Function (ReTF), a spatial feature of the noise source in a speech enhancement algorithm. We model the noise source ReTF with a time-domain machine learning structure to estimate and subtract the noise signal from the mixture. Both a linear filtering and an autoen-coder based structure are proposed. For a single interfering noise source, speech intelligibility is improved to within 9% below the Short-Time Objective Intelligibility (STOI) score of the benchmark oracle Ideal Binary Mask (IBM).","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124404360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Detecting Network State in the Presence of Varying Levels of Congestion","authors":"R. Fréin, Obinna Izima, Ali Malik","doi":"10.1109/mlsp52302.2021.9596271","DOIUrl":"https://doi.org/10.1109/mlsp52302.2021.9596271","url":null,"abstract":"We consider the problem of estimating the state of computer networks which are delivering video in the presence of other interfering services. Existing methods for measuring jitter, a Quality of Delivery (QoD) measure for video, are based on statically configured IIR filters. They do not attempt to estimate the congestion in the network that caused the jitter to change. As a result, steps taken to improve QoD are frequently taken blindly. We pose the problem of estimating jitter as the problem of estimating a target source in the presence of interfering sources. To evaluate the approach we capture QoD measurements for a target video client from a six router networking test-bed where video is delivered over a substrate which is shared with varying levels of interfering sources which cause congestion. We demonstrate the performance of the new jitter estimator as part of a background congestion level detector. Numerical results based on real data show that considerable gains in congestion state classification are achieved for all congestion levels.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125176844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Isaac Sebenius, Alexander Campbell, S. Morgan, E. Bullmore, Pietro Lio'
{"title":"Multimodal Graph Coarsening for Interpretable, MRI-Based Brain Graph Neural Network","authors":"Isaac Sebenius, Alexander Campbell, S. Morgan, E. Bullmore, Pietro Lio'","doi":"10.1109/MLSP52302.2021.9690626","DOIUrl":"https://doi.org/10.1109/MLSP52302.2021.9690626","url":null,"abstract":"Graph neural networks (GNN s) are a powerful class of model for representation learning on relational data and graph-structured signal, such as brain connectivity graphs derived from neuroimaging. To date, existing work applying graph learning methods to brain connectivity is limited to a single neuroimaging modality such as structural or functional MRI. In practice, the brain is best represented by multiple networks arising from different imaging modalities. We develop a gen-eral framework for jointly pooling multimodal graphs which share the same set of underlying nodes whilst differing in edge connectivity. Building on this approach, we propose a multimodal GNN (MM-GNN) model that incorporates mul-tiple types of neuroimaging-based brain connectivity. When applied to the task of classifying brain images from patients with schizophrenia and healthy control subjects, we observe that incorporating multimodal pooling dramatically improves performance over non-pooled networks and that MM-GNN matches or improves performance over multiple single-modal and non-GNN baselines. Finally, we demonstrate how our approach uses multimodal data to learn a unified, interpretable measure of the salience of individual brain regions of interest. In this way, MM-GNN represents a new method for leveraging diverse brain connectivity data to enhance the detection of mental health disorders and to understand their biological underpinnings.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116134939","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Diego Stucchi, Andrea Corsini, G. Genty, G. Boracchi, A. Foi
{"title":"A Weighted Loss Function to Predict Control Parameters for Supercontinuum Generation Via Neural Networks","authors":"Diego Stucchi, Andrea Corsini, G. Genty, G. Boracchi, A. Foi","doi":"10.1109/mlsp52302.2021.9596142","DOIUrl":"https://doi.org/10.1109/mlsp52302.2021.9596142","url":null,"abstract":"Supercontinuum light is generated by a train of laser pulses propagating in an optical fiber. The parameters characterizing these pulses influence the spectrum of the light as it exits the fiber. While spectrum generation is a direct process governed by nonlinear equations that can be reproduced through numerical simulation, determining the parameters of the pulse generating a given spectrum is a difficult inverse problem. Solving this inverse problem has a relevant practical implication, as it allows generating beams with desired spectral properties. We solve this multidimensional parameter estimation problem by training a neural network and we introduce, as key technical contribution, a weighted loss function that improves the estimation accuracy. Most remarkably, this loss function is not specific to the considered supercontinuum scenario, but has the potential to improve solutions of similar inverse problems where the forward process can be reproduced via computationally demanding simulations. Our experiments demonstrate the effectiveness of the pursued approach and of our weighted loss function.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129677460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Scalable Community Detection in the Degree-Corrected Stochastic Block Model","authors":"Yicong He, Andre Beckus, George K. Atia","doi":"10.1109/mlsp52302.2021.9596377","DOIUrl":"https://doi.org/10.1109/mlsp52302.2021.9596377","url":null,"abstract":"Community detection aims to partition a connected graph into a small number of clusters. The Degree-Corrected Stochastic Block Model (DCSBM) is one popular generative model that yields graphs with varying degree distributions within the communities. However, large computational complexity and storage requirements of existing approaches for DCSBM limit their scalability to large graphs. In this paper, we advance a scalable framework for DCSBM, in which the full graph is first sub-sampled by selecting a small subset of the nodes, then a clustering of the induced subgraph is obtained, followed by low-complexity retrieval of the global community structure from the clustering of the graph sketch. To sample the underlying graph, we introduce a family of sampling schemes that capture local community structures using metrics derived from the average neighbor degrees, which are shown to achieve the twin objective of sampling from low-density clusters and identifying high-degree nodes within each cluster. The proposed approach can perform on par with full scale clustering while affording substantial complexity and storage gains as demonstrated through experiments using both synthetic and real data.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"120 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114001613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep Complex Convolutional Recurrent Network for Multi-Channel Speech Enhancement and Dereverberation","authors":"Femke B. Gelderblom, T. A. Myrvoll","doi":"10.1109/mlsp52302.2021.9596086","DOIUrl":"https://doi.org/10.1109/mlsp52302.2021.9596086","url":null,"abstract":"This paper proposes a neural network based system for multichannel speech enhancement and dereverberation. Speech recorded indoors by a far field microphone, is invariably degraded by noise and reflections. Recent single channel enhancement systems have improved denoising performance, but do not reduce reverberation, which also reduces speech quality and intelligibility. To address this, we propose a deep complex convolution recurrent network (DCCRN) based multi-channel system, with integrated minimum power distortionless response (MPDR) beamformer and weighted prediction error (WPE) preprocessing. PESQ and STOI performance is evaluated on a test set of room impulse responses and noise samples recorded by the same setup. The proposed system shows a statistically significant improvement (p « 0.05) over competitive systems.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115770964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Coarse-to-Fine Object Detection Framework for High-Resolution Images with Sparse Objects","authors":"Jinyan Liu, Longbin Yan, Jie Chen","doi":"10.1109/mlsp52302.2021.9596518","DOIUrl":"https://doi.org/10.1109/mlsp52302.2021.9596518","url":null,"abstract":"To detect sparse small objects in high resolution images at a low cost is significantly more challenging than regular detection tasks. Compared to the overall detection accuracy, the recall rate is much less affected when using properly downsampled images for detection. Based on this fact, we propose a clustering-based coarse-to-fine object detection framework to enhance the object detection of sparse small objects. The first stage is coarse detection on a downsampled image to obtain image chips based on a clustering-baed region generation method. After that, the associated high resolution image clips are sent to a second-stage detector for fine detection. This approach reduces the number of chips for final object detection compared to regular methods, which divide the image into small tiles of the same size, and makes the best use of information in high-resolution images to increase detection accuracy. Experimental results show that our proposed approach achieves promising performance compared with other state-of-the-art detectors.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"113 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133082098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimized Spatial Matching for Visual Object Tracking","authors":"Fuxiang Wang, Qing Mei, Xuhui Liu, Yao Xiao","doi":"10.1109/mlsp52302.2021.9596182","DOIUrl":"https://doi.org/10.1109/mlsp52302.2021.9596182","url":null,"abstract":"The biggest challenge for visual object tracking is the simultaneous requirements for robustness and discrimination. Although there are many algorithms to study current problems, this problem still cannot be overcome. In this paper, inspired by Siamese network SPM-Tracker, a new target tracking algorithm-OSM-Tracker is proposed. The algorithm is a two-stage Siamese network tracker composed of an optimized space network and a correction network. Through the cooperation of these two aspects with SPM-Tracker, compared OSM-Tracker has produced good results. Experiments have proved that our tracker has achieved a considerable performance improvement and achieved real-time effects on OTB-100and LaSOT.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127824521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bayesian Two-Stage Sequential Change Diagnosis Via Multi-Sensor Array","authors":"Xiaochuan Ma, L. Lai, Shuguang Cui","doi":"10.1109/mlsp52302.2021.9596446","DOIUrl":"https://doi.org/10.1109/mlsp52302.2021.9596446","url":null,"abstract":"In this paper, we formulate and solve a two-stage Bayesian sequential change diagnosis (SCD) problem in a multi-sensor setting. In the considered problem, the change propagates across the sensor array gradually. After a change is detected, we are allowed to continue observing more samples so that we can identify the distribution after the change more accurately. The goal is to minimize the total cost including delay, false alarm, and misdiagnosis probabilities. We characterize the optimal SCD rule. Moreover, to address the high computational complexity issue of the optimal SCD rule, we propose a low-complexity threshold rule that is asymptotically optimal as the unit delay costs go to zero.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133738411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Small Moving Target MOT Tracking with GM-PHD Filter and Attention-Based CNN","authors":"Camilo Aguilar, M. Ortner, J. Zerubia","doi":"10.1109/mlsp52302.2021.9596204","DOIUrl":"https://doi.org/10.1109/mlsp52302.2021.9596204","url":null,"abstract":"We present a multi-object tracking (MOT) approach to track small moving targets in satellite images. Our objects of interest span few pixels, do not present a defined texture, and are easily lost in cluttered environments. We propose a patch-based convolutional neural network (CNN) that focuses on specific regions to detect and discriminate nearby small objects. We use the object motion information to drive the patch selection and detect objects using a region-based CNN. In addition, we present a direct MOT data-association approach by using an improved Gaussian mixture-probability hypothesis density (GM-PHD) filter. The GM-PHD filter offers an efficient yet robust MOT formulation that takes into account clutter, misdetection, and target appearance and disappearance. We are able to detect and track blob-like moving objects and demonstrate an improvement over competing state-of-the-art tracking approaches.","PeriodicalId":156116,"journal":{"name":"2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115519875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}