{"title":"A preliminary acoustic study of Mandarin speech in drug addicts","authors":"Puyang Geng, Hong Guo, Qimeng Lu, Shaopei Shi","doi":"10.1117/12.2631443","DOIUrl":"https://doi.org/10.1117/12.2631443","url":null,"abstract":"Using a natural conversation paradigm, this study investigated the acoustic characteristics of Mandarin utterances in drug addicts. Twenty-one native speakers of Mandarin, including four heroin addicts, two 3,4-methylamphetamine (MDMA, also known as ecstasy) addicts, and 15 healthy controls without any history of drug abuse, were recruited for the speech production experiment. In comparison with the healthy controls, heroin addicts exhibited a higher mean F0, a lower mean intensity, a higher variability in both F0 and intensity, and a lower H1-H2, while MDMA addicts exhibited a higher variability in both F0 and intensity. Discriminant analysis based on these acoustic parameters further showed a good accuracy of differentiating the three groups of speakers. These findings provide the basis for future research into identifying drug addicts on the basis of speech signals.","PeriodicalId":415097,"journal":{"name":"International Conference on Signal Processing Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129628535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tao Wang, Yu Wu, Yongwei Zhang, Wen Lv, Xinwei Chen, Zhi-Wen Yang
{"title":"Concentration prediction of binary mixed gases based on random forest algorithm in the electronic nose system","authors":"Tao Wang, Yu Wu, Yongwei Zhang, Wen Lv, Xinwei Chen, Zhi-Wen Yang","doi":"10.1117/12.2631552","DOIUrl":"https://doi.org/10.1117/12.2631552","url":null,"abstract":"The concentration prediction of mixed gases is crucial to the pattern recognition research of electronic nose (E-nose) systems. The response experiments of the E-nose towards the ethanol and propanol mixture with different concentrations are carried out. Five kinds of machine learning algorithms, including linear regression, support vector machine, K-nearest neighbor, random forest, and decision tree, are used for training the multiple output regressors to predict the content of each component simultaneously. The R2 score, root mean squared error, and mean absolute error are used to evaluate the performance of these models. The relationship between prediction accuracy and concentration distribution has also been studied. The results show that the model based on the random forest has superior performance for forecasting the concentration of ethanol and propanol, with the R2 score more than 0.98 in the 5-fold cross-validation. This study provides a significant inspiration for designing a multi-output regression model to realize the quantitative prediction of mixed gases by the E-nose.","PeriodicalId":415097,"journal":{"name":"International Conference on Signal Processing Systems","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114816993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on different classifiers of automatic target recognition and classification for low-resolution ground radar","authors":"Yu-Li You, Renhong Xie, Jinwei Gu, Teng Wang, Peng Li, Yibin Rui","doi":"10.1117/12.2631543","DOIUrl":"https://doi.org/10.1117/12.2631543","url":null,"abstract":"As low-resolution radar is still the main radar in service in China, the ground target classification and recognition technology of low-resolution radar has a wide application prospect in modern military and civil fields. This paper mainly studies and compares two main types of automatic target recognition and classification method for low-resolution ground radar: conventional recognition based on feature extraction and neural networks, and the conclusion is that the latter has better performance and needs less time to train. The former model in this paper fuses the time domain and frequency domain features of ground target echo, then simulates, compares and analyzes the performance of different classifiers. The classifiers studied include: naive bayes classifier (NBC), decision tree classifier (DT), linear discriminant analysis (LDA) classifier, k nearest neighbors (KNN) classifier and support vector machine (SVM) classifier. Five-fold cross validation is adopted in the experiment to effectively avoid the impact of arbitrariness on the results caused by the random division of the sample set into training sample set and test sample set. Besides, based on conventional convolutional neural networks, a new neural network structure named multi-scale residual neural network (Multi-scale ResNet) is proposed for one-dimensional feature target recognition, which effectively reduces the data dimension through auto-encoder and solves the problem of performance degradation caused by the difficulty in training too many levels of traditional convolutional neural network. The bayesian hyper-parameter optimization method is utilized to optimize the hyper-parameters of different classifiersl. Finally, compared the accuracy of the two types of target recognition, the best performance of the pattern recognition is the support vector machine, which recognition rate is 91.2%, while multi-scale residual neural network recognition rate is up to 99.6%.","PeriodicalId":415097,"journal":{"name":"International Conference on Signal Processing Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129870077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Expression-assisted facial action unit detection through an attention mechanism and smooth class-weighted Loss","authors":"Zhongling Liu, Ziqiang Shi, Rujie Liu, Liu Liu, Xiaoyue Mi, Kentaro Murase","doi":"10.1117/12.2631478","DOIUrl":"https://doi.org/10.1117/12.2631478","url":null,"abstract":"The performance of facial action unit (AU) detection is often limited owing to the lack of annotated AU data and data imbalance. The data scarcity problem can be partially mitigated by abundant expression data, as AU detection and facial expression recognition (FER) are closely related. Accordingly, in this study, FER and AU detection are trained jointly in a multi-task learning framework, in which FER serves as an auxiliary task for AU detection by providing supplementary information. Meanwhile, we propose to use an attention gate unit between the two tasks to flexibly select valuable information from each other. To address the model bias issue caused by AU data imbalance, a smooth class-weighted loss is adopted to alleviate the dominance of negative AU classes. The best average F1-score obtained using our approach on the BP4D dataset is 63.5%, which is very close to the state-of-the-art performance and exceeds the single task baseline by 3.2%.","PeriodicalId":415097,"journal":{"name":"International Conference on Signal Processing Systems","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130974923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Detection of voice disability and its severity in children","authors":"Mohina Ahmadi, Pavani Mullapudi, Shreyashri Athani, Shikha Tripathi","doi":"10.1117/12.2631479","DOIUrl":"https://doi.org/10.1117/12.2631479","url":null,"abstract":"Voice Disability is one of the common disabilities experienced by children. Speech being the major mode of communication, it is important to rectify the voice-related problems at an early stage in life. Painful endoscopic techniques like laryngoscopy are used by doctors to identify the voice disability. In this work, an algorithm is devised to measure the severity of voice disability in children using signal processing techniques. Spectrogram and curve fitting techniques are used to detect voice disability. The normal and pathological curve fitted functions are passed through an adaptive signal processing system. Correlation between the normal function and tuned pathological function is obtained which is used to determine the severity of the disability. The reported work on this topic is language-dependent and uses machine learning algorithms that need large databases. In this work, adaptive signal processing techniques and the use of voice acoustic parameters are explored. Sound samples used are vowel sounds that are independent of the language and a range has been assigned to quantify the severity of the disability.","PeriodicalId":415097,"journal":{"name":"International Conference on Signal Processing Systems","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114247261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A method of multi-target bearings-only association and location with two platforms","authors":"Tao Jiang, Ganggang Tu, Jiutao Wu","doi":"10.1117/12.2631444","DOIUrl":"https://doi.org/10.1117/12.2631444","url":null,"abstract":"It is not easy for a single passive platform to realize target location by just bearings-only measurements, unless it carries a time-consuming laborious maneuvering. By contrast, with two platforms we could simply achieve accurate target location through triangulation approach without the need of platform maneuvering. However, this two-platform approach introduces an inter-platform association problem in multi-target scenarios, that is, one should recognize and exclude the false targets generated from wrongly paired bearings. To solve this, in this paper, a target motion analysis (TMA) based approach is proposed. Through a pre-TMA of all possible targets, the method calculates the confidence coefficients of all possible targets for recognition (real or false), which are then used to associate inter-platform bearings and select real targets. Simulation verification is performed through using Monte Carlo technique. The result demonstrates that the proposed method can achieve accurate inter-platform multi-target data association and multi-target location.","PeriodicalId":415097,"journal":{"name":"International Conference on Signal Processing Systems","volume":"110 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134535110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Image inpainting based on directional Gaussian graph model using multi-head reference","authors":"Zhonghao Zhang, Lihong Ma","doi":"10.1117/12.2631560","DOIUrl":"https://doi.org/10.1117/12.2631560","url":null,"abstract":"This paper aims to repair missing regions which are corrupted along arbitrary directions. It presents a mixed image inpainting method based on Markov random field. By using Belief Propagation scheme in low gray-levels and alternately suggesting a directional Gaussian Graphical model (DGGM) for multiple references in a high-level range, it gains a balance between the model accuracy and the computation complexity in realization. On the basis of an existing method in [1], it improves the method in high level inpainting for the task under small train sets and large corrupted regions, by introducing these multi-head reference clues. Experimental results are given, the inpainting quality of different kinds of images with different sizes and contents under different parameter settings are compared in metrics of the peak signal noise ratio and the structural similarity index. The significance of parameter settings and the efficient computational cost could demonstrate the feasibility of this method.","PeriodicalId":415097,"journal":{"name":"International Conference on Signal Processing Systems","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117253830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Referential genome sequence compression with low memory consumption","authors":"Zhiwen Lu, Jianhua Chen, Rongshu Wang","doi":"10.1117/12.2631583","DOIUrl":"https://doi.org/10.1117/12.2631583","url":null,"abstract":"With the rapid development of genome sequencing technology, a large amount of genome data has been generated, it also brings the storage problem of this massive data. Therefore, the compression of genome data has become a research hotspot. We propose a new genome data compression algorithm called LCMRGC (low memory consumption referential genome compressor) for FASTA format sequences. The algorithm uses the suffix array data structure to support the search of matching strings, and uses the binary search method to accelerate accurate matching, so as to obtain better compression ratio. Experiment results on standard genome data show that the proposed algorithm significantly reduces the memory requirement for program operation, and is competitive in compression ratio and compression time.","PeriodicalId":415097,"journal":{"name":"International Conference on Signal Processing Systems","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126178048","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A multi-scale adaptive feature enhancement network for image denoising","authors":"Qing Zhao, Z. Miao, Wanru Xu, Shaoyue Song","doi":"10.1117/12.2631573","DOIUrl":"https://doi.org/10.1117/12.2631573","url":null,"abstract":"Recently, deep learning has been widely used in image denoising. However, most of the existing deep learning-based methods are not adequate in blind denoising for additive white Gaussian noise (AWGN) images and real-world noisy images, which are still noisy or blurred. The difficulty is how to handle different noise levels and different types of noise with only one pre-trained model. In this paper, we propose a multi-scale adaptive feature enhancement network (MFENet) to improve the performance on blind image denoising. The MFENet is based on residual learning and batch normalization to speed up the network convergence. In the MFENet, dilated convolution and deformable convolution can expand the receptive field to obtain rich information from different scales. The deformable convolution is also able to adjust the sampling position to fit different shapes of objects. Spatial attention is used to enhance important features in the large amount of information. The experimental results show that the proposed method for blind denoising outperforms the state-of-the-art methods on both synthetic and real-world noisy images.","PeriodicalId":415097,"journal":{"name":"International Conference on Signal Processing Systems","volume":"140 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124562936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A modulation index estimation algorithm for multi-h CPM signals","authors":"Hao Chen, Xiaolin Zhang, Xiaopeng Zhang","doi":"10.1117/12.2631576","DOIUrl":"https://doi.org/10.1117/12.2631576","url":null,"abstract":"Continuous Phase Modulation (CPM) signal has been widely used in modern satellite, mobile communication and military communication system due to its good spectrum and power utilization and constant envelope characteristics. The premise of demodulation or jamming of CPM signal intercepted in military communication confrontation is the accurate estimation of signal parameters. Aiming at the difficulty and complexity of multi-h CPM signal modulation index estimation, this paper puts forward a kind of blind estimation algorithm of the modulation index based on first order cyclic moment and the second-order cyclic cumulants, using the first and second order cyclic properties in frequency domain on the spectral properties to signal parameter estimation. The estimation algorithm is suitable for both single h CPM signals and multi-h CPM signals, where the multi-h CPM requires the synchronization of the guidance sequence. Simulation results show that the algorithm has better performance under low signal-to-noise ratio with less symbols required. When the number of symbols is 128 and the signal-to-noise ratio is 15 dB, the accurate recognition rate of the multi-index CPM signal can reach 98% and the recognition rate can reach 99% with the allowable error of 1/32.","PeriodicalId":415097,"journal":{"name":"International Conference on Signal Processing Systems","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124948256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}