2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)最新文献

筛选
英文 中文
A Fair Model is not Fair in a Biased Environment 公平的模式在有偏见的环境中是不公平的
Y. Sato, S. Maeda, M. Akasaka, M. Nishigaki, Tetsushi Ohki
{"title":"A Fair Model is not Fair in a Biased Environment","authors":"Y. Sato, S. Maeda, M. Akasaka, M. Nishigaki, Tetsushi Ohki","doi":"10.23919/APSIPAASC55919.2022.9980134","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9980134","url":null,"abstract":"Facial images contain sensitive attributes such as skin color, and the elimination of them from the input in the face recognition is not easy. In addition, the input data includes the influence of the environment in which the system is actually used, so the interaction between sensitive attributes and the environment may make it inherently difficult for the facial feature extractor to extract facial features. Therefore, studies on the fairness of face recognition should consider the fairness of environmental factors. Common datasets used to evaluate the fairness of face recognition includes a variety of environmental factors, and the fairness evaluated by these datasets are usually the fairness in a typical shooting environment. However, a dataset that includes only extremely biased environmental factors potentially results in less equity among attributes. We construct a dataset with pseudo-biased environmental factors by dynamically changing environmental factors such as brightness in the test data. The results also show that the biased environmental factors deteriorate the fairness inter-attribute. Also, we showed that the distinguished attributes in terms of fairness in a biased environment vary based on the architecture of the model and the training dataset.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"54 35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123806519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual Prototypical Network for Robust Few-shot Image Classification 鲁棒少拍图像分类的双原型网络
Qi Song, Zebin Peng, Luchen Ji, Xiaochen Yang, Xiaoxu Li
{"title":"Dual Prototypical Network for Robust Few-shot Image Classification","authors":"Qi Song, Zebin Peng, Luchen Ji, Xiaochen Yang, Xiaoxu Li","doi":"10.23919/APSIPAASC55919.2022.9979898","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9979898","url":null,"abstract":"Deep neural networks have outperformed humans on some image recognition and classification tasks. However, with the emergence of various novel classes, it remains a chal-lenge to continuously expand the learning capability of such networks from a limited number of labeled samples. Metric-based approaches have been playing a key role in few-shot image classification, but most of them measure the distance between samples in the metric space using only a single metric function. In this paper, we propose a Dual Prototypical Network (DPN) to improve the test-time robustness of the classical prototypical network. The proposed method not only focuses on the distance of the original features, but also adds perturbation noise to the image and calculates the distance of noisy features. By enforcing the model to predict well under both metrics, more representative and robust class prototypes are learned and thus lead to better generalization performance. We validate our method on three fine-grained datasets in both clean and noisy settings.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122603981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Unified Compression and Watermarking Scheme for MT-BTC Images MT-BTC图像的统一压缩和水印方案
Jing-Ming Guo, Sankarasrinivasan Seshathiri
{"title":"A Unified Compression and Watermarking Scheme for MT-BTC Images","authors":"Jing-Ming Guo, Sankarasrinivasan Seshathiri","doi":"10.23919/APSIPAASC55919.2022.9979994","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9979994","url":null,"abstract":"As most multimedia is compressed for optimal storage or transmission, the watermarking during compression is more appropriate and demanding. Multi-tone block truncation coding image (MT-BTC) is a latest and superior version of halftone based block truncation coding images. In this paper, a novel watermarking strategy is proposed for MT-BTC images using the adaptive dither array selection (ADAS) and inter-tone shifting. ADAS method utilize dither array constructed using two different gaussian filters based on the Human Visual System (HVS) model to embed the watermark information. Further, various configuration of dither array such as actual, conjugate, transpose and transpose conjugate is used to embed more data. Moreover, a new approach termed inter-tone shifting is also proposed to improve the decoding rate and security. The decoding scheme is performed using the pattern similarity on the dithered watermarked image. For result validation, a 2,4,6 and 8-Tone watermark image is embedded in the 4-Tone MTBTC image. From extensive analysis on decoding rate and robustness, it has been validated that the proposed scheme outperforms many halftone watermarking and consistent with conventional BTC methods.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122884414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust Speech Dereverberation Based on Adaptive Weighted Prediction Error Algorithm with Eigenvector Extraction 基于特征向量提取的自适应加权预测误差算法的鲁棒语音去噪
Yitong Chen, Wen Zhang
{"title":"Robust Speech Dereverberation Based on Adaptive Weighted Prediction Error Algorithm with Eigenvector Extraction","authors":"Yitong Chen, Wen Zhang","doi":"10.23919/APSIPAASC55919.2022.9979829","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9979829","url":null,"abstract":"Due to its satisfactory performance and no need for room impulse response information, the adaptive weighted prediction error (AWPE) algorithm is promising for speech dereverberation in practice. However, the robustness of AWPE to additive noise is low. To alleviate this problem, this paper proposes a variant of the AWPE algorithm that is based on eigen-decomposition of the signal auto-correlation matrix to construct the reference signal. By using the dominant eigenvector as the reference signal, a linear prediction filter is designed which has a better performance to predict the late reverberation even when the additive noise level is high. To reduce the computational complexity of the standard eigen-decomposition operation in the proposed AWPE variant, an online eigenvector extraction algorithm based on a fixed-point iteration algorithm is presented. Simulations are conducted to validate the effectiveness and robustness of the proposed algorithms over the standard AWPE algorithm.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121241582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Fast Converge Spectral Modulation Sensitive Active Noise Control System 一种快速收敛频谱调制灵敏主动噪声控制系统
Kah-Meng Cheong, Yih-Liang Shen, T. Chi
{"title":"A Fast Converge Spectral Modulation Sensitive Active Noise Control System","authors":"Kah-Meng Cheong, Yih-Liang Shen, T. Chi","doi":"10.23919/APSIPAASC55919.2022.9979983","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9979983","url":null,"abstract":"Psychoacoustic active noise control (PANC) systems have been proposed to improve the noise reduction performance of active noise control (ANC) systems by considering hearing properties. PANC systems are usually implemented in the filtered@ $mathbf{x}$ least mean square (FxLMS) architecture. In this paper, we propose a PANC system using a simple and stable fast affine projection (FAP) algorithm in the filtered-x architecture, namely the filtered-x conjugate gradient fast affine projection (FxCGFAP) PANC. The proposed FxCGFAP PANC system converges faster than typical FxLMS ANC and PANC systems. The proposed PANC system considers not only the sensitivity of human hearing to frequency but also the sensitivity to spectral modulation. Objective and subjective evaluations have been conducted. The evaluation results show that the proposed system outperforms the other three systems in terms of objective loudness scores and subjective ratings. The proposed system has been implemented on the Tensilica HiFi3 DSP platform with the fixed-point data format.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127731112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Unfolding-Aided Sum-Product Algorithm for Error Correction of CRC Coded Short Message 基于深度展开辅助的CRC编码短信纠错和积算法
Qilin Zhang, S. Ibi, Takumi Takahashi, H. Iwai
{"title":"Deep Unfolding-Aided Sum-Product Algorithm for Error Correction of CRC Coded Short Message","authors":"Qilin Zhang, S. Ibi, Takumi Takahashi, H. Iwai","doi":"10.23919/APSIPAASC55919.2022.9979875","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9979875","url":null,"abstract":"This paper proposes a deep unfolding-aided sum-product algorithm (SPA) for error correction decoding of cyclic redundancy check (CRC) coded short message. SPA is a practical decoding algorithm for linear codes without requiring enormous computational complexity. However, if the SPA is used as it is for CRC codes, belief correlation and outliers will be induced in the iterative decoding process, resulting in lousy correction capability. To compensate for this drawback, we design a SPA-based decoding process for CRC code that incorporates a data-driven design based on deep learning and learning optimization of in-ternal trainable parameters. Considering the operation principle of soft-decision decoder, a novel loss function based on a weighted average of negentropy, which is a key measure to evaluate the Gaussianity, and BCE of the decoder output is proposed. Numerical results show that the proposed algorithm improves the bit error rate (BER) performance with deep unfolding and negentropy-aware loss function.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"342 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132632371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast Signal Completion Algorithm with Cyclic Convolutional Smoothing 基于循环卷积平滑的快速信号补全算法
Hiromu Takayama, Tatsuya Yokota
{"title":"Fast Signal Completion Algorithm with Cyclic Convolutional Smoothing","authors":"Hiromu Takayama, Tatsuya Yokota","doi":"10.23919/APSIPAASC55919.2022.9980284","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9980284","url":null,"abstract":"Recently, signal completion methods using delay-embedding transforms (DT) have been actively studied. Since the DT is an operation to transform a signal into a Hankel matrix, the high computational cost associated with the increase in data size is an issue. In this study, we consider modeling smooth signals based on inverse delay-embedding instead of delay-embedding. We propose a new algorithm that incorporates the properties of the delay-embedding-based methods while reducing the computational cost. The proposed algorithm takes advantage of the inverse delay-embedding being a cyclic convolution, and the computational complexity can be reduced to $mathcal{O}(NlogN)$ by transforming the optimization problem to Fourier space. Numerical experiments with typical signals and audio data show the effectiveness of the proposed algorithm in signal declipping and completion problems.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133511797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deep Adaptive Denoising Auto-Encoder Networks for ECG Noise Cancellation via Time-Frequency Domain 基于时频域心电降噪的深度自适应自编码器网络
Amir Mohammadisrab, Poorya Aghaomidi, Jalil Mazloum, M. Akbarzadeh, M. Orooji, N. Mokari, H. Yanikomeroglu
{"title":"Deep Adaptive Denoising Auto-Encoder Networks for ECG Noise Cancellation via Time-Frequency Domain","authors":"Amir Mohammadisrab, Poorya Aghaomidi, Jalil Mazloum, M. Akbarzadeh, M. Orooji, N. Mokari, H. Yanikomeroglu","doi":"10.23919/APSIPAASC55919.2022.9980058","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9980058","url":null,"abstract":"In this paper, we study the performance of a deep adaptive denoising auto-encoder network (DeepADAENet) for electrocardiogram (ECG) signal noise cancelation in the time-frequency domain for practical use cases. In order to achieve a higher resolution in distinguishing the noise from valuable data, the fractional Stockwell transform (FrST) is exploited to convert the ECG to the time-frequency image. The magnitude of the time-frequency version of the ECG is noise-canceled using DeepADAENet. Then, inverse FrST is utilized to return the denoised time-frequency ECG into the time domain. Furthermore, we use the MIT-BID Apnea-ECG database (APNEA-ECG) for preparing the dataset due to various physiologies and records compared with other ECG databases. Moreover, muscle artifacts (MA), baseline wander (BW), and electrode motion (EM) from the MIT-BID Noise Stress Test Database (NSTDB) are utilized to make noisy this clean dataset. The ECG signals recorded by non-clinical devices contain more noise than clinical recording. Accordingly, by changing the coefficient and frequency of noise resources, we attempt to close the simulated noisy signal to reality. Results reveal the excellent performance of DeepADAENet compared with similar work in terms of signal-to-noise ratio (SNR), root mean square error (RMSE), and percent root mean square difference (PRD).","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134038951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced Bidirectional Motion Estimation Using Feature Refinement for HDR Imaging 基于HDR成像特征细化的增强双向运动估计
An Gia Vien, Truong Thanh Nhat Mai, Seonghyun Park, Gahyeong Kim, Chul Lee
{"title":"Enhanced Bidirectional Motion Estimation Using Feature Refinement for HDR Imaging","authors":"An Gia Vien, Truong Thanh Nhat Mai, Seonghyun Park, Gahyeong Kim, Chul Lee","doi":"10.23919/APSIPAASC55919.2022.9980026","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9980026","url":null,"abstract":"We propose a high dynamic range (HDR) image synthesis algorithm based on enhanced bidirectional motion estimation using feature refinement. First, we extract multiscale features from input low dynamic range (LDR) images and then estimate accurate motion vector fields between them in a coarse-to-fine manner via progressive refinement. Then, we estimate adaptive local kernels to merge only valid information in the spatio-exposed neighboring pixels for synthesis. Finally, we refine the initially merged image by exploiting global information to further improve synthesis performance. Experimental results show that the proposed algorithm outperforms state-of-the-art algorithms in quantitative and qualitative comparisons.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133801581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vibration measurement using spatial shifting coherent digital holography 利用空间位移相干数字全息测量振动
Long Ngo, Q. Pham
{"title":"Vibration measurement using spatial shifting coherent digital holography","authors":"Long Ngo, Q. Pham","doi":"10.23919/APSIPAASC55919.2022.9980262","DOIUrl":"https://doi.org/10.23919/APSIPAASC55919.2022.9980262","url":null,"abstract":"In this research, we proposed a new digital coherent holographic configuration for accurately measuring the three-dimensional (3D) vibration of the object. The vibration was indirectly measured by the displacement of the three mirrors attached on the object. The hologram recorded by the camera consisting of 6 sub-holograms can be separated by Fourier transform and appropriated spatial band-pass filters. Three phase sets extracted from 3 sub-holograms of the reference mirror and 3 object mirrors were used to calculate the displacement of the object in 3D directions. The relation between the displacement of the object and the phases of the sub-holograms was related to the wavelength of the light source, therefore this allows observing the vibration of the object with nano-scale accuracy in z direction and much smaller than the pixel size of the camera accuracy in x and y directions.","PeriodicalId":382967,"journal":{"name":"2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133853087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信