APSIPA Transactions on Signal and Information Processing最新文献

筛选
英文 中文
An Application-Oriented Taxonomy on Spoofing, Disguise and Countermeasures in Speaker Recognition 面向应用的说话人识别中的欺骗、伪装和对抗分类
IF 3.2
APSIPA Transactions on Signal and Information Processing Pub Date : 2022-01-01 DOI: 10.1561/116.00000017
Lantian Li, Xingliang Cheng, T. Zheng
{"title":"An Application-Oriented Taxonomy on Spoofing, Disguise and Countermeasures in Speaker Recognition","authors":"Lantian Li, Xingliang Cheng, T. Zheng","doi":"10.1561/116.00000017","DOIUrl":"https://doi.org/10.1561/116.00000017","url":null,"abstract":"","PeriodicalId":44812,"journal":{"name":"APSIPA Transactions on Signal and Information Processing","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67080002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced Automatic Areas of Interest (AOI) Bounding Boxes Estimation Algorithm for Dynamic Eye-Tracking Stimuli 动态眼动追踪刺激的增强自动感兴趣区域(AOI)边界盒估计算法
IF 3.2
APSIPA Transactions on Signal and Information Processing Pub Date : 2022-01-01 DOI: 10.1561/116.00000026
E. A. Lagmay, M. M. Rodrigo
{"title":"Enhanced Automatic Areas of Interest (AOI) Bounding Boxes Estimation Algorithm for Dynamic Eye-Tracking Stimuli","authors":"E. A. Lagmay, M. M. Rodrigo","doi":"10.1561/116.00000026","DOIUrl":"https://doi.org/10.1561/116.00000026","url":null,"abstract":"In eye-tracking research, an area of interest (AOI) is defined as any object in the visual stimuli which is/are focused on by the viewer, defined with bounding boxes of any shape. If a study makes use of a small number of static visual stimuli, then researchers may define AOIs manually. However, if the stimuli is dynamic, then manual AOI definition is not efficient or scalable. This paper presents the Enhanced Automatic AOI Bounding Boxes Estimation Algorithm which automatically esti-mates the AOI bounding boxes in dynamic stimuli using simple image segmentation techniques. This algorithm is an improvement on the Automatic AOI Bounding Boxes Estimation Algorithm. It uses a faster version of the SLIC algorithm which utilizes the AVX2 SIMD (Single Instruction, Multiple Data) parallelization paradigm, and replaces the second K-Means Image Segmentation procedure at the end of the pre-∗ and in the evaluation of the the end results of the Enhanced Automatic AOI Bounding Boxes Estimation vious version of the algorithm with Region Adjacency Graph (RAG) Thresholding. The evaluation of the overall results of the new algorithm shows that the Enhanced Automatic AOI Bounding Boxes Estimation Algorithm is superior to its predecessor both in terms of accuracy (recall and precision) and efficiency (benchmarking).","PeriodicalId":44812,"journal":{"name":"APSIPA Transactions on Signal and Information Processing","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67081045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Joint Chord and Key Estimation Based on a Hierarchical Variational Autoencoder with Multi-task Learning 基于多任务学习的分层变分自编码器联合和弦和键估计
IF 3.2
APSIPA Transactions on Signal and Information Processing Pub Date : 2022-01-01 DOI: 10.1561/116.00000052
Yiming Wu, Kazuyoshi Yoshii
{"title":"Joint Chord and Key Estimation Based on a Hierarchical Variational Autoencoder with Multi-task Learning","authors":"Yiming Wu, Kazuyoshi Yoshii","doi":"10.1561/116.00000052","DOIUrl":"https://doi.org/10.1561/116.00000052","url":null,"abstract":"This paper describes a deep generative approach to joint chord and key estimation for music signals. The limited amount of music signals with complete annotations has been the major bottleneck in supervised multi-task learning of a classification model. To overcome this limitation, we integrate the supervised multi-task learning approach with the unsupervised autoencoding approach in a mutually complementary manner. Considering the typical process of music composition, we formulate a hierarchical latent variable model that sequentially generates keys, chords, and chroma vectors. The keys and chords are assumed to follow a language model that represents their relationships and dynamics. In the framework of amortized variational inference (AVI), we introduce a classification model that jointly infers discrete chord and key labels and a recognition model that infers continuous latent features. These models are combined to form a variational autoencoder (VAE) and are trained jointly in a (semi-)supervised manner, where the generative and language models act as regularizers for the classification model. We comprehensively investigate three different architectures for the chord and key classification model, and three different architectures for the language model. Experimental results demonstrate that the VAE-based multi-task learning improves chord estimation as well as key estimation.","PeriodicalId":44812,"journal":{"name":"APSIPA Transactions on Signal and Information Processing","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67081066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial for the Special Issue on Information Processing for Understanding Human Attentional and Affective States 为理解人类注意和情感状态的信息处理特刊社论
IF 3.2
APSIPA Transactions on Signal and Information Processing Pub Date : 2022-01-01 DOI: 10.1561/116.00000100
J. Yoshimoto, U. Obaidellah
{"title":"Editorial for the Special Issue on Information Processing for Understanding Human Attentional and Affective States","authors":"J. Yoshimoto, U. Obaidellah","doi":"10.1561/116.00000100","DOIUrl":"https://doi.org/10.1561/116.00000100","url":null,"abstract":"","PeriodicalId":44812,"journal":{"name":"APSIPA Transactions on Signal and Information Processing","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67081312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial for the Special Issue on Multi-Disciplinary Dis/Misinformation Analysis and Countermeasures 多学科误传分析与对策特刊社论
IF 3.2
APSIPA Transactions on Signal and Information Processing Pub Date : 2022-01-01 DOI: 10.1561/116.00000101
Yuhong Liu
{"title":"Editorial for the Special Issue on Multi-Disciplinary Dis/Misinformation Analysis and Countermeasures","authors":"Yuhong Liu","doi":"10.1561/116.00000101","DOIUrl":"https://doi.org/10.1561/116.00000101","url":null,"abstract":"","PeriodicalId":44812,"journal":{"name":"APSIPA Transactions on Signal and Information Processing","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67081362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Content-Adaptive Level of Detail for Lossless Point Cloud Compression 无损点云压缩的内容自适应细节水平
IF 3.2
APSIPA Transactions on Signal and Information Processing Pub Date : 2022-01-01 DOI: 10.1561/116.00000004
Lei Wei, Shuai Wan, Fuzheng Yang, Zhecheng Wang
{"title":"Content-Adaptive Level of Detail for Lossless Point Cloud Compression","authors":"Lei Wei, Shuai Wan, Fuzheng Yang, Zhecheng Wang","doi":"10.1561/116.00000004","DOIUrl":"https://doi.org/10.1561/116.00000004","url":null,"abstract":"The nonuniform distribution of points in a point cloud and their abundant attribute information (such as colour, reflectance, and normal) result in the generation of massive data, making point cloud compression (PCC) essential for related applications. The hierarchical structure of the level of detail (LOD) in a point cloud and the corresponding predictions are commonly used in PCC, whereas the current method of LOD generation is neither content adaptive nor optimized. Targeting lossless PCC, an LOD prediction error model is proposed in this work, based on which the prediction error is minimized to obtain the optimal coding performance. As a result, the process of generating LOD is optimized, where the smallest number of LOD levels that yields the minimum attribute bitrate can be found. The proposed method is evaluated on various standard datasets under common test conditions. Experimental results show that the proposed method achieves optimal coding performance in a content-adaptive way while significantly reducing the time required for encoding and decoding, i.e., ∼ 15.2% and ∼ 17.3% time savings on average for distance-based LOD, and ∼ 5.4% and ∼ 5.1% time savings for Morton-based LOD, respectively.","PeriodicalId":44812,"journal":{"name":"APSIPA Transactions on Signal and Information Processing","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67079920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Recurrent Neural Networks and Their Memory Behavior: A Survey 递归神经网络及其记忆行为研究综述
IF 3.2
APSIPA Transactions on Signal and Information Processing Pub Date : 2022-01-01 DOI: 10.1561/116.00000123
Yuanhang Su, C.-C. Jay Kuo
{"title":"Recurrent Neural Networks and Their Memory Behavior: A Survey","authors":"Yuanhang Su, C.-C. Jay Kuo","doi":"10.1561/116.00000123","DOIUrl":"https://doi.org/10.1561/116.00000123","url":null,"abstract":"","PeriodicalId":44812,"journal":{"name":"APSIPA Transactions on Signal and Information Processing","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67081438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
ITS-Net: Iterative Two-Stream Network for Image Super-Resolution ITS-Net:图像超分辨率迭代双流网络
IF 3.2
APSIPA Transactions on Signal and Information Processing Pub Date : 2022-01-01 DOI: 10.1561/116.00000018
Wei Li, Yan Huang, Yilong Yin, Jingliang Peng
{"title":"ITS-Net: Iterative Two-Stream Network for Image Super-Resolution","authors":"Wei Li, Yan Huang, Yilong Yin, Jingliang Peng","doi":"10.1561/116.00000018","DOIUrl":"https://doi.org/10.1561/116.00000018","url":null,"abstract":"","PeriodicalId":44812,"journal":{"name":"APSIPA Transactions on Signal and Information Processing","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67080013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial for the Special Issue on Deep Neural Networks 深度神经网络特刊社论
IF 3.2
APSIPA Transactions on Signal and Information Processing Pub Date : 2022-01-01 DOI: 10.1561/116.00000062
Li-Wei Kang, C. Yeh
{"title":"Editorial for the Special Issue on Deep Neural Networks","authors":"Li-Wei Kang, C. Yeh","doi":"10.1561/116.00000062","DOIUrl":"https://doi.org/10.1561/116.00000062","url":null,"abstract":"","PeriodicalId":44812,"journal":{"name":"APSIPA Transactions on Signal and Information Processing","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67081126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning for Human Action Recognition: A Comprehensive Review 人类行为识别的深度学习:综述
IF 3.2
APSIPA Transactions on Signal and Information Processing Pub Date : 2022-01-01 DOI: 10.1561/116.00000068
Duc-Quang Vu, Trang Phung Thi Thu, Ngan T. H. Le, Jia-Ching Wang
{"title":"Deep Learning for Human Action Recognition: A Comprehensive Review","authors":"Duc-Quang Vu, Trang Phung Thi Thu, Ngan T. H. Le, Jia-Ching Wang","doi":"10.1561/116.00000068","DOIUrl":"https://doi.org/10.1561/116.00000068","url":null,"abstract":"","PeriodicalId":44812,"journal":{"name":"APSIPA Transactions on Signal and Information Processing","volume":null,"pages":null},"PeriodicalIF":3.2,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67081172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信