Comput. Vis. Image Underst.: Latest Articles, Page 4

DFAF3D: A dual-feature-aware anchor-free single-stage 3D detector for point clouds

Comput. Vis. Image Underst. Pub Date : 2022-11-01 DOI: 10.2139/ssrn.4195234

Qingsong Tang, Xinyu Bai, Jinting Guo, Bolin Pan, Wuming Jiang

Citations: 3

RGB-T tracking by modality difference reduction and feature re-selection

Comput. Vis. Image Underst. Pub Date : 2022-11-01 DOI: 10.2139/ssrn.4137009

Qian Zhang, Xueru Liu, Tianlu Zhang

Citations: 2

Multistage temporal convolution transformer for action segmentation

Comput. Vis. Image Underst. Pub Date : 2022-10-01 DOI: 10.2139/ssrn.4217347

Nicolas Aziere, S. Todorovic

Citations: 8

Appropriate grape color estimation based on metric learning for judging harvest timing

Comput. Vis. Image Underst. Pub Date : 2022-09-28 DOI: 10.1007/s00371-022-02666-0

Tatsuyoshi Amemiya, Chee Siang Leow, Prawit Buayai, Koji Makino, Xiaoyang Mao, H. Nishizaki

Citations: 1

MECCANO: A Multimodal Egocentric Dataset for Humans Behavior Understanding in the Industrial-like Domain

Comput. Vis. Image Underst. Pub Date : 2022-09-19 DOI: 10.1016/S1077-3142(23)00144-3

F. Ragusa, Antonino Furnari, G. Farinella

Citations: 11

Revisiting Crowd Counting: State-of-the-art, Trends, and Future Perspectives

Comput. Vis. Image Underst. Pub Date : 2022-09-14 DOI: 10.48550/arXiv.2209.07271

Muhammad Asif Khan, H. Menouar, R. Hamila

Abstract: Crowd counting is an effective tool for situational awareness in public places. Automated crowd counting using images and videos is an interesting yet challenging problem that has gained significant attention in computer vision. Over the past few years, various deep learning methods have been developed to achieve state-of-the-art performance. The methods evolved over time vary in many aspects such as model architecture, input pipeline, learning paradigm, computational complexity, and accuracy gains etc. In this paper, we present a systematic and comprehensive review of the most significant contributions in the area of crowd counting. Although few surveys exist on the topic, our survey is most up-to date and different in several aspects. First, it provides a more meaningful categorization of the most significant contributions by model architectures, learning methods (i.e., loss functions), and evaluation methods (i.e., evaluation metrics). We chose prominent and distinct works and excluded similar works. We also sort the well-known crowd counting models by their performance over benchmark datasets. We believe that this survey can be a good resource for novice researchers to understand the progressive developments and contributions over time and the current state-of-the-art.

Citations: 11

A novel fast combine-and-conquer object detector based on only one-level feature map

Comput. Vis. Image Underst. Pub Date : 2022-09-01 DOI: 10.2139/ssrn.4003831

Jianhua Yang, Ke Wang, Ruifeng Li, Zhong Qin, P. Perner

Citations: 3

Multi-label out-of-distribution detection via exploiting sparsity and co-occurrence of labels

Comput. Vis. Image Underst. Pub Date : 2022-09-01 DOI: 10.2139/ssrn.4151266

Lei Wang, Shengyue Huang, Luwen Huangfu, Bo Liu, Xiaohong Zhang

Citations: 6

ST-VTON: Self-supervised vision transformer for image-based virtual try-on

Comput. Vis. Image Underst. Pub Date : 2022-09-01 DOI: 10.2139/ssrn.4140115

Zheng Chong, L. Mo