2020 Digital Image Computing: Techniques and Applications (DICTA)最新文献

筛选
英文 中文
Miss Yoga: A Yoga Assistant Mobile Application Based on Keypoint Detection Miss Yoga:一款基于关键点检测的瑜伽助手手机应用
2020 Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2020-11-29 DOI: 10.1109/DICTA51227.2020.9363384
Renhao Huang, Jiqing Wang, Haowei Lou, Haodong Lu, Bofei Wang
{"title":"Miss Yoga: A Yoga Assistant Mobile Application Based on Keypoint Detection","authors":"Renhao Huang, Jiqing Wang, Haowei Lou, Haodong Lu, Bofei Wang","doi":"10.1109/DICTA51227.2020.9363384","DOIUrl":"https://doi.org/10.1109/DICTA51227.2020.9363384","url":null,"abstract":"This paper demonstrates a Yoga assistant mobile application based on human-keypoints detection models, which imitates the scene that real Yoga tutors guide and supervise their students to do Yoga via the video chat. In order to provide humanize, safe and convenient service, the core function is designed as hands-free using voice service, and embedding fast and accurate models to detect keypoints and calculate the scores. In addition, we propose an improved algorithm to calculate scores that can be applied to all poses. Our application is evaluated on different Yoga poses under different scenes, and its robustness is guaranteed.","PeriodicalId":348164,"journal":{"name":"2020 Digital Image Computing: Techniques and Applications (DICTA)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121403774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
An Improved Discriminator for GAN-Based Trajectory Prediction Models 基于gan的弹道预测模型的改进判别器
2020 Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2020-11-29 DOI: 10.1109/DICTA51227.2020.9363414
Renhao Huang, Yang Song, M. Pagnucco
{"title":"An Improved Discriminator for GAN-Based Trajectory Prediction Models","authors":"Renhao Huang, Yang Song, M. Pagnucco","doi":"10.1109/DICTA51227.2020.9363414","DOIUrl":"https://doi.org/10.1109/DICTA51227.2020.9363414","url":null,"abstract":"Pedestrian trajectory prediction is an important component in autonomous systems, such as self-driving cars and social robots. It aims to accurately predict or plan future paths for pedestrians according to their movement histories. Recent studies have shown promising progress and most of them use some advanced encoder-decoder structures with Generative Adversarial Networks (GANs) to generate a distribution of multiple plausible paths of an agent. However, GAN-based models suffer from hard-training problems and training Recurrent Neural Networks (RNNs) is especially difficult. In this paper, we propose a discriminator that shares its encoder with the generator to reduce the training difficulty. We incorporate this discriminator into two successful stochastic models designed for pedestrian trajectory prediction. Our experimental results demonstrate that the new discriminator outperforms the baseline structures in general on multiple datasets.","PeriodicalId":348164,"journal":{"name":"2020 Digital Image Computing: Techniques and Applications (DICTA)","volume":"434 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126859062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
3D Reconstruction of Edge Line by ICP-based Matching with Geometric Constraints 基于icp的几何约束匹配边缘线三维重建
2020 Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2020-11-29 DOI: 10.1109/DICTA51227.2020.9363373
Kojiro Takeyama
{"title":"3D Reconstruction of Edge Line by ICP-based Matching with Geometric Constraints","authors":"Kojiro Takeyama","doi":"10.1109/DICTA51227.2020.9363373","DOIUrl":"https://doi.org/10.1109/DICTA51227.2020.9363373","url":null,"abstract":"This paper presents a novel edge-based 3D reconstruction method using a monocular camera. The edge information is known to be illumination-invariant and to include abundant structural information in a relatively small number of pixels. However, since edge line cannot explicitly determine the pixel-to-pixel correspondence as in the feature point approach, it is difficult to perform accurate matching of pixels in a scene with dense edge lines. In this study, edge-based 3D reconstruction using ICP (iterative closest point algorithm) with geometric constraints has been proposed. In our approach, quasi-rigid body assumption for the edge line deformation and smart search for the matching process are introduced for the improvement of matching robustness in scenes with dense edge lines. Experimental results show that the performance of our method for both motion parallax estimation and depth estimation is greatly improved compared with two recent edge-based methods.","PeriodicalId":348164,"journal":{"name":"2020 Digital Image Computing: Techniques and Applications (DICTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128789471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Branch Profiles for Shape Analysis 用于形状分析的分支型材
2020 Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2020-11-29 DOI: 10.1109/DICTA51227.2020.9363421
Zayed M. Asiri, B. Martin, M. Bottema
{"title":"Branch Profiles for Shape Analysis","authors":"Zayed M. Asiri, B. Martin, M. Bottema","doi":"10.1109/DICTA51227.2020.9363421","DOIUrl":"https://doi.org/10.1109/DICTA51227.2020.9363421","url":null,"abstract":"Ahstract- The necessity for characterising highly irregularly shaped objects appears in many circumstances, most prominently in biology and medicine, but also in physical sciences and elsewhere. Here, a multi-scale method for quantifying the level of branching in irregular structures is presented to extend the repertoire descriptors of shape. The method was used to classify strains of yeast colonies and to demonstrate differences in structure of newly formed cancellous bone in rats under various experimental conditions. Yeast colonies were classified with an accuracy of 1.000 (n = 10) and classification of newly formed cancellous bone into three classes achieved mean accuracy of 0.853 ±. 088 over 10 runs with data randomly sampled from the same 15 rats each run.","PeriodicalId":348164,"journal":{"name":"2020 Digital Image Computing: Techniques and Applications (DICTA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132231128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tuna Nutriment Tracking using Trajectory Mapping in Application to Aquaculture Fish Tank 基于轨迹映射的金枪鱼营养跟踪在水产养殖鱼缸中的应用
2020 Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2020-11-29 DOI: 10.1109/DICTA51227.2020.9363387
Hilmil Pradana, K. Horio
{"title":"Tuna Nutriment Tracking using Trajectory Mapping in Application to Aquaculture Fish Tank","authors":"Hilmil Pradana, K. Horio","doi":"10.1109/DICTA51227.2020.9363387","DOIUrl":"https://doi.org/10.1109/DICTA51227.2020.9363387","url":null,"abstract":"The cost of fish feeding is usually around 40 percent of total production cost. Estimating a state of fishes in a tank and adjusting an amount of nutriments play an important role to manage cost of fish feeding system. Our approach is based on tracking nutriments on videos collected from an active aquaculture fish farm. Tracking approach is applied to acknowledge movement of nutriment to understand more about the fish behavior. Recently, there has been increasing number of researchers focused on developing tracking algorithms to generate more accurate and faster determination of object. Unfortunately, recent studies have shown that efficient and robust tracking of multiple objects with complex relations remain unsolved. Hence, focusing to develop tracking algorithm in aquaculture is more challenging because tracked object has a lot of aquatic variant creatures. By following aforementioned problem, we develop tuna nutriment tracking based on the classical minimum cost problem which consistently performs well in real environment datasets. In evaluation, the proposed method achieved 21.32 pixels and 3.08 pixels for average error distance and standard deviation, respectively. Quantitative evaluation based on the data generated by human annotators shows that the proposed method is valuable for aquaculture fish farm and can be widely applied to real environment datasets.","PeriodicalId":348164,"journal":{"name":"2020 Digital Image Computing: Techniques and Applications (DICTA)","volume":"726 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133282886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Fusing Visual Features and Metadata to Detect Flooding in Flickr Images 融合视觉特征和元数据来检测Flickr图像中的洪水
2020 Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2020-11-29 DOI: 10.1109/DICTA51227.2020.9363418
R. Jony, A. Woodley, Dimitri Perrin
{"title":"Fusing Visual Features and Metadata to Detect Flooding in Flickr Images","authors":"R. Jony, A. Woodley, Dimitri Perrin","doi":"10.1109/DICTA51227.2020.9363418","DOIUrl":"https://doi.org/10.1109/DICTA51227.2020.9363418","url":null,"abstract":"Social media platforms such as Flickr have become a source of information for the assessment of natural disasters, for instance assisting in flood mapping. Visual features and textual metadata have been used to identify natural disasters in social media images, however, they have often been used separately. Here, we fuse these two modes together using two fusion methods and deep learning to identify flood images in the MediaEval 2017 dataset. A novel backpropagation technique, Direct Backpropagation (DBP) is used to train a neural network for the classification. The results show that the fusion methods improve the classification accuracy compared to their individual counterparts. We compare our proposed learning method with other baseline methods and find it producing highest classification results. For external evaluation, the results are compared with MediaEval 2017 methods, where our methods outperform most of them.","PeriodicalId":348164,"journal":{"name":"2020 Digital Image Computing: Techniques and Applications (DICTA)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127540924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A 3D-2D Registration Method for Stereo Scan Overlay on Structure from Motion Model 运动模型结构立体扫描叠加的三维-二维配准方法
2020 Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2020-11-29 DOI: 10.1109/DICTA51227.2020.9363423
Deepak Rajamohan, M. Garratt, M. Pickering
{"title":"A 3D-2D Registration Method for Stereo Scan Overlay on Structure from Motion Model","authors":"Deepak Rajamohan, M. Garratt, M. Pickering","doi":"10.1109/DICTA51227.2020.9363423","DOIUrl":"https://doi.org/10.1109/DICTA51227.2020.9363423","url":null,"abstract":"Ahstract- The ability to detect and analyze changes or understand the scene while navigating close to buildings is very important for autonomous aerial and ground vehicle based surveillance applications. For this, the latest textured 3D scan of the platform's view frustum has to be placed accurately in the context of a big map like a Structure from Motion (SfM) map of the region. However, due to the drift in the camera trajectory, the scans are usually not aligned with the SfM model. This paper proposes a novel registration algorithm that aligns the 3D scan using known 2D images of the SfM model. The proposed 3D-2D registration method uses a heuristic approach which first performs a robust 2D-2D registration between the projection of the 3D scan and the SfM images and then calculates the 3D alignment parameters by combining registration results of multiple camera views of the SfM model. The results presented compare the robustness of the proposed registration techniques with traditional approaches.","PeriodicalId":348164,"journal":{"name":"2020 Digital Image Computing: Techniques and Applications (DICTA)","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121810452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multitask Learning for Video-based Surgical Skill Assessment 基于视频的多任务学习外科技能评估
2020 Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2020-11-29 DOI: 10.1109/DICTA51227.2020.9363408
Zhiteng Jian, W. Yue, Qiuxia Wu, Wei Li, Zhiyong Wang, Vincent Lam
{"title":"Multitask Learning for Video-based Surgical Skill Assessment","authors":"Zhiteng Jian, W. Yue, Qiuxia Wu, Wei Li, Zhiyong Wang, Vincent Lam","doi":"10.1109/DICTA51227.2020.9363408","DOIUrl":"https://doi.org/10.1109/DICTA51227.2020.9363408","url":null,"abstract":"Surgical skill assessment (SSA) plays a vital role in medical systems for reducing intraoperative surgical errors and improving clinical outcomes. To ensure objective and efficient SSA, many automatic video-based SSA methods have been developed. In particular, various deep learning methods have been devised recently by utilising CNN or RNN-based networks for various skill assessment tasks (e.g., skill level prediction). While predicting overall skill levels and assessing detailed attribute-based scores are highly correlated, most existing studies deal with these two tasks separately, without fully exploiting different information sources encoded in a dataset. In contrast, we propose a novel end-to-end multitask learning framework to conduct skill level classification and attribute score regression jointly. Specifically, our network incorporates two branches for the two tasks, which share earlier layers for feature extraction and hold different prediction layers for specific targets. The shared feature extractor is optimised under the supervision of both tasks simultaneously, encouraging the model to consider information from different aspects and their relatedness to learn richer and more generalised features. In addition, since not every part of a surgical video contributes to skill assessment equally, we enhance an existing feature extractor I3D with a novel Spatio-Temporal & Channel Attention Module to emphasize important features. Experimental results on the public dataset JIGSAWS show that our proposed network outperforms state-of-the-art models on both skill classification and score regression tasks.","PeriodicalId":348164,"journal":{"name":"2020 Digital Image Computing: Techniques and Applications (DICTA)","volume":"15 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123537177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Automated Computational Diagnosis of Peripheral Retinal Pathology in Optical Coherence Tomography (OCT) Scans using Graph Theory 基于图论的光学相干断层扫描(OCT)视网膜病理自动计算诊断
2020 Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2020-11-29 DOI: 10.1109/DICTA51227.2020.9363376
T. Lange, Stewart R. Lake, Karen Reynolds, M. Bottema
{"title":"Automated Computational Diagnosis of Peripheral Retinal Pathology in Optical Coherence Tomography (OCT) Scans using Graph Theory","authors":"T. Lange, Stewart R. Lake, Karen Reynolds, M. Bottema","doi":"10.1109/DICTA51227.2020.9363376","DOIUrl":"https://doi.org/10.1109/DICTA51227.2020.9363376","url":null,"abstract":"Analysis of retinal shape with optical coherence tomography (OCT) has been valuable in describing different ophthalmic conditions. An effective method for retinal contour delineation is graph theory. This study compares the ability of two different implementations of graph theory, the Livewire (LVW) intelligent scissors developed for ImageJ and a purpose-built graph searching function (GSF), to determine retinal shape for a retinal disease classifier. Both methods require user interaction. Retinal shape features derived from both methods were used to diagnose eyes with posterior vitreous detachment (PVD) or retinal detachment (RD) via quadratic discriminant analysis. Classification with each method was the same in 49 out of 51 eyes. Processing time was faster with the GSF than LVW. In mean (µ) ± standard deviation (SD), GSF took 524 ± 62 s and LVW took 814 ± 223 s (p = 5.52 x 10−14). Conclusively, GSF was easier to use and is preferred for further retinal shape analysis.","PeriodicalId":348164,"journal":{"name":"2020 Digital Image Computing: Techniques and Applications (DICTA)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120960395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
FANet: Feature Aggregation Network for Semantic Segmentation 语义分割的特征聚合网络
2020 Digital Image Computing: Techniques and Applications (DICTA) Pub Date : 2020-11-29 DOI: 10.1109/DICTA51227.2020.9363370
Tanmay Singha, Duc-Son Pham, A. Krishna
{"title":"FANet: Feature Aggregation Network for Semantic Segmentation","authors":"Tanmay Singha, Duc-Son Pham, A. Krishna","doi":"10.1109/DICTA51227.2020.9363370","DOIUrl":"https://doi.org/10.1109/DICTA51227.2020.9363370","url":null,"abstract":"Due to the rapid development in robotics and autonomous industries, optimization and accuracy have become an important factor in the field of computer vision. It becomes a challenging task for the researchers to design an efficient, optimized model with high accuracy in the field of object detection and semantic segmentation. Some existing off-line scene segmentation methods have shown an outstanding result on different datasets at the cost of a large number of parameters and operations, whereas some well-known real-time semantic segmentation techniques have reduced the number of parameters and operations in demand for resource-constrained applications, but model accuracy is compromised. We propose a novel approach for scene segmentation suitable for resource-constrained embedded devices by keeping a right balance between model architecture and model performance. Exploiting the multi-scale feature fusion technique with accurate localization augmentation, we introduce a fast feature aggregation network, a real-time scene segmentation model capable of handling high-resolution input image (1024 × 2048 px). Relying on an efficient embedded vision backbone network, our feature pyramid network outperforms many existing off-line and real-time pixel-wise deep convolution neural networks (CNNs) and produces 89.7% pixel accuracy and 65.9% mean intersection over union (mIoU) on the Cityscapes benchmark validation dataset whilst having only 1.1M parameters and 5.8B FLOPS.","PeriodicalId":348164,"journal":{"name":"2020 Digital Image Computing: Techniques and Applications (DICTA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134139342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信