24th Irish Machine Vision and Image Processing Conference: Latest Publications

An NLP approach to Image Analysis
24th Irish Machine Vision and Image Processing Conference. Pub Date: 2022-08-31. DOI: 10.56541/kfbi5107
G. Martínez
{"title":"An NLP approach to Image Analysis","authors":"G. Martínez","doi":"10.56541/kfbi5107","DOIUrl":"https://doi.org/10.56541/kfbi5107","url":null,"abstract":"In Natural Language Processing, measuring word frequency combined with word distribution can yield a precise indicator of lexical relevance, a measure of great value in the context of Information Retrieval. Such detection of keywords exploits the structural properties of text as revealed notably by Zipf’s Law which describes frequency distribution as a ‘long tailed’ phenomenon. Can such properties be found in images? If so, can they serve to distinguish high content items (particular colours coded as RGBs) from low information items? To explore this possibility, we have applied NLP algorithms to a corpus of satellite images in order to extract a number of linguistic-type features in bitmaps so as to augment the original corpus with distributional information regarding its RGBs and observe if this addition improves accuracy throughout a Machine Learning pipeline tested with several Transfer Learning models.","PeriodicalId":180076,"journal":{"name":"24th Irish Machine Vision and Image Processing Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114978554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Reality Analagous Synthetic Dataset Generation with Daylight Variance for Deep Learning Classification
24th Irish Machine Vision and Image Processing Conference. Pub Date: 2022-08-31. DOI: 10.56541/poya9239
Thomas Lee, Susan Mckeever, J. Courtney
{"title":"Reality Analagous Synthetic Dataset Generation with Daylight Variance for Deep Learning Classification","authors":"Thomas Lee, Susan Mckeever, J. Courtney","doi":"10.56541/poya9239","DOIUrl":"https://doi.org/10.56541/poya9239","url":null,"abstract":"For the implementation of Autonomously navigating Unmanned Air Vehicles (UAV) in the real world, it must be shown that safe navigation is possible in all real-world scenarios. In the case of UAVs powered by Deep Learning algorithms, this is a difficult task to achieve, as the weak point of any trained network is the reduction in predictive capacity when presented with unfamiliar input data. It is possible to train for more use cases, however more data is required for this, requiring time and manpower to acquire. In this work, a potential solution to the manpower issues of exponentially scaling dataset size and complexity is presented, through the generation of artificial image datasets that are based off of a 3D scanned recreation of a physical space and populated with 3D scanned objects of a specific class. This simulation is then used to generate image samples that iterates temporally resulting in a slice-able dataset that contains time varied components of the same class.","PeriodicalId":180076,"journal":{"name":"24th Irish Machine Vision and Image Processing Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124829159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Distance measurement between smartphones within an ad-hoc camera array, using audible PRBS
24th Irish Machine Vision and Image Processing Conference. Pub Date: 2022-08-31. DOI: 10.56541/kebp4512
Pádraic McEvoy, D. Berry, Ted Burke
{"title":"Distance measurement between smartphones within an ad-hoc camera array, using audible PRBS","authors":"Pádraic McEvoy, D. Berry, Ted Burke","doi":"10.56541/kebp4512","DOIUrl":"https://doi.org/10.56541/kebp4512","url":null,"abstract":"An approach for measuring the distance between two smartphones is presented in this paper. The method uses each smartphone’s microphone(s) and speaker(s) to concurrently emit and record audio in order to calculate the sound propagation delay and hence distance. Each device in turn emits a different audible pseudo-random binary sequence (PRBS) - specifically, a maximum length sequence (MLS). Each device captures both emitted signals in one continuous recording. The propagation delay between the devices is calculated by comparing their respective recordings, and in particular the temporal positions of the emitted signals within each recording. Each device emits one of the signals, records both signals, and then sends its recording to a master device for analysis, which is performed by a custom web application and is therefore independent of operating system. A mean error of 32.29 mm was found in initial testing, which was conducted using Samsung Galaxy A10 devices running Android 10. The key innovation in this method is that it requires no clock time synchronisation between devices because the distance is determined by comparing inter-transmission delays in the two recordings. Potential future improvements are discussed, including how to take into account the exact locations of each phone’s microphone and speaker to increase accuracy.","PeriodicalId":180076,"journal":{"name":"24th Irish Machine Vision and Image Processing Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125104420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
A Comparison of Feature Extraction Methods Applied to Thermal Sensor Binary Image Data to Classify Bed Occupancy
24th Irish Machine Vision and Image Processing Conference. Pub Date: 2022-08-31. DOI: 10.56541/qlzv1440
Rebecca Hand, I. Cleland, C. Nugent
{"title":"A Comparison of Feature Extraction Methods Applied to Thermal Sensor Binary Image Data to Classify Bed Occupancy","authors":"Rebecca Hand, I. Cleland, C. Nugent","doi":"10.56541/qlzv1440","DOIUrl":"https://doi.org/10.56541/qlzv1440","url":null,"abstract":"Low-resolution thermal sensing technology is suitable for sleep monitoring due to being light invariant and privacy preserving. Feature extraction is a critical step in facilitating robust detection and tracking, therefore this paper compares a blob analysis approach of extracting statistical features to several common feature descriptor algorithm approaches (SURF and KAZE). The features are extracted from thermal binary image data for the purpose of detecting bed occupancy. Four common machine learning models (SVM, KNN, DT and NB) were trained and evaluated using a leave-one-subject-out validation method. The SVM trained with feature descriptor data achieved the highest accuracy of 0.961.","PeriodicalId":180076,"journal":{"name":"24th Irish Machine Vision and Image Processing Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133368388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
On the Feasibility of Privacy-Secured Facial Authentication for low-power IoT Devices - Quantifying the Effects of Head Pose Variation on End-to-End Neural Face Recognition
24th Irish Machine Vision and Image Processing Conference. Pub Date: 2022-08-31. DOI: 10.56541/fevr2516
Wang Yao, Viktor Varkarakis, Joseph Lemley, P. Corcoran
{"title":"On the Feasibility of Privacy-Secured Facial Authentication for low-power IoT Devices - Quantifying the Effects of Head Pose Variation on End-to-End Neural Face Recognition","authors":"Wang Yao, Viktor Varkarakis, Joseph Lemley, P. Corcoran","doi":"10.56541/fevr2516","DOIUrl":"https://doi.org/10.56541/fevr2516","url":null,"abstract":"Recent low-power neural accelerator hardware provides a solution for end-to-end privacy and secure facial authentication, such as smart refueling machine locks in shared accommodation, smart speakers, or televisions that respond only to family members. This work explores the impact that head pose variation has on the performance of a state-of-the-art face recognition model. A synthetic technique is employed to introduce head pose variation into data samples. Experiments show that the synthetic pose variations have a similar effect on face recognition performance as the real samples with pose variations. The impact of large variations of head poses on the face recognizer was then explored by further amplifying the angle of the synthetic head pose. It is found that the accuracy of the face recognition model deteriorates as the pose increases. After fine-tuning the network, the face recognition model achieves close to the accuracy of frontal faces in all pose variations, indicating that the face recognition model can be tuned to compensate for the effect of large poses.","PeriodicalId":180076,"journal":{"name":"24th Irish Machine Vision and Image Processing Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125932219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
A Comparative Study of Traditional Light Field Methods and NeRF
24th Irish Machine Vision and Image Processing Conference. Pub Date: 2022-08-31. DOI: 10.56541/iqkc6774
Pierre Matysiak, Susana Ruano, Martin Alain, A. Smolic
{"title":"A Comparative Study of Traditional Light Field Methods and NeRF","authors":"Pierre Matysiak, Susana Ruano, Martin Alain, A. Smolic","doi":"10.56541/iqkc6774","DOIUrl":"https://doi.org/10.56541/iqkc6774","url":null,"abstract":"Neural Radiance Fields (NeRF) is a recent technology which had a large impact in computer vision, promising to generate high quality novel views and corresponding disparity map, all using a fairly small number of input images. In effect, they are a new way to represent a light field. In this paper, we compare NeRF with traditional light field methods for novel view synthesis and depth estimation, in an attempt to quantify the advantages brought by NeRF, and to put these results in perspective with the way both paradigms are used practically. We provide qualitative and quantitative comparisons, discuss them and highlight some aspects of working with NeRF depending on the type of light field data used.","PeriodicalId":180076,"journal":{"name":"24th Irish Machine Vision and Image Processing Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127982842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
A machine vision system for avian song classification with CNN’s
24th Irish Machine Vision and Image Processing Conference. Pub Date: 2022-08-31. DOI: 10.56541/mhzn4111
Gabriel R. Palma, Ana Aquino, P. Monticelli, L. Verdade, C. Markham, Rafael Moral
{"title":"A machine vision system for avian song classification with CNN’s","authors":"Gabriel R. Palma, Ana Aquino, P. Monticelli, L. Verdade, C. Markham, Rafael Moral","doi":"10.56541/mhzn4111","DOIUrl":"https://doi.org/10.56541/mhzn4111","url":null,"abstract":"Soundscape ecologists aim to study the acoustic characteristics of an area that reflects natural processes [Schafer, 1977]. These sounds can be interpreted as biological (biophony), geophysical (geophony), and human-produced (anthrophony) [Pijanowski et al., 2011]. A common task is to use sounds to identify species based on the frequency content of a given signal. This signal can be further converted into spectrograms enabling other types of analysis to automate the identification of species. Based on the promising results of deep learning methods, such as Convolution Neural Networks (CNNs) in image classification, here we propose the use of a pre-trained VGG16 CNN architecture to identify two nocturnal avian species, namely Antrostomus rufus and Megascops choliba, commonly encountered in Brazilian forests. Monitoring the abundance of these species is important to ecologists to develop conservation programmes, detect environmental disturbances and assess the impact of human action. Specialists recorded sounds in 16-bit wave files at a sampling rate of 44Hz and classified the presence of these species. With the classified wave files, we created additional classes to visualise the performance of the VGG16 CNN architecture for detecting both species. We end up with six categories containing 60 seconds of audio of species vocalisation combinations and background only sounds. We produced spectrograms using the information from each RGB channel, only one channel (grey-scale), and applied the histogram equalisation technique to the grey-scale images. A comparison of the system performance using histogram equalised images and unmodified images was made. Histogram equalisation improves the contrast, and so the visibility to the human observer. Investigating the effect of histogram equalisation on the performance of the CNN was a feature of this study. Moreover, to show the practical application of our work, we created 51 minutes of audio, which contains more noise than the presence of both species (a scenario commonly encountered in field surveys). Our results showed that the trained VGG16 CNN produced, after 8000 epochs, a training accuracy of 100% for the three approaches. The test accuracy was 80.64%, 75.26%, and 67.74% for the RGB, grey-scaled, and histogram equalised approaches. The method’s accuracy on the synthetic audio file of 51 minutes was 92.15%. This accuracy level reveals the potential of CNN architectures in automating species detection and identification by sound using passive monitoring. Our results suggest that using coloured images to represent the spectrogram better generalises the classification than grey-scale and histogram equalised images. 
This study might develop future avian monitoring programmes based on passive sound recording, which significantly enhances sampling size without increasing cost.","PeriodicalId":180076,"journal":{"name":"24th Irish Machine Vision and Image Processing Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122676264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
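A minimal sketch of the pipeline the abstract describes, with assumed file handling, resizing and training details (none of these are the authors' settings): log-magnitude spectrograms are rendered as images and a frozen ImageNet VGG16 is fine-tuned on the six classes.

```python
# Sketch: audio -> spectrogram image -> VGG16 transfer learning.
import numpy as np
import tensorflow as tf
from scipy.io import wavfile
from scipy.signal import spectrogram

def spectrogram_image(wav_path, size=(224, 224)):
    """Normalised log-magnitude spectrogram resized to VGG16's input size."""
    fs, audio = wavfile.read(wav_path)
    if audio.ndim > 1:                        # mix stereo down to mono
        audio = audio.mean(axis=1)
    _, _, sxx = spectrogram(audio.astype(float), fs=fs)
    img = np.log1p(sxx)
    img = (img - img.min()) / (img.max() - img.min() + 1e-9)
    img = tf.image.resize(img[..., None], size)
    return tf.repeat(img, 3, axis=-1)         # grey -> 3 channels for VGG16

base = tf.keras.applications.VGG16(include_top=False, weights="imagenet",
                                   input_shape=(224, 224, 3), pooling="avg")
base.trainable = False                         # freeze the pre-trained features
model = tf.keras.Sequential([base, tf.keras.layers.Dense(6, activation="softmax")])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
# model.fit(batched_spectrograms, labels, epochs=...) over the six categories.
```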
Citations: 0
Detection and Isolation of 3D Objects in Unstructured Environments
24th Irish Machine Vision and Image Processing Conference. Pub Date: 2022-08-31. DOI: 10.56541/afmz9460
Dylan Do Couto, J. Butterfield, A. Murphy, K. Rafferty, Joseph Coleman
{"title":"Detection and Isolation of 3D Objects in Unstructured Environments","authors":"Dylan Do Couto, J. Butterfield, A. Murphy, K. Rafferty, Joseph Coleman","doi":"10.56541/afmz9460","DOIUrl":"https://doi.org/10.56541/afmz9460","url":null,"abstract":"3D machine vision is a growing trend in the filed of automation for Object Of Interest (OOI) interactions. This is most notable in sectors such as unorganised bin picking for manufacturing and the integration of Autonomous Guided Vehicles (AGVs) in logistics. In the literature, there is a key focus on advancing this area of research through methods of OOI recognition and isolation to simplify more established OOI analysis operations. The main constraint in current OOI isolation methods is the loss of important data and a long process duration which extends the overall run-time of 3D machine vision operations. In this paper we propose a new method of OOI isolation that utilises a combination of classical image processing techniques to reduce OOI data loss and improve run-time efficiency. Results show a high level of data retention with comparable faster run-times to previous research. This paper also hopes to present a series of run-time data points to set a standard for future process run-time comparisons.","PeriodicalId":180076,"journal":{"name":"24th Irish Machine Vision and Image Processing Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131809641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Towards Temporal Stability in Automatic Video Colourisation
24th Irish Machine Vision and Image Processing Conference. Pub Date: 2022-08-31. DOI: 10.56541/zvhf9195
Rory Ward, J. Breslin
{"title":"Towards Temporal Stability in Automatic Video Colourisation","authors":"Rory Ward, J. Breslin","doi":"10.56541/zvhf9195","DOIUrl":"https://doi.org/10.56541/zvhf9195","url":null,"abstract":"Much research has been carried out into the automatic restoration of archival images. This research ranges from colourisation, to damage restoration, and super-resolution. Conversely, video restoration hasremained largely unexplored. Most efforts to date have involved extending a concept from image restoration to video, in a frame-by-frame manner. These methods result in poor temporal consistency between frames. This manifests itself as temporal instability or flicker. The purpose of this work is to improve upon this limitation. This improvement will be achieved by employing a hybrid approach of deep-learning and exemplar based colourisation. Thus, informing current frame colourisation about its neighbouring frame’s colourisations and therefore alleviating the inter-frame discrepancy issues. This paper has two main contributions. Firstly, a novel end-to-end automatic video colourisation technique with enhanced flicker reduction capabilities is proposed. Secondly, six automatic exemplar acquisition algorithms are compared. The combination of these algorithms and techniques allow for an 8.5% increase in non-referenced image quality over the previous state of the art.","PeriodicalId":180076,"journal":{"name":"24th Irish Machine Vision and Image Processing Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116778621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Acoustic Source Localization Using Straight Line Approximations
24th Irish Machine Vision and Image Processing Conference. Pub Date: 2022-08-31. DOI: 10.56541/ljrb7078
Swarnadeep Bagchi, Ruairí de Fréin
{"title":"Acoustic Source Localization Using Straight Line Approximations","authors":"Swarnadeep Bagchi, Ruairí de Fréin","doi":"10.56541/ljrb7078","DOIUrl":"https://doi.org/10.56541/ljrb7078","url":null,"abstract":"The short paper extends an acoustic signal delay estimation method to general anechoic scenario using image processing techniques. The technique proposed in this paper localizes acoustic speech sources by creating a matrix of phase versus frequency histograms, where the same phases are stacked in appropriate bins. With larger delays and multiple sources coexisting in the same matrix, it becomes cluttered with activated bins. This results in high intensity spots on the spectrogram, making source discrimination difficult. In this paper, we have employed morphological filtering, chain-coding and straight line approximations to ignore noise and enhance the target signal features. Lastly, Hough transform is used for the source localization. The resulting estimates are accurate and invariant to the sampling-rate and shall have application in acoustic source separation.","PeriodicalId":180076,"journal":{"name":"24th Irish Machine Vision and Image Processing Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115383043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0