PitchIn: Eavesdropping via Intelligible Speech Reconstruction Using Non-acoustic Sensor Fusion

2017 16th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN) Pub Date : 2017-04-18 DOI:10.1145/3055031.3055088

Jun Han, Albert Jin Chung, P. Tague

{"title":"PitchIn: Eavesdropping via Intelligible Speech Reconstruction Using Non-acoustic Sensor Fusion","authors":"Jun Han, Albert Jin Chung, P. Tague","doi":"10.1145/3055031.3055088","DOIUrl":null,"url":null,"abstract":"Despite the advent of numerous Internet-of-Things (IoT) applications, recent research demonstrates potential side-channel vulnerabilities exploiting sensors which are used for event and environment monitoring. In this paper, we propose a new side-channel attack, where a network of distributed non-acoustic sensors can be exploited by an attacker to launch an eavesdropping attack by reconstructing intelligible speech signals. Specifically, we present PitchIn to demonstrate the feasibility of speech reconstruction from non-acoustic sensor data collected offline across networked devices. Unlike speech reconstruction which requires a high sampling frequency (e.g., > 5 KHz), typical applications using non-acoustic sensors do not rely on richly sampled data, presenting a challenge to the speech reconstruction attack. Hence, PitchIn leverages a distributed form of Time Interleaved Analog-Digital-Conversion (TI-ADC) to approximate a high sampling frequency, while maintaining low per-node sampling frequency. We demonstrate how distributed TI-ADC can be used to achieve intelligibility by processing an interleaved signal composed of different sensors across networked devices. We implement PitchIn and evaluate reconstructed speech signal intelligibility via user studies. PitchIn has word recognition accuracy as high as 79%. Though some additional work is required to improve accuracy, our results suggest that eavesdropping using a fusion of non-acoustic sensors is a real and practical threat.","PeriodicalId":228318,"journal":{"name":"2017 16th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"54","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 16th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3055031.3055088","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 54

Abstract

Despite the advent of numerous Internet-of-Things (IoT) applications, recent research demonstrates potential side-channel vulnerabilities exploiting sensors which are used for event and environment monitoring. In this paper, we propose a new side-channel attack, where a network of distributed non-acoustic sensors can be exploited by an attacker to launch an eavesdropping attack by reconstructing intelligible speech signals. Specifically, we present PitchIn to demonstrate the feasibility of speech reconstruction from non-acoustic sensor data collected offline across networked devices. Unlike speech reconstruction which requires a high sampling frequency (e.g., > 5 KHz), typical applications using non-acoustic sensors do not rely on richly sampled data, presenting a challenge to the speech reconstruction attack. Hence, PitchIn leverages a distributed form of Time Interleaved Analog-Digital-Conversion (TI-ADC) to approximate a high sampling frequency, while maintaining low per-node sampling frequency. We demonstrate how distributed TI-ADC can be used to achieve intelligibility by processing an interleaved signal composed of different sensors across networked devices. We implement PitchIn and evaluate reconstructed speech signal intelligibility via user studies. PitchIn has word recognition accuracy as high as 79%. Though some additional work is required to improve accuracy, our results suggest that eavesdropping using a fusion of non-acoustic sensors is a real and practical threat.

查看原文本刊更多论文

尽管出现了许多物联网(IoT)应用，但最近的研究表明，用于事件和环境监测的传感器存在潜在的侧信道漏洞。在本文中，我们提出了一种新的侧信道攻击，攻击者可以利用分布式非声学传感器网络通过重建可理解的语音信号来发动窃听攻击。具体来说，我们提出PitchIn来证明从跨网络设备离线收集的非声学传感器数据中进行语音重建的可行性。与需要高采样频率(例如> 5 KHz)的语音重建不同，使用非声学传感器的典型应用不依赖于丰富的采样数据，这对语音重建攻击提出了挑战。因此，PitchIn利用时间交错模数转换(TI-ADC)的分布式形式来近似高采样频率，同时保持低每节点采样频率。我们演示了如何使用分布式TI-ADC通过处理跨网络设备的不同传感器组成的交错信号来实现可理解性。我们实现了PitchIn，并通过用户研究来评估重构语音信号的可理解性。PitchIn的单词识别准确率高达79%。虽然需要做一些额外的工作来提高准确性，但我们的研究结果表明，使用非声学传感器融合的窃听是一个真实而实际的威胁。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 16th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN)

自引率

0.00%

发文量