Policy Based Synthesis: Data Generation and Augmentation Methods For RF Machine Learning

Rob Miller, S. Kokalj-Filipovic, Garrett M. Vanhoy, Joshua Morman
{"title":"Policy Based Synthesis: Data Generation and Augmentation Methods For RF Machine Learning","authors":"Rob Miller, S. Kokalj-Filipovic, Garrett M. Vanhoy, Joshua Morman","doi":"10.1109/GlobalSIP45357.2019.8969160","DOIUrl":null,"url":null,"abstract":"The current dataset generation methods for RF Machine Learning (RFML) tasks consist of either completely synthetically generated data or completely raw digitized data from an RF front end. The synthetic datasets are often unrealistic in terms of waveforms or protocols, and the raw captures are typically unlabeled (or often mislabeled), and can skew machine learning algorithms to focus on non-salient features. Further, the associated storage and processing requirements are quite large. In this work, a novel dataset generation and augmentation method called policy-based synthesis is presented that aims to address the short-comings of either approach by combining basic protocol knowledge with simulated channel and device impairments to supplement over-the-air captures made in a controlled environment. This method permits the learning of salient features and regularizes radio and device anomalies that are not of interest. Practical considerations for collecting and processing data for this hybridized approach are also detailed and examples are provided on a dataset that includes protocols commonly used in the 2.4 GHz ISM band such as Bluetooth and Wi-Fi.","PeriodicalId":221378,"journal":{"name":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Global Conference on Signal and Information Processing (GlobalSIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GlobalSIP45357.2019.8969160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

The current dataset generation methods for RF Machine Learning (RFML) tasks consist of either completely synthetically generated data or completely raw digitized data from an RF front end. The synthetic datasets are often unrealistic in terms of waveforms or protocols, and the raw captures are typically unlabeled (or often mislabeled), and can skew machine learning algorithms to focus on non-salient features. Further, the associated storage and processing requirements are quite large. In this work, a novel dataset generation and augmentation method called policy-based synthesis is presented that aims to address the short-comings of either approach by combining basic protocol knowledge with simulated channel and device impairments to supplement over-the-air captures made in a controlled environment. This method permits the learning of salient features and regularizes radio and device anomalies that are not of interest. Practical considerations for collecting and processing data for this hybridized approach are also detailed and examples are provided on a dataset that includes protocols commonly used in the 2.4 GHz ISM band such as Bluetooth and Wi-Fi.
基于策略的综合:射频机器学习的数据生成和增强方法
射频机器学习(RFML)任务的当前数据集生成方法包括来自射频前端的完全合成生成的数据或完全原始的数字化数据。合成数据集在波形或协议方面通常是不现实的,原始捕获通常是未标记的(或经常是错误标记的),并且可以扭曲机器学习算法,使其专注于非显著特征。此外,相关的存储和处理需求非常大。在这项工作中,提出了一种新的数据集生成和增强方法,称为基于策略的合成,旨在通过将基本协议知识与模拟信道和设备损伤相结合来解决这两种方法的缺点,以补充在受控环境中进行的空中捕获。这种方法允许学习显著特征,并使不感兴趣的无线电和设备异常规格化。还详细介绍了收集和处理这种混合方法数据的实际考虑因素,并在数据集上提供了示例,该数据集包括2.4 GHz ISM频段(如蓝牙和Wi-Fi)中常用的协议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信