Kevin Hoarau, Pierre-Ugo Tournoux, Tahiry Razafindralambo
{"title":"BML:高效、通用的BGP数据集收集工具","authors":"Kevin Hoarau, Pierre-Ugo Tournoux, Tahiry Razafindralambo","doi":"10.1109/ICCWorkshops50388.2021.9473737","DOIUrl":null,"url":null,"abstract":"The Border Gateway Protocol (BGP) is in charge of the route exchange at the Internet scale. Anomalies in BGP’s behaviour can have several causes (e.g. mis-configuration, outage and attacks) and despite being rare, their consequences can threaten the Internet stability and reliability. The study of such anomalies requires the extraction of specific features and internet topology from BGP data. The literature shows that adhoc procedures and tools have been developed to extract specific features to train machine learning models for anomaly detection. In this paper we propose BML, a BGP dataset generation tool that extracts the majority of known features in the literature, the internet topology and that allows the user to build specific features from BGP data. We illustrate the use of BML on a BGP anomaly by extracting 32 synthetic features and 14 BGP’s graphs features which allow a comprehensive understanding of the Border Gateway Protocol.","PeriodicalId":127186,"journal":{"name":"2021 IEEE International Conference on Communications Workshops (ICC Workshops)","volume":"3 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"BML: An Efficient and Versatile Tool for BGP Dataset Collection\",\"authors\":\"Kevin Hoarau, Pierre-Ugo Tournoux, Tahiry Razafindralambo\",\"doi\":\"10.1109/ICCWorkshops50388.2021.9473737\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Border Gateway Protocol (BGP) is in charge of the route exchange at the Internet scale. Anomalies in BGP’s behaviour can have several causes (e.g. mis-configuration, outage and attacks) and despite being rare, their consequences can threaten the Internet stability and reliability. The study of such anomalies requires the extraction of specific features and internet topology from BGP data. The literature shows that adhoc procedures and tools have been developed to extract specific features to train machine learning models for anomaly detection. In this paper we propose BML, a BGP dataset generation tool that extracts the majority of known features in the literature, the internet topology and that allows the user to build specific features from BGP data. We illustrate the use of BML on a BGP anomaly by extracting 32 synthetic features and 14 BGP’s graphs features which allow a comprehensive understanding of the Border Gateway Protocol.\",\"PeriodicalId\":127186,\"journal\":{\"name\":\"2021 IEEE International Conference on Communications Workshops (ICC Workshops)\",\"volume\":\"3 3\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE International Conference on Communications Workshops (ICC Workshops)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCWorkshops50388.2021.9473737\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Communications Workshops (ICC Workshops)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCWorkshops50388.2021.9473737","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
BML: An Efficient and Versatile Tool for BGP Dataset Collection
The Border Gateway Protocol (BGP) is in charge of the route exchange at the Internet scale. Anomalies in BGP’s behaviour can have several causes (e.g. mis-configuration, outage and attacks) and despite being rare, their consequences can threaten the Internet stability and reliability. The study of such anomalies requires the extraction of specific features and internet topology from BGP data. The literature shows that adhoc procedures and tools have been developed to extract specific features to train machine learning models for anomaly detection. In this paper we propose BML, a BGP dataset generation tool that extracts the majority of known features in the literature, the internet topology and that allows the user to build specific features from BGP data. We illustrate the use of BML on a BGP anomaly by extracting 32 synthetic features and 14 BGP’s graphs features which allow a comprehensive understanding of the Border Gateway Protocol.