{"title":"面向数字行为分类的互联网流量多标签数据集研究","authors":"Wenbin Li, Gaspard Quenard","doi":"10.1109/ICCCI51764.2021.9486831","DOIUrl":null,"url":null,"abstract":"With the digital transformation of model society, the deep understanding of digital behavior is critical for both users and service providers. Nevertheless this work is challenging due to the lack of an extensive model and the corresponding dataset to support digital behavior classification. In response to this, we presented in this work a complete process of modelling, data collection and classification of user digital behaviors over Internet: firstly the fundamental digital context model is introduced to provide a thorough understanding of digital behavior and digital environment properties. Based on the model, the data collection process is presented and a multi-label dataset of Internet traffic (MLDIT) has been collected with all model properties, finally a first series of classification experiments with MLDIT has been conducted showing promising results to identify user interaction state, applications and actions. Aiming at providing a thorough model of digital behavior and a reference process for data collection and classification, we expect likewise to attract community efforts to collaborate on the MLDIT enrichment.","PeriodicalId":180004,"journal":{"name":"2021 3rd International Conference on Computer Communication and the Internet (ICCCI)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Towards a Multi-Label Dataset of Internet Traffic for Digital Behavior Classification\",\"authors\":\"Wenbin Li, Gaspard Quenard\",\"doi\":\"10.1109/ICCCI51764.2021.9486831\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the digital transformation of model society, the deep understanding of digital behavior is critical for both users and service providers. Nevertheless this work is challenging due to the lack of an extensive model and the corresponding dataset to support digital behavior classification. In response to this, we presented in this work a complete process of modelling, data collection and classification of user digital behaviors over Internet: firstly the fundamental digital context model is introduced to provide a thorough understanding of digital behavior and digital environment properties. Based on the model, the data collection process is presented and a multi-label dataset of Internet traffic (MLDIT) has been collected with all model properties, finally a first series of classification experiments with MLDIT has been conducted showing promising results to identify user interaction state, applications and actions. Aiming at providing a thorough model of digital behavior and a reference process for data collection and classification, we expect likewise to attract community efforts to collaborate on the MLDIT enrichment.\",\"PeriodicalId\":180004,\"journal\":{\"name\":\"2021 3rd International Conference on Computer Communication and the Internet (ICCCI)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 3rd International Conference on Computer Communication and the Internet (ICCCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCI51764.2021.9486831\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 3rd International Conference on Computer Communication and the Internet (ICCCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCI51764.2021.9486831","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Towards a Multi-Label Dataset of Internet Traffic for Digital Behavior Classification
With the digital transformation of model society, the deep understanding of digital behavior is critical for both users and service providers. Nevertheless this work is challenging due to the lack of an extensive model and the corresponding dataset to support digital behavior classification. In response to this, we presented in this work a complete process of modelling, data collection and classification of user digital behaviors over Internet: firstly the fundamental digital context model is introduced to provide a thorough understanding of digital behavior and digital environment properties. Based on the model, the data collection process is presented and a multi-label dataset of Internet traffic (MLDIT) has been collected with all model properties, finally a first series of classification experiments with MLDIT has been conducted showing promising results to identify user interaction state, applications and actions. Aiming at providing a thorough model of digital behavior and a reference process for data collection and classification, we expect likewise to attract community efforts to collaborate on the MLDIT enrichment.