Sinjoni Mukhopadhyay King , Faisal Nawab , Katia Obraczka
{"title":"Open source user mobility and activity datasets: Taxonomy and applications","authors":"Sinjoni Mukhopadhyay King , Faisal Nawab , Katia Obraczka","doi":"10.1016/j.adhoc.2025.103835","DOIUrl":null,"url":null,"abstract":"<div><div>The study of user mobility and activity (uMA) has numerous applications, including network resource planning, connected healthcare, localization, social media, and e-commerce. Current research in uMA heavily relies on open-source traces captured from pedestrian, vehicular, and application-based activities. These traces are rich in features and diverse, not only in the information they provide but also in their potential applications. However, this diversity presents two main challenges for researchers and practitioners who aim to utilize uMA datasets, classify existing datasets, or create new ones. Firstly, there is no readily available comprehensive classification of existing open-source uMA traces, making it typically labor-intensive and time-consuming to determine whether the identified datasets are suitable. Secondly, it is challenging to identify the key features and their specific use cases without conducting a detailed analysis of the traces.</div><div>This manuscript aims to address these challenges in three ways. First, we propose a taxonomy for classifying open-source mobility traces based on mobility mode, data source, collection technology, and application type. This taxonomy can be used to create tags for both existing and new datasets, making it easier to find problem-specific datasets compared to current search methods. Second, we demonstrate how existing datasets can be classified according to this taxonomy, providing examples of popular open-source uMA traces, along with information about their publishing source, licensing, and anonymization strategy. We also discuss how this taxonomy can guide the collection of new uMA datasets. Finally, we present three case studies using popular publicly available uMA datasets to illustrate how our taxonomy can be used to identify feature sets in the traces, helping to determine their applicability to specific use cases in networking, health, lifestyle, and location-based services.</div></div>","PeriodicalId":55555,"journal":{"name":"Ad Hoc Networks","volume":"173 ","pages":"Article 103835"},"PeriodicalIF":4.4000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ad Hoc Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1570870525000836","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The study of user mobility and activity (uMA) has numerous applications, including network resource planning, connected healthcare, localization, social media, and e-commerce. Current research in uMA heavily relies on open-source traces captured from pedestrian, vehicular, and application-based activities. These traces are rich in features and diverse, not only in the information they provide but also in their potential applications. However, this diversity presents two main challenges for researchers and practitioners who aim to utilize uMA datasets, classify existing datasets, or create new ones. Firstly, there is no readily available comprehensive classification of existing open-source uMA traces, making it typically labor-intensive and time-consuming to determine whether the identified datasets are suitable. Secondly, it is challenging to identify the key features and their specific use cases without conducting a detailed analysis of the traces.
This manuscript aims to address these challenges in three ways. First, we propose a taxonomy for classifying open-source mobility traces based on mobility mode, data source, collection technology, and application type. This taxonomy can be used to create tags for both existing and new datasets, making it easier to find problem-specific datasets compared to current search methods. Second, we demonstrate how existing datasets can be classified according to this taxonomy, providing examples of popular open-source uMA traces, along with information about their publishing source, licensing, and anonymization strategy. We also discuss how this taxonomy can guide the collection of new uMA datasets. Finally, we present three case studies using popular publicly available uMA datasets to illustrate how our taxonomy can be used to identify feature sets in the traces, helping to determine their applicability to specific use cases in networking, health, lifestyle, and location-based services.
期刊介绍:
The Ad Hoc Networks is an international and archival journal providing a publication vehicle for complete coverage of all topics of interest to those involved in ad hoc and sensor networking areas. The Ad Hoc Networks considers original, high quality and unpublished contributions addressing all aspects of ad hoc and sensor networks. Specific areas of interest include, but are not limited to:
Mobile and Wireless Ad Hoc Networks
Sensor Networks
Wireless Local and Personal Area Networks
Home Networks
Ad Hoc Networks of Autonomous Intelligent Systems
Novel Architectures for Ad Hoc and Sensor Networks
Self-organizing Network Architectures and Protocols
Transport Layer Protocols
Routing protocols (unicast, multicast, geocast, etc.)
Media Access Control Techniques
Error Control Schemes
Power-Aware, Low-Power and Energy-Efficient Designs
Synchronization and Scheduling Issues
Mobility Management
Mobility-Tolerant Communication Protocols
Location Tracking and Location-based Services
Resource and Information Management
Security and Fault-Tolerance Issues
Hardware and Software Platforms, Systems, and Testbeds
Experimental and Prototype Results
Quality-of-Service Issues
Cross-Layer Interactions
Scalability Issues
Performance Analysis and Simulation of Protocols.