基于智能手机的室内定位数据集调查:机器学习视角

IF 7.6 3区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Gaetano Carmelo La Delfa , Javier Prieto , Salvatore Monteleone , Hamaad Rafique , Maurizio Palesi , Davide Patti
{"title":"基于智能手机的室内定位数据集调查:机器学习视角","authors":"Gaetano Carmelo La Delfa ,&nbsp;Javier Prieto ,&nbsp;Salvatore Monteleone ,&nbsp;Hamaad Rafique ,&nbsp;Maurizio Palesi ,&nbsp;Davide Patti","doi":"10.1016/j.iot.2025.101753","DOIUrl":null,"url":null,"abstract":"<div><div>Indoor localization has gained significant attention in recent years due to its applications across sectors such as healthcare, logistics, manufacturing, and retail. However, while outdoor localization has been effectively addressed with GPS, indoor localization remains challenging despite significant research progress. Many studies have explored the capabilities of modern smartphones, equipped with a variety of sensors, to develop machine-learning methods for indoor localization, ranging from classical fingerprinting to deep sequence models and transformers. Nevertheless, most rely on small, proprietary datasets that are not publicly available. Large, high-quality public datasets are essential for researchers to efficiently test, refine, and validate algorithms, enable comparisons between different approaches and develop robust and accurate localization solutions. To reduce data collection time and costs and help researchers find the most appropriate datasets for their needs, this paper surveys 20 publicly available high-quality indoor localization datasets suitable for Machine Learning, released between 2014 and 2024, that cover various sensing technologies. The survey reveals a shift toward multi-sensor data collection, extending beyond Wi-Fi and Bluetooth signals to include inertial sensors such as accelerometers and gyroscopes, as well as magnetic fields. It also highlights that while over 75% of datasets cover multi-floor structures or multiple buildings, there is a scarcity of datasets covering diverse types of indoor environments, with most focused on office or academic settings. Moreover, the temporal dimension, crucial in dynamic indoor scenarios, remains largely underrepresented, limiting the development of ML models for tracking dynamic trajectories or adapting to evolving signal patterns.</div></div>","PeriodicalId":29968,"journal":{"name":"Internet of Things","volume":"34 ","pages":"Article 101753"},"PeriodicalIF":7.6000,"publicationDate":"2025-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Survey of smartphone-based datasets for indoor localization: A machine learning perspective\",\"authors\":\"Gaetano Carmelo La Delfa ,&nbsp;Javier Prieto ,&nbsp;Salvatore Monteleone ,&nbsp;Hamaad Rafique ,&nbsp;Maurizio Palesi ,&nbsp;Davide Patti\",\"doi\":\"10.1016/j.iot.2025.101753\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Indoor localization has gained significant attention in recent years due to its applications across sectors such as healthcare, logistics, manufacturing, and retail. However, while outdoor localization has been effectively addressed with GPS, indoor localization remains challenging despite significant research progress. Many studies have explored the capabilities of modern smartphones, equipped with a variety of sensors, to develop machine-learning methods for indoor localization, ranging from classical fingerprinting to deep sequence models and transformers. Nevertheless, most rely on small, proprietary datasets that are not publicly available. Large, high-quality public datasets are essential for researchers to efficiently test, refine, and validate algorithms, enable comparisons between different approaches and develop robust and accurate localization solutions. To reduce data collection time and costs and help researchers find the most appropriate datasets for their needs, this paper surveys 20 publicly available high-quality indoor localization datasets suitable for Machine Learning, released between 2014 and 2024, that cover various sensing technologies. The survey reveals a shift toward multi-sensor data collection, extending beyond Wi-Fi and Bluetooth signals to include inertial sensors such as accelerometers and gyroscopes, as well as magnetic fields. It also highlights that while over 75% of datasets cover multi-floor structures or multiple buildings, there is a scarcity of datasets covering diverse types of indoor environments, with most focused on office or academic settings. Moreover, the temporal dimension, crucial in dynamic indoor scenarios, remains largely underrepresented, limiting the development of ML models for tracking dynamic trajectories or adapting to evolving signal patterns.</div></div>\",\"PeriodicalId\":29968,\"journal\":{\"name\":\"Internet of Things\",\"volume\":\"34 \",\"pages\":\"Article 101753\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-08-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Internet of Things\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2542660525002665\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet of Things","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2542660525002665","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

近年来,由于其在医疗保健、物流、制造和零售等领域的应用,室内本地化得到了极大的关注。然而,尽管GPS已经有效地解决了室外定位问题,但室内定位仍然具有挑战性,尽管研究取得了重大进展。许多研究已经探索了配备各种传感器的现代智能手机的功能,以开发用于室内定位的机器学习方法,从经典的指纹识别到深度序列模型和变压器。然而,大多数依赖于小型的专有数据集,这些数据集不公开可用。大型、高质量的公共数据集对于研究人员有效地测试、改进和验证算法,实现不同方法之间的比较以及开发强大而准确的定位解决方案至关重要。为了减少数据收集时间和成本,并帮助研究人员找到最适合他们需求的数据集,本文调查了2014年至2024年间发布的20个公开可用的适用于机器学习的高质量室内定位数据集,涵盖了各种传感技术。该调查揭示了向多传感器数据收集的转变,从Wi-Fi和蓝牙信号扩展到包括惯性传感器,如加速度计和陀螺仪,以及磁场。报告还强调,虽然超过75%的数据集覆盖多层结构或多栋建筑,但覆盖不同类型室内环境的数据集却很稀缺,大多数集中在办公室或学术环境。此外,在动态室内场景中至关重要的时间维度在很大程度上仍未得到充分体现,这限制了用于跟踪动态轨迹或适应不断变化的信号模式的ML模型的发展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Survey of smartphone-based datasets for indoor localization: A machine learning perspective
Indoor localization has gained significant attention in recent years due to its applications across sectors such as healthcare, logistics, manufacturing, and retail. However, while outdoor localization has been effectively addressed with GPS, indoor localization remains challenging despite significant research progress. Many studies have explored the capabilities of modern smartphones, equipped with a variety of sensors, to develop machine-learning methods for indoor localization, ranging from classical fingerprinting to deep sequence models and transformers. Nevertheless, most rely on small, proprietary datasets that are not publicly available. Large, high-quality public datasets are essential for researchers to efficiently test, refine, and validate algorithms, enable comparisons between different approaches and develop robust and accurate localization solutions. To reduce data collection time and costs and help researchers find the most appropriate datasets for their needs, this paper surveys 20 publicly available high-quality indoor localization datasets suitable for Machine Learning, released between 2014 and 2024, that cover various sensing technologies. The survey reveals a shift toward multi-sensor data collection, extending beyond Wi-Fi and Bluetooth signals to include inertial sensors such as accelerometers and gyroscopes, as well as magnetic fields. It also highlights that while over 75% of datasets cover multi-floor structures or multiple buildings, there is a scarcity of datasets covering diverse types of indoor environments, with most focused on office or academic settings. Moreover, the temporal dimension, crucial in dynamic indoor scenarios, remains largely underrepresented, limiting the development of ML models for tracking dynamic trajectories or adapting to evolving signal patterns.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Internet of Things
Internet of Things Multiple-
CiteScore
3.60
自引率
5.10%
发文量
115
审稿时长
37 days
期刊介绍: Internet of Things; Engineering Cyber Physical Human Systems is a comprehensive journal encouraging cross collaboration between researchers, engineers and practitioners in the field of IoT & Cyber Physical Human Systems. The journal offers a unique platform to exchange scientific information on the entire breadth of technology, science, and societal applications of the IoT. The journal will place a high priority on timely publication, and provide a home for high quality. Furthermore, IOT is interested in publishing topical Special Issues on any aspect of IOT.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信