LVRF:一种基于潜变量的地理数据集挖掘方法

IF 0.4 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS
Liangdong Deng, Arpan Mahara, N. Rishe, Malek Adjouadi
{"title":"LVRF:一种基于潜变量的地理数据集挖掘方法","authors":"Liangdong Deng, Arpan Mahara, N. Rishe, Malek Adjouadi","doi":"10.58245/ipsi.tir.2302.02","DOIUrl":null,"url":null,"abstract":"Geographic datasets are usually accompanied by spatial non-stationarity – a phenomenon that the relationship between features varies across space. Naturally, nonstationarity can be interpreted as the underlying rule that decides how data are generated and alters over space. Therefore, traditional machine learning algorithms are not suitable for handling non-stationary geographic datasets, as they only render a single global model. To solve this problem, researchers often adopt the multiple-local-model approach, which uses different models to account for different sub-regions of space. This approach has been proven efficient but not optimal, as it is inherently difficult to decide the size of subregions. Additionally, the fact that local models are only trained on a subset of data also limits their potential. This paper proposes an entirely different strategy that interprets nonstationarity as a lack of data and addresses it by introducing latent variables to the original dataset. Backpropagation is then used to find the best values for these latent variables. Experiments show that this method is at least as efficient as multiple-local-model-based approaches and has even greater potential.","PeriodicalId":41192,"journal":{"name":"IPSI BgD Transactions on Internet Research","volume":"217 1","pages":""},"PeriodicalIF":0.4000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"LVRF: A Latent Variable Based Approach for Exploring Geographic Datasets\",\"authors\":\"Liangdong Deng, Arpan Mahara, N. Rishe, Malek Adjouadi\",\"doi\":\"10.58245/ipsi.tir.2302.02\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Geographic datasets are usually accompanied by spatial non-stationarity – a phenomenon that the relationship between features varies across space. Naturally, nonstationarity can be interpreted as the underlying rule that decides how data are generated and alters over space. Therefore, traditional machine learning algorithms are not suitable for handling non-stationary geographic datasets, as they only render a single global model. To solve this problem, researchers often adopt the multiple-local-model approach, which uses different models to account for different sub-regions of space. This approach has been proven efficient but not optimal, as it is inherently difficult to decide the size of subregions. Additionally, the fact that local models are only trained on a subset of data also limits their potential. This paper proposes an entirely different strategy that interprets nonstationarity as a lack of data and addresses it by introducing latent variables to the original dataset. Backpropagation is then used to find the best values for these latent variables. Experiments show that this method is at least as efficient as multiple-local-model-based approaches and has even greater potential.\",\"PeriodicalId\":41192,\"journal\":{\"name\":\"IPSI BgD Transactions on Internet Research\",\"volume\":\"217 1\",\"pages\":\"\"},\"PeriodicalIF\":0.4000,\"publicationDate\":\"2023-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IPSI BgD Transactions on Internet Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.58245/ipsi.tir.2302.02\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IPSI BgD Transactions on Internet Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.58245/ipsi.tir.2302.02","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 1

摘要

地理数据集通常伴随着空间非平稳性——一种特征之间的关系在空间上变化的现象。当然,非平稳性可以解释为决定数据如何生成和随空间变化的基本规则。因此,传统的机器学习算法不适合处理非平稳地理数据集,因为它们只呈现单一的全局模型。为了解决这一问题,研究人员通常采用多局部模型方法,即使用不同的模型来考虑空间的不同子区域。这种方法已被证明是有效的,但不是最佳的,因为确定分区域的大小本身就很困难。此外,局部模型只在数据子集上训练的事实也限制了它们的潜力。本文提出了一种完全不同的策略,将非平稳性解释为缺乏数据,并通过向原始数据集引入潜在变量来解决这个问题。然后使用反向传播来找到这些潜在变量的最佳值。实验表明,该方法至少与基于多局部模型的方法一样有效,并且具有更大的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
LVRF: A Latent Variable Based Approach for Exploring Geographic Datasets
Geographic datasets are usually accompanied by spatial non-stationarity – a phenomenon that the relationship between features varies across space. Naturally, nonstationarity can be interpreted as the underlying rule that decides how data are generated and alters over space. Therefore, traditional machine learning algorithms are not suitable for handling non-stationary geographic datasets, as they only render a single global model. To solve this problem, researchers often adopt the multiple-local-model approach, which uses different models to account for different sub-regions of space. This approach has been proven efficient but not optimal, as it is inherently difficult to decide the size of subregions. Additionally, the fact that local models are only trained on a subset of data also limits their potential. This paper proposes an entirely different strategy that interprets nonstationarity as a lack of data and addresses it by introducing latent variables to the original dataset. Backpropagation is then used to find the best values for these latent variables. Experiments show that this method is at least as efficient as multiple-local-model-based approaches and has even greater potential.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IPSI BgD Transactions on Internet Research
IPSI BgD Transactions on Internet Research COMPUTER SCIENCE, INFORMATION SYSTEMS-
自引率
25.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信