Multi-level Multi-task Learning for Modeling Cross-Scale Interactions in Nested Geospatial Data

Shuai Yuan, Jiayu Zhou, P. Tan, C. E. Fergus, T. Wagner, P. Soranno
{"title":"Multi-level Multi-task Learning for Modeling Cross-Scale Interactions in Nested Geospatial Data","authors":"Shuai Yuan, Jiayu Zhou, P. Tan, C. E. Fergus, T. Wagner, P. Soranno","doi":"10.1109/ICDM.2017.154","DOIUrl":null,"url":null,"abstract":"Predictive modeling of nested geospatial data is a challenging problem as the models must take into account potential interactions among variables defined at different spatial scales. These cross-scale interactions, as they are commonly known, are particularly important to understand relationships among ecological properties at macroscales. In this paper, we present a novel, multi-level multi-task learning framework for modeling nested geospatial data in the lake ecology domain. Specifically, we consider region-specific models to predict lake water quality from multi-scaled factors. Our framework enables distinct models to be developed for each region using both its local and regional information. The framework also allows information to be shared among the region-specific models through their common set of latent factors. Such information sharing helps to create more robust models especially for regions with limited or no training data. In addition, the framework can automatically determine cross-scale interactions between the regional variables and the local variables that are nested within them. Our experimental results show that the proposed framework outperforms all the baseline methods in at least 64% of the regions for 3 out of 4 lake water quality datasets evaluated in this study. Furthermore, the latent factors can be clustered to obtain a new set of regions that is more aligned with the response variables than the original regions that were defined a priori from the ecology domain.","PeriodicalId":254086,"journal":{"name":"2017 IEEE International Conference on Data Mining (ICDM)","volume":"46 24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Data Mining (ICDM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDM.2017.154","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Predictive modeling of nested geospatial data is a challenging problem as the models must take into account potential interactions among variables defined at different spatial scales. These cross-scale interactions, as they are commonly known, are particularly important to understand relationships among ecological properties at macroscales. In this paper, we present a novel, multi-level multi-task learning framework for modeling nested geospatial data in the lake ecology domain. Specifically, we consider region-specific models to predict lake water quality from multi-scaled factors. Our framework enables distinct models to be developed for each region using both its local and regional information. The framework also allows information to be shared among the region-specific models through their common set of latent factors. Such information sharing helps to create more robust models especially for regions with limited or no training data. In addition, the framework can automatically determine cross-scale interactions between the regional variables and the local variables that are nested within them. Our experimental results show that the proposed framework outperforms all the baseline methods in at least 64% of the regions for 3 out of 4 lake water quality datasets evaluated in this study. Furthermore, the latent factors can be clustered to obtain a new set of regions that is more aligned with the response variables than the original regions that were defined a priori from the ecology domain.
嵌套地理空间数据中跨尺度交互建模的多层次多任务学习
嵌套地理空间数据的预测建模是一个具有挑战性的问题,因为模型必须考虑在不同空间尺度上定义的变量之间潜在的相互作用。众所周知,这些跨尺度的相互作用对于理解宏观尺度上生态特性之间的关系尤为重要。在本文中,我们提出了一种新的、多层次的多任务学习框架,用于湖泊生态域嵌套地理空间数据的建模。具体而言,我们考虑了区域特定模型来从多尺度因子预测湖泊水质。我们的框架允许使用本地和区域信息为每个地区开发不同的模型。该框架还允许在特定区域的模型之间通过其共同的潜在因素集共享信息。这种信息共享有助于创建更健壮的模型,特别是对于训练数据有限或没有训练数据的地区。此外,该框架可以自动确定区域变量和嵌套在其中的局部变量之间的跨尺度交互。我们的实验结果表明,对于本研究评估的4个湖泊水质数据集中的3个,所提出的框架在至少64%的地区优于所有基线方法。此外,潜在因素可以聚类,以获得一组新的区域,这些区域比从生态域先验地定义的原始区域更符合响应变量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信