Detection of outliers with respect to a MUSIC geotechnical database

IF 3 3区 工程技术 Q2 ENGINEERING, GEOLOGICAL
Jianye Ching, Kok-Kwang Phoon, Pengsheng Huang
{"title":"Detection of outliers with respect to a MUSIC geotechnical database","authors":"Jianye Ching, Kok-Kwang Phoon, Pengsheng Huang","doi":"10.1139/cgj-2023-0188","DOIUrl":null,"url":null,"abstract":"This paper proposes a novel method that addresses a non-traditional class of outlier detection problems. The purpose of most outlier detection methods in the literature is to detect outliers within a dataset. A record can be considered as an outlier if it is distinct from the regular records in the dataset. However, the purpose of the novel outlier detection method proposed by this paper is to detect outlier data groups (a data group may denote a site or a project) with respect to a soil/rock property database. A data group is an outlier group if its characteristics (mean, variance, correlation, or higher order dependency) are distinct from the regular data groups in the database. This paper frames the outlier detection problem into a formal hypothesis testing problem with the null hypothesis “the target data group is identically distributed as the regular groups in the database”. With the hierarchical Bayesian model (HBM) previously developed by the first two authors, the p-value for this hypothesis testing problem can be estimated rigorously. Numerical and real examples show that the p-value can effectively detect outlier data groups as well as outlier records with respect to a database.","PeriodicalId":9382,"journal":{"name":"Canadian Geotechnical Journal","volume":"14 1","pages":"0"},"PeriodicalIF":3.0000,"publicationDate":"2023-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Canadian Geotechnical Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1139/cgj-2023-0188","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, GEOLOGICAL","Score":null,"Total":0}
引用次数: 1

Abstract

This paper proposes a novel method that addresses a non-traditional class of outlier detection problems. The purpose of most outlier detection methods in the literature is to detect outliers within a dataset. A record can be considered as an outlier if it is distinct from the regular records in the dataset. However, the purpose of the novel outlier detection method proposed by this paper is to detect outlier data groups (a data group may denote a site or a project) with respect to a soil/rock property database. A data group is an outlier group if its characteristics (mean, variance, correlation, or higher order dependency) are distinct from the regular data groups in the database. This paper frames the outlier detection problem into a formal hypothesis testing problem with the null hypothesis “the target data group is identically distributed as the regular groups in the database”. With the hierarchical Bayesian model (HBM) previously developed by the first two authors, the p-value for this hypothesis testing problem can be estimated rigorously. Numerical and real examples show that the p-value can effectively detect outlier data groups as well as outlier records with respect to a database.
MUSIC岩土工程数据库异常值的检测
本文提出了一种新的方法来解决一类非传统的离群值检测问题。文献中大多数离群值检测方法的目的是检测数据集中的离群值。如果一条记录与数据集中的常规记录不同,那么它可以被视为异常值。然而,本文提出的新颖离群检测方法的目的是检测相对于土壤/岩石属性数据库的离群数据组(一个数据组可以表示一个站点或一个项目)。如果数据组的特征(平均值、方差、相关性或高阶依赖性)不同于数据库中的常规数据组,则该数据组就是离群组。本文将离群点检测问题转化为一个具有“目标数据组与数据库中的规则组相同分布”的零假设的形式假设检验问题。使用前两位作者先前开发的层次贝叶斯模型(HBM),可以严格估计该假设检验问题的p值。数值和实际实例表明,p值可以有效地检测数据库的离群数据组和离群记录。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Canadian Geotechnical Journal
Canadian Geotechnical Journal 地学-地球科学综合
CiteScore
7.20
自引率
5.60%
发文量
163
审稿时长
7.5 months
期刊介绍: The Canadian Geotechnical Journal features articles, notes, reviews, and discussions related to new developments in geotechnical and geoenvironmental engineering, and applied sciences. The topics of papers written by researchers and engineers/scientists active in industry include soil and rock mechanics, material properties and fundamental behaviour, site characterization, foundations, excavations, tunnels, dams and embankments, slopes, landslides, geological and rock engineering, ground improvement, hydrogeology and contaminant hydrogeology, geochemistry, waste management, geosynthetics, offshore engineering, ice, frozen ground and northern engineering, risk and reliability applications, and physical and numerical modelling. Contributions that have practical relevance are preferred, including case records. Purely theoretical contributions are not generally published unless they are on a topic of special interest (like unsaturated soil mechanics or cold regions geotechnics) or they have direct practical value.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信