Small dataset augmentation with radial basis function approximation for causal discovery using constraint-based method

IF 1.3 4区 计算机科学 Q3 ENGINEERING, ELECTRICAL & ELECTRONIC
Chan Young Jung, Yun Jang
{"title":"Small dataset augmentation with radial basis function approximation for causal discovery using constraint-based method","authors":"Chan Young Jung, Yun Jang","doi":"10.4218/etrij.2023-0397","DOIUrl":null,"url":null,"abstract":"Causal analysis involves analysis and discovery. We consider causal discovery, which implies learning and discovering causal structures from available data, owing to the significance of interpreting causal relationships in various fields. Research on causal discovery has been primarily focused on constraint- and score-based interpretable methods rather than on methods based on complex deep learning models. However, identifying causal relationships in real-world datasets remains challenging. Numerous studies have been conducted using small datasets with established ground truths. Moreover, constraint-based methods are based on conditional independence tests. However, such tests have a lower statistical power when applied to small datasets. To solve the small sample size problem, we propose a model that generates a continuous function from available samples using radial basis function approximation. We address the problem by extracting data from the generated continuous function and evaluate the proposed method on both real and synthetic datasets generated by structural equation modeling. The proposed method outperforms constraint-based methods using only small datasets.","PeriodicalId":11901,"journal":{"name":"ETRI Journal","volume":"18 1","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2024-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ETRI Journal","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.4218/etrij.2023-0397","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Causal analysis involves analysis and discovery. We consider causal discovery, which implies learning and discovering causal structures from available data, owing to the significance of interpreting causal relationships in various fields. Research on causal discovery has been primarily focused on constraint- and score-based interpretable methods rather than on methods based on complex deep learning models. However, identifying causal relationships in real-world datasets remains challenging. Numerous studies have been conducted using small datasets with established ground truths. Moreover, constraint-based methods are based on conditional independence tests. However, such tests have a lower statistical power when applied to small datasets. To solve the small sample size problem, we propose a model that generates a continuous function from available samples using radial basis function approximation. We address the problem by extracting data from the generated continuous function and evaluate the proposed method on both real and synthetic datasets generated by structural equation modeling. The proposed method outperforms constraint-based methods using only small datasets.
使用基于约束的方法,用径向基函数近似法增加小数据集以发现因果关系
因果分析包括分析和发现。我们考虑因果发现,这意味着从可用数据中学习和发现因果结构,因为在各个领域解释因果关系具有重要意义。关于因果发现的研究主要集中在基于约束和分数的可解释方法,而不是基于复杂深度学习模型的方法。然而,在现实世界的数据集中识别因果关系仍然具有挑战性。许多研究都是使用具有既定基本事实的小型数据集进行的。此外,基于约束的方法是基于条件独立性测试的。然而,当应用于小型数据集时,此类检验的统计能力较低。为了解决样本量小的问题,我们提出了一种模型,利用径向基函数近似法从可用样本中生成连续函数。我们通过从生成的连续函数中提取数据来解决这个问题,并在通过结构方程建模生成的真实数据集和合成数据集上评估所提出的方法。在仅使用小数据集的情况下,所提出的方法优于基于约束的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ETRI Journal
ETRI Journal 工程技术-电信学
CiteScore
4.00
自引率
7.10%
发文量
98
审稿时长
6.9 months
期刊介绍: ETRI Journal is an international, peer-reviewed multidisciplinary journal published bimonthly in English. The main focus of the journal is to provide an open forum to exchange innovative ideas and technology in the fields of information, telecommunications, and electronics. Key topics of interest include high-performance computing, big data analytics, cloud computing, multimedia technology, communication networks and services, wireless communications and mobile computing, material and component technology, as well as security. With an international editorial committee and experts from around the world as reviewers, ETRI Journal publishes high-quality research papers on the latest and best developments from the global community.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信