关联规则分析:敏感性分析

IF 1.5 4区 数学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS
Ron S. Kenett, Chris Gotwalt
{"title":"关联规则分析:敏感性分析","authors":"Ron S. Kenett,&nbsp;Chris Gotwalt","doi":"10.1002/asmb.70022","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Association rules are extracting information from transactional databases of documents with a collection of items also called “tokens” or “words”. The approach is used in the analysis of text records, of social media and of consumer behaviour. We present an innovative sensitivity analysis of association rules (AR) measures of interest. In text analytics, a document term matrix (DTM) consists of rows referring to documents and columns corresponding to items. In binary weights, “1” indicates the presence of a term in a document and “0” otherwise. From a DTM one computes measures of interest characterising ARs. The approach we introduce is based on the application of befitting cross validation (BCV) principles to ARs. The sensitivity analysis of ARs is based on computer generated repeated shuffling of training and validation sets that provide an assessment of the uncertainty of AR measures of interest. We demonstrate this methodology with reports of symptoms associated with a Nicardipine drug product used in the treatment of high blood pressure and angina. Patients self-reports on side effect events are analysed. Association rules, derived from these reports, describe combinations of terms in these reports. AR measures of interest are defined in section 1. In section 2 we introduce the case study that motivates the method we propose. In section 3 we apply BCV principles by concatenating side effect events of Nicardipine by patient. Sensitivity analysis (SA) of ARs is introduced and demonstrated in section 4. The sensitivity analysis method presented here is discussed in section 5 where we formulate general data analysis considerations on how to organise and analyse semantic data.</p>\n </div>","PeriodicalId":55495,"journal":{"name":"Applied Stochastic Models in Business and Industry","volume":"41 3","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2025-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The Analysis of Association Rules: Sensitivity Analysis\",\"authors\":\"Ron S. Kenett,&nbsp;Chris Gotwalt\",\"doi\":\"10.1002/asmb.70022\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Association rules are extracting information from transactional databases of documents with a collection of items also called “tokens” or “words”. The approach is used in the analysis of text records, of social media and of consumer behaviour. We present an innovative sensitivity analysis of association rules (AR) measures of interest. In text analytics, a document term matrix (DTM) consists of rows referring to documents and columns corresponding to items. In binary weights, “1” indicates the presence of a term in a document and “0” otherwise. From a DTM one computes measures of interest characterising ARs. The approach we introduce is based on the application of befitting cross validation (BCV) principles to ARs. The sensitivity analysis of ARs is based on computer generated repeated shuffling of training and validation sets that provide an assessment of the uncertainty of AR measures of interest. We demonstrate this methodology with reports of symptoms associated with a Nicardipine drug product used in the treatment of high blood pressure and angina. Patients self-reports on side effect events are analysed. Association rules, derived from these reports, describe combinations of terms in these reports. AR measures of interest are defined in section 1. In section 2 we introduce the case study that motivates the method we propose. In section 3 we apply BCV principles by concatenating side effect events of Nicardipine by patient. Sensitivity analysis (SA) of ARs is introduced and demonstrated in section 4. The sensitivity analysis method presented here is discussed in section 5 where we formulate general data analysis considerations on how to organise and analyse semantic data.</p>\\n </div>\",\"PeriodicalId\":55495,\"journal\":{\"name\":\"Applied Stochastic Models in Business and Industry\",\"volume\":\"41 3\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-06-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Stochastic Models in Business and Industry\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/asmb.70022\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Stochastic Models in Business and Industry","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/asmb.70022","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

关联规则从带有一组项目(也称为“令牌”或“词”)的文档的事务性数据库中提取信息。这种方法被用于分析文本记录、社交媒体和消费者行为。我们提出了一个创新的敏感性分析的关联规则(AR)措施的兴趣。在文本分析中,文档术语矩阵(DTM)由引用文档的行和对应于项的列组成。在二进制权重中,“1”表示文档中存在某个术语,否则为“0”。从DTM中计算出表征ar的兴趣度量。我们介绍的方法是基于拟合交叉验证(BCV)原则在ar中的应用。AR的敏感性分析是基于计算机生成的训练集和验证集的重复洗牌,这些集提供了对感兴趣的AR测量的不确定性的评估。我们用一种尼卡地平药物产品治疗高血压和心绞痛的相关症状报告来证明这种方法。分析患者对副作用事件的自我报告。来自这些报告的关联规则描述了这些报告中术语的组合。感兴趣的应收帐款度量见第1节。在第2节中,我们介绍了激励我们提出的方法的案例研究。在第3节中,我们通过逐个患者串联尼卡地平的副作用事件来应用BCV原则。第4节介绍并演示了ar的敏感性分析(SA)。本文提出的敏感性分析方法将在第5节中进行讨论,其中我们制定了关于如何组织和分析语义数据的一般数据分析考虑因素。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
The Analysis of Association Rules: Sensitivity Analysis

Association rules are extracting information from transactional databases of documents with a collection of items also called “tokens” or “words”. The approach is used in the analysis of text records, of social media and of consumer behaviour. We present an innovative sensitivity analysis of association rules (AR) measures of interest. In text analytics, a document term matrix (DTM) consists of rows referring to documents and columns corresponding to items. In binary weights, “1” indicates the presence of a term in a document and “0” otherwise. From a DTM one computes measures of interest characterising ARs. The approach we introduce is based on the application of befitting cross validation (BCV) principles to ARs. The sensitivity analysis of ARs is based on computer generated repeated shuffling of training and validation sets that provide an assessment of the uncertainty of AR measures of interest. We demonstrate this methodology with reports of symptoms associated with a Nicardipine drug product used in the treatment of high blood pressure and angina. Patients self-reports on side effect events are analysed. Association rules, derived from these reports, describe combinations of terms in these reports. AR measures of interest are defined in section 1. In section 2 we introduce the case study that motivates the method we propose. In section 3 we apply BCV principles by concatenating side effect events of Nicardipine by patient. Sensitivity analysis (SA) of ARs is introduced and demonstrated in section 4. The sensitivity analysis method presented here is discussed in section 5 where we formulate general data analysis considerations on how to organise and analyse semantic data.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.70
自引率
0.00%
发文量
67
审稿时长
>12 weeks
期刊介绍: ASMBI - Applied Stochastic Models in Business and Industry (formerly Applied Stochastic Models and Data Analysis) was first published in 1985, publishing contributions in the interface between stochastic modelling, data analysis and their applications in business, finance, insurance, management and production. In 2007 ASMBI became the official journal of the International Society for Business and Industrial Statistics (www.isbis.org). The main objective is to publish papers, both technical and practical, presenting new results which solve real-life problems or have great potential in doing so. Mathematical rigour, innovative stochastic modelling and sound applications are the key ingredients of papers to be published, after a very selective review process. The journal is very open to new ideas, like Data Science and Big Data stemming from problems in business and industry or uncertainty quantification in engineering, as well as more traditional ones, like reliability, quality control, design of experiments, managerial processes, supply chains and inventories, insurance, econometrics, financial modelling (provided the papers are related to real problems). The journal is interested also in papers addressing the effects of business and industrial decisions on the environment, healthcare, social life. State-of-the art computational methods are very welcome as well, when combined with sound applications and innovative models.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信