利用领域专业知识进行探索性数据分析

Tristan Langer, Tobias Meisen
{"title":"利用领域专业知识进行探索性数据分析","authors":"Tristan Langer, Tobias Meisen","doi":"10.1145/3356422.3356434","DOIUrl":null,"url":null,"abstract":"In exploratory data analysis, domain knowledge and experience play a central role in order to extract information from the data and to derive proof and knowledge. However, experienced domain experts are rarely the same people who carry out the data analyses. Therefore, utilizing domain expertise for guidance in analytic processes is a complex challenge. In recent years, machine learning has seen great advances. Increasing processing power and growth in data as well as affordable storage have led to more advanced algorithms. Therefore, with the emergence of applicable machine learning algorithms, there is now a method for preserving and making use even of complex knowledge. In this paper, we present a concept that allows to extract and utilize domain knowledge for exploratory data analysis. We introduce concepts of interaction store and analysis context store to record user interaction and context during an exploratory analysis. We use the recorded data to construct semantic interaction sequences and predict their potential insight. The prediction can then be used to guide other data scientist in their sensemaking while performing exploratory data analysis in similar domains and use cases. Furthermore, we discuss possible research opportunities and implications resulting from the presented concept.","PeriodicalId":197051,"journal":{"name":"Proceedings of the 12th International Symposium on Visual Information Communication and Interaction","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Towards Utilizing Domain Expertise for Exploratory Data Analysis\",\"authors\":\"Tristan Langer, Tobias Meisen\",\"doi\":\"10.1145/3356422.3356434\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In exploratory data analysis, domain knowledge and experience play a central role in order to extract information from the data and to derive proof and knowledge. However, experienced domain experts are rarely the same people who carry out the data analyses. Therefore, utilizing domain expertise for guidance in analytic processes is a complex challenge. In recent years, machine learning has seen great advances. Increasing processing power and growth in data as well as affordable storage have led to more advanced algorithms. Therefore, with the emergence of applicable machine learning algorithms, there is now a method for preserving and making use even of complex knowledge. In this paper, we present a concept that allows to extract and utilize domain knowledge for exploratory data analysis. We introduce concepts of interaction store and analysis context store to record user interaction and context during an exploratory analysis. We use the recorded data to construct semantic interaction sequences and predict their potential insight. The prediction can then be used to guide other data scientist in their sensemaking while performing exploratory data analysis in similar domains and use cases. Furthermore, we discuss possible research opportunities and implications resulting from the presented concept.\",\"PeriodicalId\":197051,\"journal\":{\"name\":\"Proceedings of the 12th International Symposium on Visual Information Communication and Interaction\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 12th International Symposium on Visual Information Communication and Interaction\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3356422.3356434\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 12th International Symposium on Visual Information Communication and Interaction","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3356422.3356434","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

在探索性数据分析中,为了从数据中提取信息并获得证据和知识,领域知识和经验起着核心作用。然而,经验丰富的领域专家很少是执行数据分析的人。因此,在分析过程中利用领域专业知识进行指导是一个复杂的挑战。近年来,机器学习取得了巨大的进步。处理能力的增强和数据的增长以及可负担的存储导致了更先进的算法。因此,随着适用的机器学习算法的出现,现在有了一种保存和利用复杂知识的方法。在本文中,我们提出了一个概念,允许提取和利用领域知识进行探索性数据分析。我们引入了交互存储和分析上下文存储的概念,以记录探索性分析期间的用户交互和上下文。我们使用记录的数据来构建语义交互序列并预测其潜在的洞察力。然后,预测可以用来指导其他数据科学家在类似领域和用例中执行探索性数据分析时进行意义构建。此外,我们讨论了可能的研究机会和由此产生的概念。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Towards Utilizing Domain Expertise for Exploratory Data Analysis
In exploratory data analysis, domain knowledge and experience play a central role in order to extract information from the data and to derive proof and knowledge. However, experienced domain experts are rarely the same people who carry out the data analyses. Therefore, utilizing domain expertise for guidance in analytic processes is a complex challenge. In recent years, machine learning has seen great advances. Increasing processing power and growth in data as well as affordable storage have led to more advanced algorithms. Therefore, with the emergence of applicable machine learning algorithms, there is now a method for preserving and making use even of complex knowledge. In this paper, we present a concept that allows to extract and utilize domain knowledge for exploratory data analysis. We introduce concepts of interaction store and analysis context store to record user interaction and context during an exploratory analysis. We use the recorded data to construct semantic interaction sequences and predict their potential insight. The prediction can then be used to guide other data scientist in their sensemaking while performing exploratory data analysis in similar domains and use cases. Furthermore, we discuss possible research opportunities and implications resulting from the presented concept.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信