Qutaber: task-based exploratory data analysis with enriched context awareness

IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Qi Jiang, Guodao Sun, Tong Li, Jingwei Tang, Wang Xia, Sujia Zhu, Ronghua Liang
{"title":"Qutaber: task-based exploratory data analysis with enriched context awareness","authors":"Qi Jiang, Guodao Sun, Tong Li, Jingwei Tang, Wang Xia, Sujia Zhu, Ronghua Liang","doi":"10.1007/s12650-024-00975-1","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>Exploratory data analysis (EDA) has emerged as a critical tool for users to gain deep insights into data and unearth hidden patterns. The integration of recommendation algorithms has enhanced its capabilities and further popularized its utilization. Most recommendation-based EDA methods concentrate on the extraction of pivotal insights from datasets, and the taxonomy of these insights is well-established. However, the support for further analytical endeavors to expand these initial findings remains constrained, as evidenced by the restricted scope of analytical intents that are tailored to specific scenarios. Moreover, these systems often lack sufficient context-awareness capabilities, failing to equip users with the necessary tools for a thorough exploration of extensive recommendations. To address these limitations, we introduce Qutaber, a task-based EDA system with enriched context-awareness. We first summarize six core analytical tasks tailored for EDA scenarios through literature reviews and expert interviews. Then, Qutaber integrates the use of small multiples, enhanced with a multi-metric re-ranking function, to enable a thorough and efficient examination of expanded charts pertaining to various analytical tasks. Furthermore, a machine learning method is leveraged to characterize the semantic features of these charts for a holistic landscape of recommended charts. Finally, a case study using a real-world dataset demonstrates Qutaber’s practical application, followed by a user study to further evaluate the usability of the proposed techniques. Our findings illustrate that Qutaber facilitates an effective and context-rich EDA experience for users.</p><h3 data-test=\"abstract-sub-heading\">Graphic abstract</h3>","PeriodicalId":54756,"journal":{"name":"Journal of Visualization","volume":"60 1","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visualization","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s12650-024-00975-1","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Exploratory data analysis (EDA) has emerged as a critical tool for users to gain deep insights into data and unearth hidden patterns. The integration of recommendation algorithms has enhanced its capabilities and further popularized its utilization. Most recommendation-based EDA methods concentrate on the extraction of pivotal insights from datasets, and the taxonomy of these insights is well-established. However, the support for further analytical endeavors to expand these initial findings remains constrained, as evidenced by the restricted scope of analytical intents that are tailored to specific scenarios. Moreover, these systems often lack sufficient context-awareness capabilities, failing to equip users with the necessary tools for a thorough exploration of extensive recommendations. To address these limitations, we introduce Qutaber, a task-based EDA system with enriched context-awareness. We first summarize six core analytical tasks tailored for EDA scenarios through literature reviews and expert interviews. Then, Qutaber integrates the use of small multiples, enhanced with a multi-metric re-ranking function, to enable a thorough and efficient examination of expanded charts pertaining to various analytical tasks. Furthermore, a machine learning method is leveraged to characterize the semantic features of these charts for a holistic landscape of recommended charts. Finally, a case study using a real-world dataset demonstrates Qutaber’s practical application, followed by a user study to further evaluate the usability of the proposed techniques. Our findings illustrate that Qutaber facilitates an effective and context-rich EDA experience for users.

Graphic abstract

Abstract Image

Qutaber:基于任务的探索性数据分析,具有丰富的上下文意识
摘要 探索性数据分析(EDA)已成为用户深入了解数据和发现隐藏模式的重要工具。推荐算法的集成增强了其功能,并进一步普及了其应用。大多数基于推荐的 EDA 方法都侧重于从数据集中提取关键见解,这些见解的分类方法也已确立。然而,对扩展这些初步发现的进一步分析工作的支持仍然受到限制,这一点从针对特定场景的分析意图范围有限就可见一斑。此外,这些系统往往缺乏足够的情境感知能力,无法为用户提供必要的工具来深入探讨广泛的建议。为了解决这些局限性,我们引入了 Qutaber,一个基于任务的、具有丰富情境感知能力的 EDA 系统。首先,我们通过文献综述和专家访谈总结了为 EDA 场景量身定制的六项核心分析任务。然后,Qutaber 整合了小倍数的使用,并通过多指标重新排序功能进行了增强,从而能够全面、高效地检查与各种分析任务相关的扩展图表。此外,还利用一种机器学习方法来描述这些图表的语义特征,从而获得推荐图表的整体景观。最后,使用真实世界数据集进行的案例研究展示了 Qutaber 的实际应用,随后进行了用户研究,以进一步评估所建议技术的可用性。我们的研究结果表明,Qutaber 为用户提供了有效且上下文丰富的 EDA 体验。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Visualization
Journal of Visualization COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS-IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY
CiteScore
3.40
自引率
5.90%
发文量
79
审稿时长
>12 weeks
期刊介绍: Visualization is an interdisciplinary imaging science devoted to making the invisible visible through the techniques of experimental visualization and computer-aided visualization. The scope of the Journal is to provide a place to exchange information on the latest visualization technology and its application by the presentation of latest papers of both researchers and technicians.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信