Eliciting big data requirement from big data itself: A task-directed approach

Peng Wang, Ke Tao, Chenxu Gao, Xi Ning, Shuang Gu, Bo Deng
{"title":"Eliciting big data requirement from big data itself: A task-directed approach","authors":"Peng Wang, Ke Tao, Chenxu Gao, Xi Ning, Shuang Gu, Bo Deng","doi":"10.1109/SOFTWAREMINING.2017.8100849","DOIUrl":null,"url":null,"abstract":"The characteristics of big data not only challenge the processing methods of large volume of data, but also the way we make use of such semantic-rich resources, among which how users plan to manipulate the intermediate or final results requires to be well considered. This is especially challenging in building analytics systems as big data is richer in semantics and the heterogeneous data modalities also impose burdens on semantic fusion and visualization. While most of current research focuses on how to mine semantic information from big data, we emphasize the importance of the role of users in terms of requirement acquisition while building practical system functions in data analytics or management. This paper proposes a task-directed approach of requirement analysis for big data analytics which enhances requirement elicitation with modern data mining approaches. Based on the entities and semantic relationships learned from big data, a user-in-loop semi-automated requirement elicitation is carried out to generate a requirement repository which is affordable for further maintenance and modification. State-of-the-art user modeling methods can be incorporated aiming at refining the requirement iteratively according to the task interaction of users with system functions. Taking the case study of lifelogging analytics, we further discuss the advantages and limitations of our perspective.","PeriodicalId":377808,"journal":{"name":"2017 6th International Workshop on Software Mining (SoftwareMining)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 6th International Workshop on Software Mining (SoftwareMining)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SOFTWAREMINING.2017.8100849","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

The characteristics of big data not only challenge the processing methods of large volume of data, but also the way we make use of such semantic-rich resources, among which how users plan to manipulate the intermediate or final results requires to be well considered. This is especially challenging in building analytics systems as big data is richer in semantics and the heterogeneous data modalities also impose burdens on semantic fusion and visualization. While most of current research focuses on how to mine semantic information from big data, we emphasize the importance of the role of users in terms of requirement acquisition while building practical system functions in data analytics or management. This paper proposes a task-directed approach of requirement analysis for big data analytics which enhances requirement elicitation with modern data mining approaches. Based on the entities and semantic relationships learned from big data, a user-in-loop semi-automated requirement elicitation is carried out to generate a requirement repository which is affordable for further maintenance and modification. State-of-the-art user modeling methods can be incorporated aiming at refining the requirement iteratively according to the task interaction of users with system functions. Taking the case study of lifelogging analytics, we further discuss the advantages and limitations of our perspective.
从大数据本身引出大数据需求:以任务为导向的方法
大数据的特性不仅对大数据量的处理方法提出了挑战,也对我们如何利用这些语义丰富的资源提出了挑战,其中用户如何计划操纵中间或最终结果需要充分考虑。这在构建分析系统时尤其具有挑战性,因为大数据具有更丰富的语义,异构数据模式也给语义融合和可视化带来了负担。虽然目前大多数研究都集中在如何从大数据中挖掘语义信息,但我们强调用户在需求获取方面的重要性,同时在数据分析或管理中构建实用的系统功能。本文提出了一种面向任务的大数据需求分析方法,该方法利用现代数据挖掘方法增强了需求的提取。基于从大数据中学习到的实体和语义关系,进行用户在循环的半自动化需求提取,生成可负担得起的需求库,以供进一步维护和修改。可以结合最先进的用户建模方法,旨在根据用户与系统功能的任务交互迭代地细化需求。以生活日志分析为例,进一步讨论了本文观点的优点和局限性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信