语义耍蛇人搜索引擎:促进高科技产业领域数据科学的工具

Corrado Grappiolo, E. V. Gerwen, J. Verhoosel, L. Somers
{"title":"语义耍蛇人搜索引擎:促进高科技产业领域数据科学的工具","authors":"Corrado Grappiolo, E. V. Gerwen, J. Verhoosel, L. Somers","doi":"10.1145/3295750.3298915","DOIUrl":null,"url":null,"abstract":"The booming popularity of data science is also affecting high-tech industries. However, since these usually have different core competencies --- building cyber-physical systems rather than e.g. machine learning or data mining algorithms --- delving into data science by domain experts such as system engineers or architects might be more cumbersome than expected. In order to help domain experts to delve into data science we designed the Semantic Snake Charmer (SSC), a domain knowledge-based search engine for Jupyter Notebooks. SSC is composed of three modules: (1) a human-machine cooperative module to identify internal documentation which contains the most relevant domain knowledge, (2) a natural language processing module capable of transforming relevant documentation into several semantic graph types, (3) a reinforcement-learning based search engine which learns, given user feedback, the best mapping between input queries and semantic graph type to rely on. We believe SSC can be a fundamental asset to allow the easy landing of data science in industrial domains.","PeriodicalId":187771,"journal":{"name":"Proceedings of the 2019 Conference on Human Information Interaction and Retrieval","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"The Semantic Snake Charmer Search Engine: A Tool to Facilitate Data Science in High-tech Industry Domains\",\"authors\":\"Corrado Grappiolo, E. V. Gerwen, J. Verhoosel, L. Somers\",\"doi\":\"10.1145/3295750.3298915\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The booming popularity of data science is also affecting high-tech industries. However, since these usually have different core competencies --- building cyber-physical systems rather than e.g. machine learning or data mining algorithms --- delving into data science by domain experts such as system engineers or architects might be more cumbersome than expected. In order to help domain experts to delve into data science we designed the Semantic Snake Charmer (SSC), a domain knowledge-based search engine for Jupyter Notebooks. SSC is composed of three modules: (1) a human-machine cooperative module to identify internal documentation which contains the most relevant domain knowledge, (2) a natural language processing module capable of transforming relevant documentation into several semantic graph types, (3) a reinforcement-learning based search engine which learns, given user feedback, the best mapping between input queries and semantic graph type to rely on. We believe SSC can be a fundamental asset to allow the easy landing of data science in industrial domains.\",\"PeriodicalId\":187771,\"journal\":{\"name\":\"Proceedings of the 2019 Conference on Human Information Interaction and Retrieval\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-03-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2019 Conference on Human Information Interaction and Retrieval\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3295750.3298915\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 Conference on Human Information Interaction and Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3295750.3298915","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

摘要

数据科学的蓬勃发展也影响着高科技产业。然而,由于这些通常具有不同的核心能力——构建网络物理系统而不是机器学习或数据挖掘算法——由系统工程师或架构师等领域专家深入研究数据科学可能比预期的要麻烦得多。为了帮助领域专家深入研究数据科学,我们为Jupyter Notebooks设计了基于领域知识的搜索引擎Semantic Snake Charmer (SSC)。SSC由三个模块组成:(1)人机协作模块,用于识别包含最相关领域知识的内部文档;(2)自然语言处理模块,能够将相关文档转换为几种语义图类型;(3)基于强化学习的搜索引擎,在给定用户反馈的情况下,学习输入查询与语义图类型之间的最佳映射。我们相信,SSC可以成为数据科学在工业领域轻松落地的基础资产。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
The Semantic Snake Charmer Search Engine: A Tool to Facilitate Data Science in High-tech Industry Domains
The booming popularity of data science is also affecting high-tech industries. However, since these usually have different core competencies --- building cyber-physical systems rather than e.g. machine learning or data mining algorithms --- delving into data science by domain experts such as system engineers or architects might be more cumbersome than expected. In order to help domain experts to delve into data science we designed the Semantic Snake Charmer (SSC), a domain knowledge-based search engine for Jupyter Notebooks. SSC is composed of three modules: (1) a human-machine cooperative module to identify internal documentation which contains the most relevant domain knowledge, (2) a natural language processing module capable of transforming relevant documentation into several semantic graph types, (3) a reinforcement-learning based search engine which learns, given user feedback, the best mapping between input queries and semantic graph type to rely on. We believe SSC can be a fundamental asset to allow the easy landing of data science in industrial domains.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信