客户项目的数据检索:将数据匹配到本体映射以生成相关性评估

2016 IEEE Systems and Information Engineering Design Symposium (SIEDS) Pub Date : 2016-04-29 DOI:10.1109/SIEDS.2016.7489304

Timothy Banach, Fanjia Kong, Ziding Liu, Dinesh Surapaneni, R. Bailey, Donald E. Brown

{"title":"客户项目的数据检索:将数据匹配到本体映射以生成相关性评估","authors":"Timothy Banach, Fanjia Kong, Ziding Liu, Dinesh Surapaneni, R. Bailey, Donald E. Brown","doi":"10.1109/SIEDS.2016.7489304","DOIUrl":null,"url":null,"abstract":"Discerning relevant data is becoming more difficult, time consuming, and costly as the amount of data available dramatically increases. Currently, consulting firms strive to use data to support client's business decisions with evidence. To be effective at this, consultants must consider the applicability of both internal and external data libraries to their clients' requirements. Frequently, evaluating the applicability of data sets is a manual process, which can be costly to the firm and the client. This paper describes a technical approach to automate this process. Specifically, it details the structure of a software application, named UVa Open Miner, capable of assessing the applicability of data sources to client projects. This UVa Open Miner aims to maximize the scale and diversity of candidate data sets, increase the relevance of data found, and maintain manageable computational complexity. UVa Open Miner consists of two segments: mapping and matching. The mapping component text mines web pages to identify an ontology of keywords describing the business requirement. This enables users to handle diverse business requirements from various industry verticals. The matching component scores data sets based on a relevance factor obtained from the ontology map. To validate the application, subject matter experts provided business requirements for a problem in their domain, and validated the application's results. Professionals in environment science, political science and policy-making fields found the application to be useful. Therefore, the application, along with the framework used, can be refactored into a reusable solution for consulting firms to use for their clients.","PeriodicalId":426864,"journal":{"name":"2016 IEEE Systems and Information Engineering Design Symposium (SIEDS)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Data retrieval for client projects: Matching data onto an ontology map to produce a relevance assessment\",\"authors\":\"Timothy Banach, Fanjia Kong, Ziding Liu, Dinesh Surapaneni, R. Bailey, Donald E. Brown\",\"doi\":\"10.1109/SIEDS.2016.7489304\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Discerning relevant data is becoming more difficult, time consuming, and costly as the amount of data available dramatically increases. Currently, consulting firms strive to use data to support client's business decisions with evidence. To be effective at this, consultants must consider the applicability of both internal and external data libraries to their clients' requirements. Frequently, evaluating the applicability of data sets is a manual process, which can be costly to the firm and the client. This paper describes a technical approach to automate this process. Specifically, it details the structure of a software application, named UVa Open Miner, capable of assessing the applicability of data sources to client projects. This UVa Open Miner aims to maximize the scale and diversity of candidate data sets, increase the relevance of data found, and maintain manageable computational complexity. UVa Open Miner consists of two segments: mapping and matching. The mapping component text mines web pages to identify an ontology of keywords describing the business requirement. This enables users to handle diverse business requirements from various industry verticals. The matching component scores data sets based on a relevance factor obtained from the ontology map. To validate the application, subject matter experts provided business requirements for a problem in their domain, and validated the application's results. Professionals in environment science, political science and policy-making fields found the application to be useful. Therefore, the application, along with the framework used, can be refactored into a reusable solution for consulting firms to use for their clients.\",\"PeriodicalId\":426864,\"journal\":{\"name\":\"2016 IEEE Systems and Information Engineering Design Symposium (SIEDS)\",\"volume\":\"90 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-04-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE Systems and Information Engineering Design Symposium (SIEDS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SIEDS.2016.7489304\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE Systems and Information Engineering Design Symposium (SIEDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIEDS.2016.7489304","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

随着可用数据量的急剧增加，识别相关数据变得越来越困难、耗时和昂贵。目前，咨询公司正在努力使用数据来支持客户的商业决策。为了有效地做到这一点，顾问必须考虑内部和外部数据库对客户需求的适用性。通常，评估数据集的适用性是一个手动过程，这对公司和客户来说可能是昂贵的。本文描述了一种自动化此过程的技术方法。具体来说，它详细介绍了一个名为UVa Open Miner的软件应用程序的结构，该应用程序能够评估数据源对客户项目的适用性。这个UVa Open Miner旨在最大限度地扩大候选数据集的规模和多样性，增加发现数据的相关性，并保持可管理的计算复杂性。UVa Open Miner包括两个部分:映射和匹配。映射组件文本挖掘web页面，以识别描述业务需求的关键字本体。这使用户能够处理来自不同垂直行业的各种业务需求。匹配组件根据从本体图中获得的相关因子对数据集进行评分。为了验证应用程序，主题专家为其领域中的问题提供了业务需求，并验证了应用程序的结果。环境科学、政治学和政策制定领域的专业人士发现该应用程序非常有用。因此，应用程序以及所使用的框架可以重构为可重用的解决方案，供咨询公司为其客户使用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Data retrieval for client projects: Matching data onto an ontology map to produce a relevance assessment

Discerning relevant data is becoming more difficult, time consuming, and costly as the amount of data available dramatically increases. Currently, consulting firms strive to use data to support client's business decisions with evidence. To be effective at this, consultants must consider the applicability of both internal and external data libraries to their clients' requirements. Frequently, evaluating the applicability of data sets is a manual process, which can be costly to the firm and the client. This paper describes a technical approach to automate this process. Specifically, it details the structure of a software application, named UVa Open Miner, capable of assessing the applicability of data sources to client projects. This UVa Open Miner aims to maximize the scale and diversity of candidate data sets, increase the relevance of data found, and maintain manageable computational complexity. UVa Open Miner consists of two segments: mapping and matching. The mapping component text mines web pages to identify an ontology of keywords describing the business requirement. This enables users to handle diverse business requirements from various industry verticals. The matching component scores data sets based on a relevance factor obtained from the ontology map. To validate the application, subject matter experts provided business requirements for a problem in their domain, and validated the application's results. Professionals in environment science, political science and policy-making fields found the application to be useful. Therefore, the application, along with the framework used, can be refactored into a reusable solution for consulting firms to use for their clients.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 IEEE Systems and Information Engineering Design Symposium (SIEDS)

自引率

0.00%

发文量