{"title":"基于归一化Google距离和语义相似度度量方法的供应商能力评分文本分析框架","authors":"Kimia Zandbiglari, F. Ameri, Mohammad Javadi","doi":"10.1115/1.4062173","DOIUrl":null,"url":null,"abstract":"\n The unstructured data available on the websites of manufacturing suppliers and contractors can provide valuable insights into their technological and organizational capabilities. However, since the capability data are often represented in an unstructured and informal fashion using natural language text, it is not easy to efficiently search and analyze the capability data and learn from it. The objective of this work is to propose framework to enable automated classification and ranking of suppliers based on their online capability descriptions in the context of a supplier search and discovery use case. The proposed text analytics methods used in this work are supported by a formal thesaurus that uses SKOS (Simple Knowledge Organization System) that provides lexical and structural semantics. Normalized Google Distance (NGD) is used as the metric for measuring the relatedness of terms when ranking suppliers based on their similarities with the queried capabilities. The proposed framework is validated experimentally using a hypothetical supplier search scenario. The results indicate that the generated ranked list is highly correlated with human judgment, especially when the search space is partitioned into multiple classes of suppliers with distinct capabilities. However, the correlation decreases when multiple overlapping classes of suppliers are merged together to form a heterogenous search space. The proposed framework can support supplier screening and discovery solutions by improving the precision, reliability, and intelligence of their underlying search engines.","PeriodicalId":54856,"journal":{"name":"Journal of Computing and Information Science in Engineering","volume":"4 1","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2023-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Text Analytics Framework for Supplier Capability Scoring Supported by Normalized Google Distance and Semantic Similarity Measurement Methods\",\"authors\":\"Kimia Zandbiglari, F. Ameri, Mohammad Javadi\",\"doi\":\"10.1115/1.4062173\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n The unstructured data available on the websites of manufacturing suppliers and contractors can provide valuable insights into their technological and organizational capabilities. However, since the capability data are often represented in an unstructured and informal fashion using natural language text, it is not easy to efficiently search and analyze the capability data and learn from it. The objective of this work is to propose framework to enable automated classification and ranking of suppliers based on their online capability descriptions in the context of a supplier search and discovery use case. The proposed text analytics methods used in this work are supported by a formal thesaurus that uses SKOS (Simple Knowledge Organization System) that provides lexical and structural semantics. Normalized Google Distance (NGD) is used as the metric for measuring the relatedness of terms when ranking suppliers based on their similarities with the queried capabilities. The proposed framework is validated experimentally using a hypothetical supplier search scenario. The results indicate that the generated ranked list is highly correlated with human judgment, especially when the search space is partitioned into multiple classes of suppliers with distinct capabilities. However, the correlation decreases when multiple overlapping classes of suppliers are merged together to form a heterogenous search space. The proposed framework can support supplier screening and discovery solutions by improving the precision, reliability, and intelligence of their underlying search engines.\",\"PeriodicalId\":54856,\"journal\":{\"name\":\"Journal of Computing and Information Science in Engineering\",\"volume\":\"4 1\",\"pages\":\"\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2023-03-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computing and Information Science in Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1115/1.4062173\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computing and Information Science in Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1115/1.4062173","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
A Text Analytics Framework for Supplier Capability Scoring Supported by Normalized Google Distance and Semantic Similarity Measurement Methods
The unstructured data available on the websites of manufacturing suppliers and contractors can provide valuable insights into their technological and organizational capabilities. However, since the capability data are often represented in an unstructured and informal fashion using natural language text, it is not easy to efficiently search and analyze the capability data and learn from it. The objective of this work is to propose framework to enable automated classification and ranking of suppliers based on their online capability descriptions in the context of a supplier search and discovery use case. The proposed text analytics methods used in this work are supported by a formal thesaurus that uses SKOS (Simple Knowledge Organization System) that provides lexical and structural semantics. Normalized Google Distance (NGD) is used as the metric for measuring the relatedness of terms when ranking suppliers based on their similarities with the queried capabilities. The proposed framework is validated experimentally using a hypothetical supplier search scenario. The results indicate that the generated ranked list is highly correlated with human judgment, especially when the search space is partitioned into multiple classes of suppliers with distinct capabilities. However, the correlation decreases when multiple overlapping classes of suppliers are merged together to form a heterogenous search space. The proposed framework can support supplier screening and discovery solutions by improving the precision, reliability, and intelligence of their underlying search engines.
期刊介绍:
The ASME Journal of Computing and Information Science in Engineering (JCISE) publishes articles related to Algorithms, Computational Methods, Computing Infrastructure, Computer-Interpretable Representations, Human-Computer Interfaces, Information Science, and/or System Architectures that aim to improve some aspect of product and system lifecycle (e.g., design, manufacturing, operation, maintenance, disposal, recycling etc.). Applications considered in JCISE manuscripts should be relevant to the mechanical engineering discipline. Papers can be focused on fundamental research leading to new methods, or adaptation of existing methods for new applications.
Scope: Advanced Computing Infrastructure; Artificial Intelligence; Big Data and Analytics; Collaborative Design; Computer Aided Design; Computer Aided Engineering; Computer Aided Manufacturing; Computational Foundations for Additive Manufacturing; Computational Foundations for Engineering Optimization; Computational Geometry; Computational Metrology; Computational Synthesis; Conceptual Design; Cybermanufacturing; Cyber Physical Security for Factories; Cyber Physical System Design and Operation; Data-Driven Engineering Applications; Engineering Informatics; Geometric Reasoning; GPU Computing for Design and Manufacturing; Human Computer Interfaces/Interactions; Industrial Internet of Things; Knowledge Engineering; Information Management; Inverse Methods for Engineering Applications; Machine Learning for Engineering Applications; Manufacturing Planning; Manufacturing Automation; Model-based Systems Engineering; Multiphysics Modeling and Simulation; Multiscale Modeling and Simulation; Multidisciplinary Optimization; Physics-Based Simulations; Process Modeling for Engineering Applications; Qualification, Verification and Validation of Computational Models; Symbolic Computing for Engineering Applications; Tolerance Modeling; Topology and Shape Optimization; Virtual and Augmented Reality Environments; Virtual Prototyping