Measuring the Direction of Innovation: Frontier Tools in Unassisted Machine Learning

Florenta Teodoridis, Jino Lu, Jeffrey L. Furman, Jeffrey L. Furman
{"title":"Measuring the Direction of Innovation: Frontier Tools in Unassisted Machine Learning","authors":"Florenta Teodoridis, Jino Lu, Jeffrey L. Furman, Jeffrey L. Furman","doi":"10.2139/ssrn.3596233","DOIUrl":null,"url":null,"abstract":"Understanding the factors affecting the direction of innovation is a central aim of research in the economics and strategic management of innovation. Progress on this topic has been inhibited by difficulties in measuring the location and movement of innovation in ideas space. To date, most efforts at measuring the direction of innovation rely on curated taxonomies, such as technology classes and keyword approaches, which either adapt slowly or are subject to gaming, and early generations of text analysis, which provide information on the similarity of sets of words, but not on the number of paths or direction of change. Relative to these, recent advances in machine learning offer promising paths forward. In this paper, we introduce and explore a particular approach based on an unassisted machine learning technique, Hierarchical Dirichlet Process (HDP), that flexibly generates categories from a corpus of text and enables calculations of the distance across knowledge categories and movement in ideas space. We apply our algorithm to the corpus of USPTO patent abstracts from the period 2000-2018 and demonstrate that, relative to the USPTO taxonomy of patent classes, our algorithm provides a leading indicator of shift in innovation topics and enables a more precise analysis of movement in ideas space. Working with such measures is important because it enables more accurate estimates of the direction of innovation and, hence, of economic actors’ responses to competitive environments and public policies. We share our algorithm, which can be applied to other innovation text corpora, as well as the patent data and measures we develop, with the aim of facilitating additional inquiries regarding the direction of innovation.","PeriodicalId":125544,"journal":{"name":"ERN: Intellectual Property (Topic)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERN: Intellectual Property (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3596233","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Understanding the factors affecting the direction of innovation is a central aim of research in the economics and strategic management of innovation. Progress on this topic has been inhibited by difficulties in measuring the location and movement of innovation in ideas space. To date, most efforts at measuring the direction of innovation rely on curated taxonomies, such as technology classes and keyword approaches, which either adapt slowly or are subject to gaming, and early generations of text analysis, which provide information on the similarity of sets of words, but not on the number of paths or direction of change. Relative to these, recent advances in machine learning offer promising paths forward. In this paper, we introduce and explore a particular approach based on an unassisted machine learning technique, Hierarchical Dirichlet Process (HDP), that flexibly generates categories from a corpus of text and enables calculations of the distance across knowledge categories and movement in ideas space. We apply our algorithm to the corpus of USPTO patent abstracts from the period 2000-2018 and demonstrate that, relative to the USPTO taxonomy of patent classes, our algorithm provides a leading indicator of shift in innovation topics and enables a more precise analysis of movement in ideas space. Working with such measures is important because it enables more accurate estimates of the direction of innovation and, hence, of economic actors’ responses to competitive environments and public policies. We share our algorithm, which can be applied to other innovation text corpora, as well as the patent data and measures we develop, with the aim of facilitating additional inquiries regarding the direction of innovation.
衡量创新方向:无辅助机器学习的前沿工具
了解影响创新方向的因素是创新经济学和战略管理研究的中心目标。由于难以衡量创新在思想空间中的位置和运动,这一主题的进展受到阻碍。到目前为止,大多数衡量创新方向的努力都依赖于有组织的分类法,如技术类别和关键字方法,它们要么适应缓慢,要么受游戏影响,还有早期的文本分析,它提供了单词集的相似性信息,但不提供路径的数量或变化的方向。相对于这些,机器学习的最新进展提供了有希望的前进道路。在本文中,我们介绍并探索了一种基于无辅助机器学习技术的特殊方法,即层次狄利克雷过程(HDP),该方法可以灵活地从文本语料库中生成类别,并能够计算知识类别之间的距离和思想空间中的运动。我们将我们的算法应用于2000-2018年期间的USPTO专利摘要语料库,并证明,相对于USPTO的专利类别分类,我们的算法提供了创新主题转移的领先指标,并能够更精确地分析思想空间的运动。使用这些措施很重要,因为它可以更准确地估计创新的方向,从而更准确地估计经济行为体对竞争环境和公共政策的反应。我们分享我们的算法,它可以应用于其他创新文本语料库,以及我们开发的专利数据和措施,目的是促进有关创新方向的额外查询。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信