Predicting the emergence of disruptive technologies by comparing with references via soft prompt-aware shared BERT

IF 4.3 3区 材料科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Guoxiu He , Chenxi Lin , Jiayu Ren , Peichen Duan
{"title":"Predicting the emergence of disruptive technologies by comparing with references via soft prompt-aware shared BERT","authors":"Guoxiu He ,&nbsp;Chenxi Lin ,&nbsp;Jiayu Ren ,&nbsp;Peichen Duan","doi":"10.1016/j.joi.2024.101596","DOIUrl":null,"url":null,"abstract":"<div><div>The exponential increase in the annual volume of publications places a significant challenge in assessing the disruptive potential of technologies in new papers. Prior approaches to identifying disruptive technologies based on the accumulation of paper citations are characterized by their limited prospective and time-consuming nature. Moreover, the total citation count fails to capture the intricate network of citations associated with the focal papers. Consequently, we advocate for the utilization of the disruption index instead of depending on citation counts. Particularly, we devise a novel neural network, called Soft Prompt-aware Shared BERT (<strong>SPS-BERT</strong>), to predict the potential technological disruption index of immediately published papers. It incorporates separate soft prompts to enable BERT examining comparative details within a paper's abstract and its references. Additionally, a tailored attention mechanism is employed to intensify the semantic comparison. Based on the enhanced representation derived from BERT, we utilize a linear layer to estimate potential disruption index. Experimental results demonstrate that SPS-BERT outperforms existing state-of-the-art methods in predicting five-year disruption index across the DBLP and PubMed datasets. Additionally, we conduct an evaluation of our model to predict the ten-year disruption index and five-year citation increments, demonstrating its robustness and scalability. Notably, our model's predictions of disruptive technologies, based on papers published in 2022, align with the expert assessments released by MIT, highlighting its practical significance. The code is available at <span><span>https://github.com/ECNU-Text-Computing/SPS-BERT</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":3,"journal":{"name":"ACS Applied Electronic Materials","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2024-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Electronic Materials","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1751157724001081","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

The exponential increase in the annual volume of publications places a significant challenge in assessing the disruptive potential of technologies in new papers. Prior approaches to identifying disruptive technologies based on the accumulation of paper citations are characterized by their limited prospective and time-consuming nature. Moreover, the total citation count fails to capture the intricate network of citations associated with the focal papers. Consequently, we advocate for the utilization of the disruption index instead of depending on citation counts. Particularly, we devise a novel neural network, called Soft Prompt-aware Shared BERT (SPS-BERT), to predict the potential technological disruption index of immediately published papers. It incorporates separate soft prompts to enable BERT examining comparative details within a paper's abstract and its references. Additionally, a tailored attention mechanism is employed to intensify the semantic comparison. Based on the enhanced representation derived from BERT, we utilize a linear layer to estimate potential disruption index. Experimental results demonstrate that SPS-BERT outperforms existing state-of-the-art methods in predicting five-year disruption index across the DBLP and PubMed datasets. Additionally, we conduct an evaluation of our model to predict the ten-year disruption index and five-year citation increments, demonstrating its robustness and scalability. Notably, our model's predictions of disruptive technologies, based on papers published in 2022, align with the expert assessments released by MIT, highlighting its practical significance. The code is available at https://github.com/ECNU-Text-Computing/SPS-BERT.
通过软提示感知共享 BERT 与参考资料进行比较,预测颠覆性技术的出现
每年发表的论文数量呈指数级增长,这给评估新论文中技术的颠覆性潜力带来了巨大挑战。之前基于论文引用积累来识别颠覆性技术的方法具有前瞻性有限和耗时长的特点。此外,总引用次数无法捕捉到与焦点论文相关的错综复杂的引用网络。因此,我们主张使用干扰指数,而不是依赖引用次数。特别是,我们设计了一种名为 "软提示感知共享 BERT(Soft Prompt-aware Shared BERT,SPS-BERT)"的新型神经网络,用于预测即时发表论文的潜在技术中断指数。它结合了单独的软提示,使 BERT 能够检查论文摘要及其参考文献中的比较细节。此外,还采用了量身定制的关注机制来加强语义比较。根据 BERT 得出的增强表示法,我们利用线性层来估计潜在的干扰指数。实验结果表明,SPS-BERT 在预测 DBLP 和 PubMed 数据集的五年中断指数方面优于现有的最先进方法。此外,我们还对预测十年中断指数和五年引文增量的模型进行了评估,证明了该模型的鲁棒性和可扩展性。值得注意的是,我们的模型基于2022年发表的论文对颠覆性技术的预测与麻省理工学院发布的专家评估结果一致,突出了其实际意义。代码可在 https://github.com/ECNU-Text-Computing/SPS-BERT 上获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
4.30%
发文量
567
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信