Wei Xu , Nan Zhang , Hongxun Jiang , Shaokun Fan , Bin Zhu
{"title":"在灰烬中挖掘黄金:在大量无利可图的专利中识别睡美人","authors":"Wei Xu , Nan Zhang , Hongxun Jiang , Shaokun Fan , Bin Zhu","doi":"10.1016/j.joi.2025.101674","DOIUrl":null,"url":null,"abstract":"<div><div>This paper proposes an innovative deep-learning framework with multi-modal features to determine whether a currently unprofitable patent is a sleeping beauty at an early stage. Patent features include the textual content as well as the networked background information, such as the inventors and assignees, as well as the previous works they have created. The framework uses a Transformer to compare the patent with news or analytical reports concerning technological development trends, mining its content both semantically and syntactically. An active graphical convolutional network, mining the innovation collaboration network of a patent, is also employed as part of the framework to reveal the relationship between patents, companies, and inventors. This framework finally utilizes the obtained features to construct a multi-head self-attention model to predict a patent with the probability of being a sleeping beauty. This paper examines the proposed model by comparing it to several well-known baseline methods using real-world cases from the United States Patent and Trademark Office (USPTO). The proposed deep learning solution outperforms all baseline methods according to all performance metrics. Its long-term forecasting accuracy significantly exceeds its rivals. In the ablation experiments, features extracted from texts and networks are shown to improve the performance of prediction models.</div></div>","PeriodicalId":48662,"journal":{"name":"Journal of Informetrics","volume":"19 3","pages":"Article 101674"},"PeriodicalIF":3.4000,"publicationDate":"2025-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Uncovering gold in ash: identifying sleeping beauties among massive unprofitable patents\",\"authors\":\"Wei Xu , Nan Zhang , Hongxun Jiang , Shaokun Fan , Bin Zhu\",\"doi\":\"10.1016/j.joi.2025.101674\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper proposes an innovative deep-learning framework with multi-modal features to determine whether a currently unprofitable patent is a sleeping beauty at an early stage. Patent features include the textual content as well as the networked background information, such as the inventors and assignees, as well as the previous works they have created. The framework uses a Transformer to compare the patent with news or analytical reports concerning technological development trends, mining its content both semantically and syntactically. An active graphical convolutional network, mining the innovation collaboration network of a patent, is also employed as part of the framework to reveal the relationship between patents, companies, and inventors. This framework finally utilizes the obtained features to construct a multi-head self-attention model to predict a patent with the probability of being a sleeping beauty. This paper examines the proposed model by comparing it to several well-known baseline methods using real-world cases from the United States Patent and Trademark Office (USPTO). The proposed deep learning solution outperforms all baseline methods according to all performance metrics. Its long-term forecasting accuracy significantly exceeds its rivals. In the ablation experiments, features extracted from texts and networks are shown to improve the performance of prediction models.</div></div>\",\"PeriodicalId\":48662,\"journal\":{\"name\":\"Journal of Informetrics\",\"volume\":\"19 3\",\"pages\":\"Article 101674\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Informetrics\",\"FirstCategoryId\":\"91\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1751157725000380\",\"RegionNum\":2,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Informetrics","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1751157725000380","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Uncovering gold in ash: identifying sleeping beauties among massive unprofitable patents
This paper proposes an innovative deep-learning framework with multi-modal features to determine whether a currently unprofitable patent is a sleeping beauty at an early stage. Patent features include the textual content as well as the networked background information, such as the inventors and assignees, as well as the previous works they have created. The framework uses a Transformer to compare the patent with news or analytical reports concerning technological development trends, mining its content both semantically and syntactically. An active graphical convolutional network, mining the innovation collaboration network of a patent, is also employed as part of the framework to reveal the relationship between patents, companies, and inventors. This framework finally utilizes the obtained features to construct a multi-head self-attention model to predict a patent with the probability of being a sleeping beauty. This paper examines the proposed model by comparing it to several well-known baseline methods using real-world cases from the United States Patent and Trademark Office (USPTO). The proposed deep learning solution outperforms all baseline methods according to all performance metrics. Its long-term forecasting accuracy significantly exceeds its rivals. In the ablation experiments, features extracted from texts and networks are shown to improve the performance of prediction models.
期刊介绍:
Journal of Informetrics (JOI) publishes rigorous high-quality research on quantitative aspects of information science. The main focus of the journal is on topics in bibliometrics, scientometrics, webometrics, patentometrics, altmetrics and research evaluation. Contributions studying informetric problems using methods from other quantitative fields, such as mathematics, statistics, computer science, economics and econometrics, and network science, are especially encouraged. JOI publishes both theoretical and empirical work. In general, case studies, for instance a bibliometric analysis focusing on a specific research field or a specific country, are not considered suitable for publication in JOI, unless they contain innovative methodological elements.