迈向专利实验的新范式:WPI+

IF 1.9 Q2 INFORMATION SCIENCE & LIBRARY SCIENCE
Michail Salampasis , Eleni Kamateri , Vasileios Stamatis , Mihai Lupu , Allan Hanbury , Florina Piroi
{"title":"迈向专利实验的新范式:WPI+","authors":"Michail Salampasis ,&nbsp;Eleni Kamateri ,&nbsp;Vasileios Stamatis ,&nbsp;Mihai Lupu ,&nbsp;Allan Hanbury ,&nbsp;Florina Piroi","doi":"10.1016/j.wpi.2025.102389","DOIUrl":null,"url":null,"abstract":"<div><div>We enhance the WPI patent research collection, which is publicly accessible and free of charge, to facilitate more comparable, transparent, and reproducible experiments. This is accomplished through what we call “soft standardization” advocating the adoption of consistent methods in using the test collection. We offer data statistics, predefined collection subsets, ground-truth data for additional tasks, and open-source tools for using the collection, all on a public GitHub repository. These resources not only relieve researchers from performing essential collection analysis tasks but also implicitly guide them toward sound methods for conducting experiments with the collection. Our initiative is primarily motivated by the goal of enhancing comparability and reproducibility of patent research. This is achieved through the development of a carefully designed resource that will be continuously expanded and maintained. Our work is also driven by the observation that highly integrated Information Retrieval experiment platforms for large scale evaluation are not widely adopted by researchers. We provide examples of how the WPI+ resource/collection can be used for research on multiple patent specific tasks, including prior-art search, patent classification, and summarization. Overall, our work shows that the traditional concept of a test collection—limited to just a corpus, topics, and relevance assessments—can be broadened to support more efficient and reliable scientific experimentation.</div></div>","PeriodicalId":51794,"journal":{"name":"World Patent Information","volume":"83 ","pages":"Article 102389"},"PeriodicalIF":1.9000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards a new paradigm for patent experimentation: WPI+\",\"authors\":\"Michail Salampasis ,&nbsp;Eleni Kamateri ,&nbsp;Vasileios Stamatis ,&nbsp;Mihai Lupu ,&nbsp;Allan Hanbury ,&nbsp;Florina Piroi\",\"doi\":\"10.1016/j.wpi.2025.102389\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>We enhance the WPI patent research collection, which is publicly accessible and free of charge, to facilitate more comparable, transparent, and reproducible experiments. This is accomplished through what we call “soft standardization” advocating the adoption of consistent methods in using the test collection. We offer data statistics, predefined collection subsets, ground-truth data for additional tasks, and open-source tools for using the collection, all on a public GitHub repository. These resources not only relieve researchers from performing essential collection analysis tasks but also implicitly guide them toward sound methods for conducting experiments with the collection. Our initiative is primarily motivated by the goal of enhancing comparability and reproducibility of patent research. This is achieved through the development of a carefully designed resource that will be continuously expanded and maintained. Our work is also driven by the observation that highly integrated Information Retrieval experiment platforms for large scale evaluation are not widely adopted by researchers. We provide examples of how the WPI+ resource/collection can be used for research on multiple patent specific tasks, including prior-art search, patent classification, and summarization. Overall, our work shows that the traditional concept of a test collection—limited to just a corpus, topics, and relevance assessments—can be broadened to support more efficient and reliable scientific experimentation.</div></div>\",\"PeriodicalId\":51794,\"journal\":{\"name\":\"World Patent Information\",\"volume\":\"83 \",\"pages\":\"Article 102389\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2025-09-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"World Patent Information\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0172219025000560\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"INFORMATION SCIENCE & LIBRARY SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Patent Information","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0172219025000560","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INFORMATION SCIENCE & LIBRARY SCIENCE","Score":null,"Total":0}
引用次数: 0

摘要

我们加强了WPI专利研究收集,该收集是公开和免费的,以促进更具可比性、透明度和可重复性的实验。这是通过我们所谓的“软标准化”来实现的,提倡在使用测试集合时采用一致的方法。我们提供数据统计,预定义的集合子集,额外任务的真实数据,以及用于使用集合的开源工具,所有这些都在公共GitHub存储库中。这些资源不仅使研究人员从执行基本的收集分析任务中解脱出来,而且还隐含地指导他们采用合理的方法进行收集实验。我们的倡议主要是为了提高专利研究的可比性和可重复性。这是通过开发一个精心设计的资源来实现的,这个资源将不断扩大和维护。我们的工作也是由于观察到用于大规模评估的高度集成的信息检索实验平台并未被研究人员广泛采用。我们提供了如何将WPI+资源/集合用于多个专利特定任务的研究的示例,包括现有技术搜索、专利分类和摘要。总的来说,我们的工作表明,传统的测试集合概念——仅限于语料库、主题和相关评估——可以被扩展,以支持更有效和可靠的科学实验。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Towards a new paradigm for patent experimentation: WPI+
We enhance the WPI patent research collection, which is publicly accessible and free of charge, to facilitate more comparable, transparent, and reproducible experiments. This is accomplished through what we call “soft standardization” advocating the adoption of consistent methods in using the test collection. We offer data statistics, predefined collection subsets, ground-truth data for additional tasks, and open-source tools for using the collection, all on a public GitHub repository. These resources not only relieve researchers from performing essential collection analysis tasks but also implicitly guide them toward sound methods for conducting experiments with the collection. Our initiative is primarily motivated by the goal of enhancing comparability and reproducibility of patent research. This is achieved through the development of a carefully designed resource that will be continuously expanded and maintained. Our work is also driven by the observation that highly integrated Information Retrieval experiment platforms for large scale evaluation are not widely adopted by researchers. We provide examples of how the WPI+ resource/collection can be used for research on multiple patent specific tasks, including prior-art search, patent classification, and summarization. Overall, our work shows that the traditional concept of a test collection—limited to just a corpus, topics, and relevance assessments—can be broadened to support more efficient and reliable scientific experimentation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
World Patent Information
World Patent Information INFORMATION SCIENCE & LIBRARY SCIENCE-
CiteScore
3.50
自引率
18.50%
发文量
40
期刊介绍: The aim of World Patent Information is to provide a worldwide forum for the exchange of information between people working professionally in the field of Industrial Property information and documentation and to promote the widest possible use of the associated literature. Regular features include: papers concerned with all aspects of Industrial Property information and documentation; new regulations pertinent to Industrial Property information and documentation; short reports on relevant meetings and conferences; bibliographies, together with book and literature reviews.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信