专利权利要求分解以改进信息提取

Current Challenges in Patent Information Retrieval Pub Date : 2009-11-06 DOI:10.1145/1651343.1651351

Peter Parapatics, M. Dittenbach

{"title":"专利权利要求分解以改进信息提取","authors":"Peter Parapatics, M. Dittenbach","doi":"10.1145/1651343.1651351","DOIUrl":null,"url":null,"abstract":"In several application domains research in natural language processing and information extraction has spawned valuable tools that support humans in structuring, aggregating and managing large amounts of information available as text. Patent claims, although subject to a number of rigid constraints and therefore forced into foreseeable structures, are written in a language even good parsing algorithms tend to fail miserably at. This is primarily caused by long and complex sentences that are a concatenation of a multitude of descriptive elements. We present an approach to split patent claims into several parts in order to improve parsing performance for further automatic processing.","PeriodicalId":231312,"journal":{"name":"Current Challenges in Patent Information Retrieval","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":"{\"title\":\"Patent claim decomposition for improved information extraction\",\"authors\":\"Peter Parapatics, M. Dittenbach\",\"doi\":\"10.1145/1651343.1651351\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In several application domains research in natural language processing and information extraction has spawned valuable tools that support humans in structuring, aggregating and managing large amounts of information available as text. Patent claims, although subject to a number of rigid constraints and therefore forced into foreseeable structures, are written in a language even good parsing algorithms tend to fail miserably at. This is primarily caused by long and complex sentences that are a concatenation of a multitude of descriptive elements. We present an approach to split patent claims into several parts in order to improve parsing performance for further automatic processing.\",\"PeriodicalId\":231312,\"journal\":{\"name\":\"Current Challenges in Patent Information Retrieval\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-11-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"28\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Current Challenges in Patent Information Retrieval\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/1651343.1651351\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Challenges in Patent Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1651343.1651351","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 28

摘要

在一些应用领域，对自然语言处理和信息提取的研究已经催生了一些有价值的工具，这些工具支持人类构建、聚合和管理大量可用的文本信息。专利权利要求虽然受到许多严格的约束，因此被迫进入可预见的结构，但它们是用一种即使是好的解析算法也容易失败的语言编写的。这主要是由长而复杂的句子引起的，这些句子是由大量描述性元素串联而成的。我们提出了一种将专利权利要求分割成几个部分的方法，以提高进一步自动处理的解析性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Patent claim decomposition for improved information extraction

In several application domains research in natural language processing and information extraction has spawned valuable tools that support humans in structuring, aggregating and managing large amounts of information available as text. Patent claims, although subject to a number of rigid constraints and therefore forced into foreseeable structures, are written in a language even good parsing algorithms tend to fail miserably at. This is primarily caused by long and complex sentences that are a concatenation of a multitude of descriptive elements. We present an approach to split patent claims into several parts in order to improve parsing performance for further automatic processing.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Current Challenges in Patent Information Retrieval

自引率

0.00%

发文量