{"title":"That's Classified! Inventing a New Patent Taxonomy","authors":"S. Billington, Alan Hanna","doi":"10.2139/ssrn.3606142","DOIUrl":null,"url":null,"abstract":"We investigate how patent classification influences the interpretation of patent statistics. Innovation researchers currently make use of various patent classification schemas, which are hard to replicate. Using machine learning techniques, we construct a transparent, replicable and adaptable patent taxonomy, and a new automated methodology for classifying patents. We then contrast our new schema with existing ones using a long-run historical patent dataset. We find quantitative analysis of patent characteristics are sensitive to the choice of classification; our interpretation of regression coefficients is schema-dependent. We suggest much of the innovation literature should be carefully interpreted in light of our findings.","PeriodicalId":125544,"journal":{"name":"ERN: Intellectual Property (Topic)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERN: Intellectual Property (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3606142","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
We investigate how patent classification influences the interpretation of patent statistics. Innovation researchers currently make use of various patent classification schemas, which are hard to replicate. Using machine learning techniques, we construct a transparent, replicable and adaptable patent taxonomy, and a new automated methodology for classifying patents. We then contrast our new schema with existing ones using a long-run historical patent dataset. We find quantitative analysis of patent characteristics are sensitive to the choice of classification; our interpretation of regression coefficients is schema-dependent. We suggest much of the innovation literature should be carefully interpreted in light of our findings.