José Carlos Ferrão, Mónica Duarte Oliveira, Filipe Janela, Henrique M G Martins, Daniel Gartner
Health Systems, vol. 10, no. 2, pp. 138-161. Published 2020-03-01. DOI: https://doi.org/10.1080/20476965.2020.1729666
Can structured EHR data support clinical coding? A data mining approach.
Structured data formats are gaining momentum in electronic health records and can be leveraged for decision support and research. Nevertheless, such formats have not been explored for clinical coding, an essential process that requires significant manual workload in health organisations. This article explores the extent to which fully structured clinical data can support the assignment of clinical codes to inpatient episodes, through a methodology that tackles high dimensionality, addresses the multi-label nature of coding and optimises model parameters. The methodology encompasses transforming raw data to define a feature set, building a data matrix representation, and testing combinations of feature selection methods with machine learning models to predict code assignment. The methodology was tested with a real hospital dataset and showed varying predictive power across codes, while demonstrating the potential of leveraging structured data to reduce workload and increase efficiency in clinical coding.
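
The abstract does not name the specific feature selection methods or models used, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a binary episode-by-feature data matrix, treats the multi-label problem as one binary problem per code (a common decomposition, assumed here), and ranks features per code with a chi-squared statistic (also an assumption). The episodes, feature names and code labels are hypothetical toy data.

```python
# Hypothetical toy episodes: structured EHR items and the codes assigned to each.
episodes = [
    {"items": {"lab:glucose_high", "drug:insulin"}, "codes": {"E11"}},
    {"items": {"lab:glucose_high", "drug:metformin", "drug:aspirin"}, "codes": {"E11"}},
    {"items": {"proc:ecg", "drug:aspirin"}, "codes": {"I21"}},
    {"items": {"proc:ecg", "drug:insulin", "lab:glucose_high"}, "codes": {"E11", "I21"}},
    {"items": {"drug:aspirin"}, "codes": set()},
]


def build_matrix(episodes):
    """Return the sorted feature list and a binary episode-by-feature matrix."""
    features = sorted({f for e in episodes for f in e["items"]})
    X = [[1 if f in e["items"] else 0 for f in features] for e in episodes]
    return features, X


def label_vector(episodes, code):
    """One binary target per code: 1 if the episode was assigned that code."""
    return [1 if code in e["codes"] else 0 for e in episodes]


def chi2_score(column, y):
    """Chi-squared statistic of the 2x2 table between one feature and one code."""
    n = len(y)
    a = sum(1 for x, t in zip(column, y) if x == 1 and t == 1)
    b = sum(1 for x, t in zip(column, y) if x == 1 and t == 0)
    c = sum(1 for x, t in zip(column, y) if x == 0 and t == 1)
    d = n - a - b - c
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return 0.0 if denom == 0 else n * (a * d - b * c) ** 2 / denom


def select_features(features, X, y, k):
    """Keep the k highest-scoring features for one code (ties broken by name)."""
    scored = [
        (chi2_score([row[j] for row in X], y), name)
        for j, name in enumerate(features)
    ]
    scored.sort(key=lambda s: (-s[0], s[1]))
    return [name for _, name in scored[:k]]


features, X = build_matrix(episodes)
for code in ("E11", "I21"):
    y = label_vector(episodes, code)
    print(code, select_features(features, X, y, k=1))
```

In a fuller pipeline, the reduced feature set for each code would feed a separate classifier, and the number of retained features `k` would be one of the parameters tuned per code.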