Juan Osvaldo Talavera, Ivonne Roy-García, Sofía Teresa Díaz-Torres, Lino Palacios-Cruz, Alejandro Noguez-Ramos, Marcela Pérez-Rodríguez, Miguel Ángel Martínez, Jessica E Silva-Guzmán, Rodolfo Rivas-Ruiz
{"title":"[Numerical expression of the clinical course of the disease. Data management].","authors":"Juan Osvaldo Talavera, Ivonne Roy-García, Sofía Teresa Díaz-Torres, Lino Palacios-Cruz, Alejandro Noguez-Ramos, Marcela Pérez-Rodríguez, Miguel Ángel Martínez, Jessica E Silva-Guzmán, Rodolfo Rivas-Ruiz","doi":"10.5281/zenodo.8319834","DOIUrl":null,"url":null,"abstract":"<p><p>Data management \"behind the scenes\" refers to collection, cleaning, imputation, and demarcation; and despite of being indispensable processes, they are usually neglected and thus, generate erroneous information. During the collection are errors: omission of covariates, deviation from the objective, and insufficient quality. The omission of covariates distorts the result attributed to the main manoeuvre. Deviation from the primary objective commonly occurs when the outcome is rare, delayed, or subjective and promotes substitution by non-equivalent surrogate variables. Moreover, insufficient quality occurs due to inadequate instruments, omission of the measurement procedure, or measurements out of context, such as attribution at the wrong time or equivalent. Furthermore, cleaning implies identifying erroneous, extreme, and missing values, which may or may not be imputed, depending on the percentage. The values of the manoeuvre or the outcome are never imputed, nor are patients eliminated due to a lack of values. Finally, the demarcation of each variable seeks to give it a clinical meaning about the outcome, for which a hierarchical sequence of criteria is followed: 1) previous clinical study, 2) expert agreement, 3) clinical judgment of the investigator/investigators, and 4) statistics. Acting without quality controls in data management frequently causes involuntary lies and confuses instead of clarifying.</p>","PeriodicalId":94200,"journal":{"name":"Revista medica del Instituto Mexicano del Seguro Social","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10756149/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Revista medica del Instituto Mexicano del Seguro Social","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/zenodo.8319834","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Data management "behind the scenes" refers to collection, cleaning, imputation, and demarcation; and despite of being indispensable processes, they are usually neglected and thus, generate erroneous information. During the collection are errors: omission of covariates, deviation from the objective, and insufficient quality. The omission of covariates distorts the result attributed to the main manoeuvre. Deviation from the primary objective commonly occurs when the outcome is rare, delayed, or subjective and promotes substitution by non-equivalent surrogate variables. Moreover, insufficient quality occurs due to inadequate instruments, omission of the measurement procedure, or measurements out of context, such as attribution at the wrong time or equivalent. Furthermore, cleaning implies identifying erroneous, extreme, and missing values, which may or may not be imputed, depending on the percentage. The values of the manoeuvre or the outcome are never imputed, nor are patients eliminated due to a lack of values. Finally, the demarcation of each variable seeks to give it a clinical meaning about the outcome, for which a hierarchical sequence of criteria is followed: 1) previous clinical study, 2) expert agreement, 3) clinical judgment of the investigator/investigators, and 4) statistics. Acting without quality controls in data management frequently causes involuntary lies and confuses instead of clarifying.