Using self-generated identification codes to match anonymous longitudinal data in a sexual health study of secondary school students: a cohort study.

IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS
Edmond Pui Hang Choi, Ellie Bostwick Andres, Heidi Sze Lok Fan, Lai Ming Ho, Alice Wai Chi Fung, Kevin Wing Chung Lau, Neda Hei Tung Ng, Monique Yeung, Janice Mary Johnston
{"title":"Using self-generated identification codes to match anonymous longitudinal data in a sexual health study of secondary school students: a cohort study.","authors":"Edmond Pui Hang Choi, Ellie Bostwick Andres, Heidi Sze Lok Fan, Lai Ming Ho, Alice Wai Chi Fung, Kevin Wing Chung Lau, Neda Hei Tung Ng, Monique Yeung, Janice Mary Johnston","doi":"10.1186/s12911-025-03028-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This study aimed to (i) describe the procedures for generating self-generated identification codes (SGICs) in a prospective longitudinal evaluation of a sexual health program for secondary school students in Hong Kong; (ii) outline the matching strategies and processes; (iii) examine rates of successful matching and associated factors; and (iv) compare the responses of participants whose data could be matched to those whose data could not.</p><p><strong>Methods: </strong>A prospective longitudinal cohort study was conducted. The SGIC comprised a 5-element code with 4 digits and 3 letters. A matching algorithm was developed to link baseline and follow-up data collected from students in Years 1 to 3 (n = 1,064) during the 2019-2020 school year. Matching success and associated factors were analyzed, and responses from matched and unmatched participants were compared.</p><p><strong>Results: </strong>The rate of perfectly matched cases was 49.06%, while 23.59% were partially matched, and 27.35% were unmatched. Logistic regression analysis revealed that male students (adjusted odds ratio [aOR]: 0.63) and Year 1 students (vs. Year 3; aOR: 0.56) were less likely to be perfectly matched. Compared to unmatched cases, perfectly and partially matched cases were less likely to have missing values and more likely to exhibit positive attitudes toward the sexual health program and related topics, such as the importance of sexual health, equal relationships, and condom use.</p><p><strong>Conclusion: </strong>The use of SGICs successfully matched approximately 72.65% of the study sample over a one-year period. These findings highlight the potential of SGICs as a tool for longitudinal data matching while underscoring the need for further refinement of code generation processes and matching algorithms to minimize data wastage and improve effectiveness.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"201"},"PeriodicalIF":3.3000,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12131654/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03028-1","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: This study aimed to (i) describe the procedures for generating self-generated identification codes (SGICs) in a prospective longitudinal evaluation of a sexual health program for secondary school students in Hong Kong; (ii) outline the matching strategies and processes; (iii) examine rates of successful matching and associated factors; and (iv) compare the responses of participants whose data could be matched to those whose data could not.

Methods: A prospective longitudinal cohort study was conducted. The SGIC comprised a 5-element code with 4 digits and 3 letters. A matching algorithm was developed to link baseline and follow-up data collected from students in Years 1 to 3 (n = 1,064) during the 2019-2020 school year. Matching success and associated factors were analyzed, and responses from matched and unmatched participants were compared.

Results: The rate of perfectly matched cases was 49.06%, while 23.59% were partially matched, and 27.35% were unmatched. Logistic regression analysis revealed that male students (adjusted odds ratio [aOR]: 0.63) and Year 1 students (vs. Year 3; aOR: 0.56) were less likely to be perfectly matched. Compared to unmatched cases, perfectly and partially matched cases were less likely to have missing values and more likely to exhibit positive attitudes toward the sexual health program and related topics, such as the importance of sexual health, equal relationships, and condom use.

Conclusion: The use of SGICs successfully matched approximately 72.65% of the study sample over a one-year period. These findings highlight the potential of SGICs as a tool for longitudinal data matching while underscoring the need for further refinement of code generation processes and matching algorithms to minimize data wastage and improve effectiveness.

在中学生性健康研究中使用自生成识别码匹配匿名纵向数据:一项队列研究。
目的:本研究旨在(i)描述在香港中学生性健康计划的前瞻性纵向评估中产生自我生成识别码(SGICs)的程序;(ii)概述配对策略和程序;(iii)检查配对成功率及相关因素;(iv)比较数据可以匹配的参与者和数据不能匹配的参与者的反应。方法:采用前瞻性纵向队列研究。SGIC由5个元素组成,包含4位数字和3个字母。开发了一种匹配算法,将2019-2020学年从一年级至三年级学生(n = 1,064)收集的基线和随访数据联系起来。分析配对成功和相关因素,比较配对和未配对参与者的反应。结果:完全匹配率为49.06%,部分匹配率为23.59%,不匹配率为27.35%。Logistic回归分析显示男生(调整优势比[aOR]: 0.63)和一年级学生(相对于三年级;(or: 0.56)不太可能完全匹配。与未匹配的案例相比,完全匹配和部分匹配的案例不太可能有缺失的价值观,更有可能对性健康项目和相关话题表现出积极的态度,比如性健康的重要性、平等的关系和避孕套的使用。结论:在一年的时间里,sgic的使用成功匹配了大约72.65%的研究样本。这些发现突出了sgic作为纵向数据匹配工具的潜力,同时强调需要进一步改进代码生成过程和匹配算法,以最大限度地减少数据浪费并提高效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信