应用解释模型识别体液混合物。

IF 3.1

Forensic science international. Genetics Pub Date : 2025-10-04 DOI:10.1016/j.fsigen.2025.103362

Courtney R H Lynch, Zhijian Wen, James M Curran

{"title":"应用解释模型识别体液混合物。","authors":"Courtney R H Lynch, Zhijian Wen, James M Curran","doi":"10.1016/j.fsigen.2025.103362","DOIUrl":null,"url":null,"abstract":"Confirmatory body fluid identification via the detection of specific messenger RNA (mRNA) species is important in circumstances where a conventional test is not available for the questioned fluid type (such as vaginal material) or when greater sensitivity, or specificity, is required. Samples forwarded for testing will often contain two or more body fluid types from different donors (mixtures). Endpoint reverse-transcription PCR (RT-PCR) or real-time quantitative reverse-transcription PCR (RT-qPCR) are two methods which may be used for confirmatory body fluid identification, with the former currently in use in forensic casework laboratories. The interpretation of such data has been a topic of recent interest, with two main approaches proposed: categorical and probabilistic. Categorical methods, which may use a scoring system or a threshold approach, do not utilise all relevant information. Probabilistic methods are able to incorporate known information and reflect uncertainty as part of classification. This paper explores the use of multi-class machine learning classifiers to predict the components of body fluids in mixed samples, extending prior work on single-source body fluids. When including mixture profiles within the training data as an additional class, we obtained high prediction accuracy. We also determine that quantitative information is more informative than binary (presence/absence) data for the prediction of body fluids in different mixture ratios. Finally, we investigate the modelling and prediction of an \"unknown\" category, composed of samples which do not have specific features in the data, using two approaches: metric learning and a leave-one-type-out simulation for missing data.","PeriodicalId":94012,"journal":{"name":"Forensic science international. Genetics","volume":"81 ","pages":"103362"},"PeriodicalIF":3.1000,"publicationDate":"2025-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Applying an interpretation model for body fluid mixture identification.\",\"authors\":\"Courtney R H Lynch, Zhijian Wen, James M Curran\",\"doi\":\"10.1016/j.fsigen.2025.103362\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Confirmatory body fluid identification via the detection of specific messenger RNA (mRNA) species is important in circumstances where a conventional test is not available for the questioned fluid type (such as vaginal material) or when greater sensitivity, or specificity, is required. Samples forwarded for testing will often contain two or more body fluid types from different donors (mixtures). Endpoint reverse-transcription PCR (RT-PCR) or real-time quantitative reverse-transcription PCR (RT-qPCR) are two methods which may be used for confirmatory body fluid identification, with the former currently in use in forensic casework laboratories. The interpretation of such data has been a topic of recent interest, with two main approaches proposed: categorical and probabilistic. Categorical methods, which may use a scoring system or a threshold approach, do not utilise all relevant information. Probabilistic methods are able to incorporate known information and reflect uncertainty as part of classification. This paper explores the use of multi-class machine learning classifiers to predict the components of body fluids in mixed samples, extending prior work on single-source body fluids. When including mixture profiles within the training data as an additional class, we obtained high prediction accuracy. We also determine that quantitative information is more informative than binary (presence/absence) data for the prediction of body fluids in different mixture ratios. Finally, we investigate the modelling and prediction of an \\\"unknown\\\" category, composed of samples which do not have specific features in the data, using two approaches: metric learning and a leave-one-type-out simulation for missing data.\",\"PeriodicalId\":94012,\"journal\":{\"name\":\"Forensic science international. Genetics\",\"volume\":\"81 \",\"pages\":\"103362\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-10-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Forensic science international. Genetics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1016/j.fsigen.2025.103362\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Forensic science international. Genetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.fsigen.2025.103362","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

通过检测特定的信使RNA （mRNA）物种进行确认性体液鉴定，在无法使用常规检测方法检测可疑液体类型（如阴道物质）或需要更高灵敏度或特异性的情况下非常重要。送交检测的样本通常含有来自不同供体的两种或两种以上的体液（混合物）。终点反转录PCR （RT-PCR）或实时定量反转录PCR （RT-qPCR）是两种可用于确认性体液鉴定的方法，前者目前在法医案件实验室中使用。对这些数据的解释最近一直是一个有趣的话题，提出了两种主要方法：分类和概率。分类方法可能使用评分系统或阈值方法，但不利用所有相关信息。概率方法能够纳入已知信息，并反映不确定性作为分类的一部分。本文探讨了使用多类机器学习分类器来预测混合样本中体液的成分，扩展了先前对单一来源体液的研究。当在训练数据中加入混合轮廓作为附加类时，我们获得了较高的预测精度。我们还确定，在预测不同混合比例的体液时，定量信息比二元（存在/不存在）数据更具信息性。最后，我们研究了一个“未知”类别的建模和预测，该类别由数据中没有特定特征的样本组成，使用两种方法：度量学习和遗漏数据的留一种模拟。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Applying an interpretation model for body fluid mixture identification.

Confirmatory body fluid identification via the detection of specific messenger RNA (mRNA) species is important in circumstances where a conventional test is not available for the questioned fluid type (such as vaginal material) or when greater sensitivity, or specificity, is required. Samples forwarded for testing will often contain two or more body fluid types from different donors (mixtures). Endpoint reverse-transcription PCR (RT-PCR) or real-time quantitative reverse-transcription PCR (RT-qPCR) are two methods which may be used for confirmatory body fluid identification, with the former currently in use in forensic casework laboratories. The interpretation of such data has been a topic of recent interest, with two main approaches proposed: categorical and probabilistic. Categorical methods, which may use a scoring system or a threshold approach, do not utilise all relevant information. Probabilistic methods are able to incorporate known information and reflect uncertainty as part of classification. This paper explores the use of multi-class machine learning classifiers to predict the components of body fluids in mixed samples, extending prior work on single-source body fluids. When including mixture profiles within the training data as an additional class, we obtained high prediction accuracy. We also determine that quantitative information is more informative than binary (presence/absence) data for the prediction of body fluids in different mixture ratios. Finally, we investigate the modelling and prediction of an "unknown" category, composed of samples which do not have specific features in the data, using two approaches: metric learning and a leave-one-type-out simulation for missing data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Forensic science international. Genetics

自引率

0.00%

发文量