Yinlei Lei , Yu Deng , Ruocheng Xia , Baoyan Xie , Zhenchen Yang , Shuangyun Xi , Pengyu Chen , Ruiyang Tao
{"title":"Full-length 16S rRNA-based exploration of body site-specific bacterial signatures for origin determination and individual identification","authors":"Yinlei Lei , Yu Deng , Ruocheng Xia , Baoyan Xie , Zhenchen Yang , Shuangyun Xi , Pengyu Chen , Ruiyang Tao","doi":"10.1016/j.forsciint.2025.112475","DOIUrl":null,"url":null,"abstract":"<div><div>When the available human-derived information at a crime scene is limited, it poses challenges in determining the origin of the biological materials and identifying their donors. In this context, microorganisms have gradually emerged as a valuable complementary tool. Nowadays, the application of third-generation sequencing technology for full-length 16S rRNA sequencing to explore the specific bacterial biomarkers in various biological materials holds significant research and practical value. In this study, we performed full-length 16S rRNA gene sequencing on sterile swabs from palmar skin, oral mucosa, and nasal cavity using the PacBio single-molecule real-time sequencing (SMRT) platform. Alongside identifying specific bacterial biomarkers for these biological materials from different body sites, the study also preliminarily explored the specific bacterial taxa in 19 individuals at the phylum, genus, and species levels. The results showed that the palmar skin bacteria primarily consist of <em>Cutibacterium</em>, <em>Staphylococcus</em>, and <em>Streptococcus</em>, the oral mucosal bacteria are dominated by <em>Streptococcus</em>, <em>Neisseria</em>, and <em>Haemophilus</em>, while the dominant bacteria in nasal cavity are <em>Staphylococcus</em> and <em>Cutibacterium</em>. Beta diversity analysis revealed significant differences in the bacterial community composition across the three origins of biological materials. Furthermore, classification models based on the bacterial species were constructed using the Random Forest, XGBoost, and KNN algorithms. The results showed that both Random Forest and XGBoost models achieved an accuracy of 97 %, significantly outperforming the KNN model (79 %). The prediction accuracy at the OTU level was comparable to that at the species level. In addition, bacterial community differences between individuals were observed at both the genus and species levels. Overall, this study further explores the potential of classification prediction methods based on bacterial features for distinguishing the body site origins of different biological materials and enabling individual traceability, thereby providing valuable data to support the application of microbiological techniques in forensic practice.</div></div>","PeriodicalId":12341,"journal":{"name":"Forensic science international","volume":"371 ","pages":"Article 112475"},"PeriodicalIF":2.2000,"publicationDate":"2025-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Forensic science international","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0379073825001136","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, LEGAL","Score":null,"Total":0}
引用次数: 0
Abstract
When the available human-derived information at a crime scene is limited, it poses challenges in determining the origin of the biological materials and identifying their donors. In this context, microorganisms have gradually emerged as a valuable complementary tool. Nowadays, the application of third-generation sequencing technology for full-length 16S rRNA sequencing to explore the specific bacterial biomarkers in various biological materials holds significant research and practical value. In this study, we performed full-length 16S rRNA gene sequencing on sterile swabs from palmar skin, oral mucosa, and nasal cavity using the PacBio single-molecule real-time sequencing (SMRT) platform. Alongside identifying specific bacterial biomarkers for these biological materials from different body sites, the study also preliminarily explored the specific bacterial taxa in 19 individuals at the phylum, genus, and species levels. The results showed that the palmar skin bacteria primarily consist of Cutibacterium, Staphylococcus, and Streptococcus, the oral mucosal bacteria are dominated by Streptococcus, Neisseria, and Haemophilus, while the dominant bacteria in nasal cavity are Staphylococcus and Cutibacterium. Beta diversity analysis revealed significant differences in the bacterial community composition across the three origins of biological materials. Furthermore, classification models based on the bacterial species were constructed using the Random Forest, XGBoost, and KNN algorithms. The results showed that both Random Forest and XGBoost models achieved an accuracy of 97 %, significantly outperforming the KNN model (79 %). The prediction accuracy at the OTU level was comparable to that at the species level. In addition, bacterial community differences between individuals were observed at both the genus and species levels. Overall, this study further explores the potential of classification prediction methods based on bacterial features for distinguishing the body site origins of different biological materials and enabling individual traceability, thereby providing valuable data to support the application of microbiological techniques in forensic practice.
期刊介绍:
Forensic Science International is the flagship journal in the prestigious Forensic Science International family, publishing the most innovative, cutting-edge, and influential contributions across the forensic sciences. Fields include: forensic pathology and histochemistry, chemistry, biochemistry and toxicology, biology, serology, odontology, psychiatry, anthropology, digital forensics, the physical sciences, firearms, and document examination, as well as investigations of value to public health in its broadest sense, and the important marginal area where science and medicine interact with the law.
The journal publishes:
Case Reports
Commentaries
Letters to the Editor
Original Research Papers (Regular Papers)
Rapid Communications
Review Articles
Technical Notes.