From castaways to discoveries: unveiling treasures in skin RNAseq using a novel multidimensional data processing workflow including infection-host dynamics.
Disha Bhargava, Alec Labadie, Rebecca L Hanson-Rios-Stutz, Austin Goodyke, Ella M Moses, Akansha S Das, Sophie Vanderweele, Janelle V Lemon, Taylor W Cook, David Pearson, Joseph M Redinger, Adam J Caulfield, Rosemary Olivero, Kate Foster, Kurt Ashack, Surender Rajasekaran, Caleb P Bupp, Timothy J Triche, Connie M Krawczyk, Dave Chesla, Matthew D Sims, Nicholas L Hartog, Jeremy W Prokop
{"title":"From castaways to discoveries: unveiling treasures in skin RNAseq using a novel multidimensional data processing workflow including infection-host dynamics.","authors":"Disha Bhargava, Alec Labadie, Rebecca L Hanson-Rios-Stutz, Austin Goodyke, Ella M Moses, Akansha S Das, Sophie Vanderweele, Janelle V Lemon, Taylor W Cook, David Pearson, Joseph M Redinger, Adam J Caulfield, Rosemary Olivero, Kate Foster, Kurt Ashack, Surender Rajasekaran, Caleb P Bupp, Timothy J Triche, Connie M Krawczyk, Dave Chesla, Matthew D Sims, Nicholas L Hartog, Jeremy W Prokop","doi":"10.1152/physiolgenomics.00093.2024","DOIUrl":null,"url":null,"abstract":"<p><p>Defining physiology and methods to measure biological mechanisms is essential. Extensive datasets such as RNA sequencing are used with little analysis of the knowledge gained from the various methodologies. Within this work, we have processed publicly available NCBI RNAseq datasets using a combination of bioinformatics tools for the largest physiological organ, the skin. In many datasets, we identify the quality of the sample, human transcript mapping, the sex of each sample, foreign RNA from bacteria/viruses/protists, and the presence of B/T-cell immune repertoire. Processing 8,274 samples from 132 different experiments for skin samples identifies common flora of skin with elevation of protists (such as <i>Leishmania</i>), bacteria (<i>Staphylococcus</i>, <i>Cutibacterium acnes</i>), and viruses [Human alphaherpesvirus (HSV), Human papillomavirus (HPV)] that may be involved in physiological differences. We observed samples with the Heilongjiang tick virus, human T-cell leukemia virus type I, and equine infectious anemia virus that likely play pathological roles in physiology. Integrating the various biomarkers identified five ideal datasets for skin pathologies that elucidated a novel correlation between the normal skin flora bacterium <i>Bacillus megaterium</i> with major histocompatibility complex (MHC) regulation and the immune repertoire clonal expansion, particularly in patients with hidradenitis suppurativa. Finally, we show that in multiple independent experiments, biological sex is associated with multiple sex chromosome gene differences, highlighting the importance of future work in studying sex differences in skin. Data integrations and multidimensional data mapping are critical for physiological omics advancements, and this work highlights the exciting ability to apply these tools to skin physiology.<b>NEW & NOTEWORTHY</b> Complex bioinformatics mapping to skin RNA sequencing datasets can simultaneously map biological sex, skin-specific genes, bacteria, viruses, protists, and the acquired immune response. The integration of these datasets elucidated bacterial signatures from common skin flora while identifying novel insights on <i>Bacillus megaterium</i> in the acquired immune response and novel viral signatures for Heilongjiang tick virus and equine infectious anemia virus.</p>","PeriodicalId":20129,"journal":{"name":"Physiological genomics","volume":" ","pages":"343-356"},"PeriodicalIF":2.5000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physiological genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1152/physiolgenomics.00093.2024","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/12 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Defining physiology and methods to measure biological mechanisms is essential. Extensive datasets such as RNA sequencing are used with little analysis of the knowledge gained from the various methodologies. Within this work, we have processed publicly available NCBI RNAseq datasets using a combination of bioinformatics tools for the largest physiological organ, the skin. In many datasets, we identify the quality of the sample, human transcript mapping, the sex of each sample, foreign RNA from bacteria/viruses/protists, and the presence of B/T-cell immune repertoire. Processing 8,274 samples from 132 different experiments for skin samples identifies common flora of skin with elevation of protists (such as Leishmania), bacteria (Staphylococcus, Cutibacterium acnes), and viruses [Human alphaherpesvirus (HSV), Human papillomavirus (HPV)] that may be involved in physiological differences. We observed samples with the Heilongjiang tick virus, human T-cell leukemia virus type I, and equine infectious anemia virus that likely play pathological roles in physiology. Integrating the various biomarkers identified five ideal datasets for skin pathologies that elucidated a novel correlation between the normal skin flora bacterium Bacillus megaterium with major histocompatibility complex (MHC) regulation and the immune repertoire clonal expansion, particularly in patients with hidradenitis suppurativa. Finally, we show that in multiple independent experiments, biological sex is associated with multiple sex chromosome gene differences, highlighting the importance of future work in studying sex differences in skin. Data integrations and multidimensional data mapping are critical for physiological omics advancements, and this work highlights the exciting ability to apply these tools to skin physiology.NEW & NOTEWORTHY Complex bioinformatics mapping to skin RNA sequencing datasets can simultaneously map biological sex, skin-specific genes, bacteria, viruses, protists, and the acquired immune response. The integration of these datasets elucidated bacterial signatures from common skin flora while identifying novel insights on Bacillus megaterium in the acquired immune response and novel viral signatures for Heilongjiang tick virus and equine infectious anemia virus.
期刊介绍:
The Physiological Genomics publishes original papers, reviews and rapid reports in a wide area of research focused on uncovering the links between genes and physiology at all levels of biological organization. Articles on topics ranging from single genes to the whole genome and their links to the physiology of humans, any model organism, organ, tissue or cell are welcome. Areas of interest include complex polygenic traits preferably of importance to human health and gene-function relationships of disease processes. Specifically, the Journal has dedicated Sections focused on genome-wide association studies (GWAS) to function, cardiovascular, renal, metabolic and neurological systems, exercise physiology, pharmacogenomics, clinical, translational and genomics for precision medicine, comparative and statistical genomics and databases. For further details on research themes covered within these Sections, please refer to the descriptions given under each Section.