{"title":"Wave-LSTM: Multi-scale analysis of somatic whole genome copy number profiles","authors":"Charles Gadd, Christopher Yau","doi":"arxiv-2408.12636","DOIUrl":null,"url":null,"abstract":"Changes in the number of copies of certain parts of the genome, known as copy\nnumber alterations (CNAs), due to somatic mutation processes are a hallmark of\nmany cancers. This genomic complexity is known to be associated with poorer\noutcomes for patients but describing its contribution in detail has been\ndifficult. Copy number alterations can affect large regions spanning whole\nchromosomes or the entire genome itself but can also be localised to only small\nsegments of the genome and no methods exist that allow this multi-scale nature\nto be quantified. In this paper, we address this using Wave-LSTM, a signal\ndecomposition approach designed to capture the multi-scale structure of complex\nwhole genome copy number profiles. Using wavelet-based source separation in\ncombination with deep learning-based attention mechanisms. We show that\nWave-LSTM can be used to derive multi-scale representations from copy number\nprofiles which can be used to decipher sub-clonal structures from single-cell\ncopy number data and to improve survival prediction performance from patient\ntumour profiles.","PeriodicalId":501070,"journal":{"name":"arXiv - QuanBio - Genomics","volume":"86 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.12636","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Changes in the number of copies of certain parts of the genome, known as copy
number alterations (CNAs), due to somatic mutation processes are a hallmark of
many cancers. This genomic complexity is known to be associated with poorer
outcomes for patients but describing its contribution in detail has been
difficult. Copy number alterations can affect large regions spanning whole
chromosomes or the entire genome itself but can also be localised to only small
segments of the genome and no methods exist that allow this multi-scale nature
to be quantified. In this paper, we address this using Wave-LSTM, a signal
decomposition approach designed to capture the multi-scale structure of complex
whole genome copy number profiles. Using wavelet-based source separation in
combination with deep learning-based attention mechanisms. We show that
Wave-LSTM can be used to derive multi-scale representations from copy number
profiles which can be used to decipher sub-clonal structures from single-cell
copy number data and to improve survival prediction performance from patient
tumour profiles.