Akito Y Kawahara, Caroline G Storer, Amanda Markee, Jacqueline Heckenhauer, Ashlyn Powell, David Plotkin, Scott Hotaling, Timothy P Cleland, Rebecca B Dikow, Torsten Dikow, Ryoichi B Kuranishi, Rebeccah Messcher, Steffen U Pauls, Russell J Stewart, Koji Tojo, Paul B Frandsen
{"title":"Long-read HiFi sequencing correctly assembles repetitive <i>heavy fibroin</i> silk genes in new moth and caddisfly genomes.","authors":"Akito Y Kawahara, Caroline G Storer, Amanda Markee, Jacqueline Heckenhauer, Ashlyn Powell, David Plotkin, Scott Hotaling, Timothy P Cleland, Rebecca B Dikow, Torsten Dikow, Ryoichi B Kuranishi, Rebeccah Messcher, Steffen U Pauls, Russell J Stewart, Koji Tojo, Paul B Frandsen","doi":"10.46471/gigabyte.64","DOIUrl":null,"url":null,"abstract":"<p><p>Insect silk is a versatile biomaterial. Lepidoptera and Trichoptera display some of the most diverse uses of silk, with varying strength, adhesive qualities, and elastic properties. Silk fibroin genes are long (>20 Kbp), with many repetitive motifs that make them challenging to sequence. Most research thus far has focused on conserved N- and C-terminal regions of fibroin genes because a full comparison of repetitive regions across taxa has not been possible. Using the PacBio Sequel II system and SMRT sequencing, we generated high fidelity (HiFi) long-read genomic and transcriptomic sequences for the Indianmeal moth (<i>Plodia interpunctella</i>) and genomic sequences for the caddisfly <i>Eubasilissa regina</i>. Both genomes were highly contiguous (N50 = 9.7 Mbp/32.4 Mbp, L50 = 13/11) and complete (BUSCO complete = 99.3%/95.2%), with complete and contiguous recovery of silk <i>heavy fibroin</i> gene sequences. We show that HiFi long-read sequencing is helpful for understanding genes with long, repetitive regions.</p>","PeriodicalId":73157,"journal":{"name":"GigaByte (Hong Kong, China)","volume":"2022 ","pages":"gigabyte64"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9693786/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"GigaByte (Hong Kong, China)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46471/gigabyte.64","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Insect silk is a versatile biomaterial. Lepidoptera and Trichoptera display some of the most diverse uses of silk, with varying strength, adhesive qualities, and elastic properties. Silk fibroin genes are long (>20 Kbp), with many repetitive motifs that make them challenging to sequence. Most research thus far has focused on conserved N- and C-terminal regions of fibroin genes because a full comparison of repetitive regions across taxa has not been possible. Using the PacBio Sequel II system and SMRT sequencing, we generated high fidelity (HiFi) long-read genomic and transcriptomic sequences for the Indianmeal moth (Plodia interpunctella) and genomic sequences for the caddisfly Eubasilissa regina. Both genomes were highly contiguous (N50 = 9.7 Mbp/32.4 Mbp, L50 = 13/11) and complete (BUSCO complete = 99.3%/95.2%), with complete and contiguous recovery of silk heavy fibroin gene sequences. We show that HiFi long-read sequencing is helpful for understanding genes with long, repetitive regions.