多尺度泛基因组位点分析

IF 2.9 4区 计算机科学 Q2 COMPUTER SCIENCE, SOFTWARE ENGINEERING
A. van den Brandt, E. Ståhlbom, F.J.M. van Workum, H. van de Wetering, C. Lundström, S. Smit, A. Vilanova
{"title":"多尺度泛基因组位点分析","authors":"A. van den Brandt,&nbsp;E. Ståhlbom,&nbsp;F.J.M. van Workum,&nbsp;H. van de Wetering,&nbsp;C. Lundström,&nbsp;S. Smit,&nbsp;A. Vilanova","doi":"10.1111/cgf.70147","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Comparing gene organization across genomic sequences reveals insights into evolutionary and functional diversity among different organisms and varieties. Performing this task across many sequences, such as from a pangenome, is challenging because of the scale, the density of information, and the inherent variation. Often, analyses are centered on a genomic region of interest—a locus that might be associated with a trait or contain genes within the same family or biological pathway. Within these regions, researchers examine the conservation of gene order and orientation across organisms and assess sequence similarity, along with other gene content features such as gene size, to find biological variations or potential errors in the data. Automated methods in comparative genomics struggle to identify meaningful patterns due to varying and often unknown features of interest, leaving manual, time-intensive, and scalability-challenged visualization as the primary alternative. To address these challenges, we present a multiscale design for studying gene organization within pangenomes, developed in close collaboration with domain experts. Our tool, <i>M<span>ultipla</span></i>, enables users to explore organization at multiple levels of detail in a decluttered manner through layout abstractions, semantic zooming, and layouts with flexible distance definitions and feature selections, combining the advantages of manual and automated methods used in practice. We evaluate the design of <i>M<span>ultipla</span></i> through two pangenomic use cases and conclude with lessons learned from designing multiscale views for pangenomic locus analysis.</p>\n </div>","PeriodicalId":10687,"journal":{"name":"Computer Graphics Forum","volume":"44 3","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/cgf.70147","citationCount":"0","resultStr":"{\"title\":\"Multipla: Multiscale Pangenomic Locus Analysis\",\"authors\":\"A. van den Brandt,&nbsp;E. Ståhlbom,&nbsp;F.J.M. van Workum,&nbsp;H. van de Wetering,&nbsp;C. Lundström,&nbsp;S. Smit,&nbsp;A. Vilanova\",\"doi\":\"10.1111/cgf.70147\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>Comparing gene organization across genomic sequences reveals insights into evolutionary and functional diversity among different organisms and varieties. Performing this task across many sequences, such as from a pangenome, is challenging because of the scale, the density of information, and the inherent variation. Often, analyses are centered on a genomic region of interest—a locus that might be associated with a trait or contain genes within the same family or biological pathway. Within these regions, researchers examine the conservation of gene order and orientation across organisms and assess sequence similarity, along with other gene content features such as gene size, to find biological variations or potential errors in the data. Automated methods in comparative genomics struggle to identify meaningful patterns due to varying and often unknown features of interest, leaving manual, time-intensive, and scalability-challenged visualization as the primary alternative. To address these challenges, we present a multiscale design for studying gene organization within pangenomes, developed in close collaboration with domain experts. Our tool, <i>M<span>ultipla</span></i>, enables users to explore organization at multiple levels of detail in a decluttered manner through layout abstractions, semantic zooming, and layouts with flexible distance definitions and feature selections, combining the advantages of manual and automated methods used in practice. We evaluate the design of <i>M<span>ultipla</span></i> through two pangenomic use cases and conclude with lessons learned from designing multiscale views for pangenomic locus analysis.</p>\\n </div>\",\"PeriodicalId\":10687,\"journal\":{\"name\":\"Computer Graphics Forum\",\"volume\":\"44 3\",\"pages\":\"\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-05-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/cgf.70147\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Graphics Forum\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/cgf.70147\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Graphics Forum","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/cgf.70147","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

比较基因组序列中的基因组织揭示了不同生物和品种之间进化和功能多样性的见解。在许多序列中执行这项任务,例如来自泛基因组的序列,由于规模,信息密度和固有变化而具有挑战性。通常,分析集中在感兴趣的基因组区域——一个可能与特征相关或包含同一家族或生物途径中的基因的位点。在这些区域内,研究人员检查整个生物体的基因顺序和取向的保守性,并评估序列相似性,以及其他基因内容特征(如基因大小),以发现生物学变异或数据中的潜在错误。比较基因组学中的自动化方法很难识别有意义的模式,因为感兴趣的特征变化很大,而且通常是未知的,因此人工、耗时和具有可扩展性挑战的可视化是主要的选择。为了解决这些挑战,我们提出了一个多尺度设计,用于研究泛基因组中的基因组织,与领域专家密切合作开发。我们的工具,Multipla,使用户能够通过布局抽象,语义缩放,灵活的距离定义和特征选择布局,以一种整洁的方式探索多个细节层次的组织,结合了实践中使用的手动和自动化方法的优点。我们通过两个泛基因组用例评估了Multipla的设计,并总结了为泛基因组位点分析设计多尺度视图的经验教训。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Multipla: Multiscale Pangenomic Locus Analysis

Multipla: Multiscale Pangenomic Locus Analysis

Comparing gene organization across genomic sequences reveals insights into evolutionary and functional diversity among different organisms and varieties. Performing this task across many sequences, such as from a pangenome, is challenging because of the scale, the density of information, and the inherent variation. Often, analyses are centered on a genomic region of interest—a locus that might be associated with a trait or contain genes within the same family or biological pathway. Within these regions, researchers examine the conservation of gene order and orientation across organisms and assess sequence similarity, along with other gene content features such as gene size, to find biological variations or potential errors in the data. Automated methods in comparative genomics struggle to identify meaningful patterns due to varying and often unknown features of interest, leaving manual, time-intensive, and scalability-challenged visualization as the primary alternative. To address these challenges, we present a multiscale design for studying gene organization within pangenomes, developed in close collaboration with domain experts. Our tool, Multipla, enables users to explore organization at multiple levels of detail in a decluttered manner through layout abstractions, semantic zooming, and layouts with flexible distance definitions and feature selections, combining the advantages of manual and automated methods used in practice. We evaluate the design of Multipla through two pangenomic use cases and conclude with lessons learned from designing multiscale views for pangenomic locus analysis.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computer Graphics Forum
Computer Graphics Forum 工程技术-计算机:软件工程
CiteScore
5.80
自引率
12.00%
发文量
175
审稿时长
3-6 weeks
期刊介绍: Computer Graphics Forum is the official journal of Eurographics, published in cooperation with Wiley-Blackwell, and is a unique, international source of information for computer graphics professionals interested in graphics developments worldwide. It is now one of the leading journals for researchers, developers and users of computer graphics in both commercial and academic environments. The journal reports on the latest developments in the field throughout the world and covers all aspects of the theory, practice and application of computer graphics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信