{"title":"GCTransNet: 3D mitochondrial instance segmentation based on Global Context Vision Transformers.","authors":"Chaoyi Chen, Yidan Yan, Jingpeng Wu, Wen-Biao Gan","doi":"10.1016/j.jsb.2025.108170","DOIUrl":null,"url":null,"abstract":"<p><p>Mitochondria are double membrane-bound organelles essential for generating energy in eukaryotic cells. Mitochondria can be readily visualized in 3D using Volume Electron Microscopy (vEM), and accurate image segmentation is vital for quantitative analysis of mitochondrial morphology and function. To address the challenge of segmenting small mitochondrial compartments in vEM images, we propose an automated mitochondrial segmentation method called GCTransNet. This method employs grayscale migration technology to preprocess images, effectively reducing intensity distribution differences across EM images. By utilizing 3D Global Context Vision Transformers (GC-ViT) combined with global context self-attention modules and local self-attention modules, GCTransNet precisely models long-range and short-range spatial interactions. The long-range interactions enable the model to capture the global structural relationships within the mitochondrial segmentation network, while the short-range interactions refine local details and boundaries. In our approach, the encoder of the 3D U-Net network, a classical multi-scale learning architecture that retains high-resolution features through skip connections and combines multi-scale features for precise segmentation, is replaced by a 3D GC-ViT. The GC-ViT leverages shifted window-based self-attention, capturing long-range dependencies and offering improved segmentation accuracy compared to traditional U-Net encoders. In the MitoEM mitochondrial segmentation challenge, GCTransNet achieved state-of-the-art results, demonstrating its superiority in automated mitochondrial segmentation. The code and its documentation are publicly available at https://github.com/GanLab123/GCTransNet.</p>","PeriodicalId":17074,"journal":{"name":"Journal of structural biology","volume":" ","pages":"108170"},"PeriodicalIF":3.0000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of structural biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.jsb.2025.108170","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Mitochondria are double membrane-bound organelles essential for generating energy in eukaryotic cells. Mitochondria can be readily visualized in 3D using Volume Electron Microscopy (vEM), and accurate image segmentation is vital for quantitative analysis of mitochondrial morphology and function. To address the challenge of segmenting small mitochondrial compartments in vEM images, we propose an automated mitochondrial segmentation method called GCTransNet. This method employs grayscale migration technology to preprocess images, effectively reducing intensity distribution differences across EM images. By utilizing 3D Global Context Vision Transformers (GC-ViT) combined with global context self-attention modules and local self-attention modules, GCTransNet precisely models long-range and short-range spatial interactions. The long-range interactions enable the model to capture the global structural relationships within the mitochondrial segmentation network, while the short-range interactions refine local details and boundaries. In our approach, the encoder of the 3D U-Net network, a classical multi-scale learning architecture that retains high-resolution features through skip connections and combines multi-scale features for precise segmentation, is replaced by a 3D GC-ViT. The GC-ViT leverages shifted window-based self-attention, capturing long-range dependencies and offering improved segmentation accuracy compared to traditional U-Net encoders. In the MitoEM mitochondrial segmentation challenge, GCTransNet achieved state-of-the-art results, demonstrating its superiority in automated mitochondrial segmentation. The code and its documentation are publicly available at https://github.com/GanLab123/GCTransNet.
期刊介绍:
Journal of Structural Biology (JSB) has an open access mirror journal, the Journal of Structural Biology: X (JSBX), sharing the same aims and scope, editorial team, submission system and rigorous peer review. Since both journals share the same editorial system, you may submit your manuscript via either journal homepage. You will be prompted during submission (and revision) to choose in which to publish your article. The editors and reviewers are not aware of the choice you made until the article has been published online. JSB and JSBX publish papers dealing with the structural analysis of living material at every level of organization by all methods that lead to an understanding of biological function in terms of molecular and supermolecular structure.
Techniques covered include:
• Light microscopy including confocal microscopy
• All types of electron microscopy
• X-ray diffraction
• Nuclear magnetic resonance
• Scanning force microscopy, scanning probe microscopy, and tunneling microscopy
• Digital image processing
• Computational insights into structure