Computational prediction of mutagenicity through comprehensive cell painting analysis.

IF 4.3 4区医学 Q3 GENETICS & HEREDITY

Mutagenesis Pub Date : 2025-10-17 DOI:10.1093/mutage/geaf014

Natacha Cerisier, Emily Truong, Taku Watanabe, Taro Oshiro, Tomohiro Takahashi, Shigeaki Ito, Olivier Taboureau

{"title":"Computational prediction of mutagenicity through comprehensive cell painting analysis.","authors":"Natacha Cerisier, Emily Truong, Taku Watanabe, Taro Oshiro, Tomohiro Takahashi, Shigeaki Ito, Olivier Taboureau","doi":"10.1093/mutage/geaf014","DOIUrl":null,"url":null,"abstract":"<p><p>The mutagenicity of chemical compounds is a key consideration in toxicology, drug development, and environmental safety. Traditional methods such as the Ames test, while reliable, are time-intensive and costly. With advances in imaging and machine learning (ML), high-content assays like cell painting offer new opportunities for predictive toxicology. Cell painting captures extensive morphological features of cells, which can correlate with chemical bioactivity. In this study, we leveraged cell painting data to develop ML models for predicting mutagenicity and compared their performance with structure-based models. We used two datasets: a Broad Institute dataset containing profiles of over 30 000 molecules and a U.S.-Environmental Protection Agency dataset with images of 1200 chemicals tested at multiple concentrations. By integrating these datasets, we aimed to improve the robustness of our models. Among three algorithms tested-Random Forest, Support Vector Machine, and Extreme Gradient Boosting-the third showed the best performance for both datasets. Notably, selecting the most relevant concentration per compound, the phenotypic altering concentration, significantly improved prediction accuracy. Our models outperformed traditional quantitative structure activity relationship (QSAR) tools such as the Virtual models for property Evaluation of chemicals within a Global Architecture (VEGA) and the CompTox Dashboard for the majority of compounds, demonstrating the utility of cell painting features. The cell painting-based models revealed morphological changes related to DNA and RNA perturbation, especially in mitochondria, endoplasmic reticulum and nuclei, aligning with mutagenicity mechanisms. Despite this, certain compounds remained challenging to predict due to inherent dataset limitations and inter-laboratory variability in cell painting technology. The findings highlight the potential of cell painting in mutagenicity prediction, offering a complementary perspective to chemical structure-based models. Future work could involve harmonizing cell painting methodologies across datasets and exploring deep learning techniques to enhance predictive accuracy. Ultimately, integrating cell painting data with QSAR descriptors in hybrid models may unlock novel insights into chemical mutagenicity.</p>","PeriodicalId":18889,"journal":{"name":"Mutagenesis","volume":" ","pages":"560-574"},"PeriodicalIF":4.3000,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12534217/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mutagenesis","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/mutage/geaf014","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}

引用次数: 0

Abstract

The mutagenicity of chemical compounds is a key consideration in toxicology, drug development, and environmental safety. Traditional methods such as the Ames test, while reliable, are time-intensive and costly. With advances in imaging and machine learning (ML), high-content assays like cell painting offer new opportunities for predictive toxicology. Cell painting captures extensive morphological features of cells, which can correlate with chemical bioactivity. In this study, we leveraged cell painting data to develop ML models for predicting mutagenicity and compared their performance with structure-based models. We used two datasets: a Broad Institute dataset containing profiles of over 30 000 molecules and a U.S.-Environmental Protection Agency dataset with images of 1200 chemicals tested at multiple concentrations. By integrating these datasets, we aimed to improve the robustness of our models. Among three algorithms tested-Random Forest, Support Vector Machine, and Extreme Gradient Boosting-the third showed the best performance for both datasets. Notably, selecting the most relevant concentration per compound, the phenotypic altering concentration, significantly improved prediction accuracy. Our models outperformed traditional quantitative structure activity relationship (QSAR) tools such as the Virtual models for property Evaluation of chemicals within a Global Architecture (VEGA) and the CompTox Dashboard for the majority of compounds, demonstrating the utility of cell painting features. The cell painting-based models revealed morphological changes related to DNA and RNA perturbation, especially in mitochondria, endoplasmic reticulum and nuclei, aligning with mutagenicity mechanisms. Despite this, certain compounds remained challenging to predict due to inherent dataset limitations and inter-laboratory variability in cell painting technology. The findings highlight the potential of cell painting in mutagenicity prediction, offering a complementary perspective to chemical structure-based models. Future work could involve harmonizing cell painting methodologies across datasets and exploring deep learning techniques to enhance predictive accuracy. Ultimately, integrating cell painting data with QSAR descriptors in hybrid models may unlock novel insights into chemical mutagenicity.

查看原文本刊更多论文

“通过综合细胞绘画分析计算突变性预测”。

化合物的致突变性是毒理学、药物开发和环境安全的一个重要考虑因素。传统的方法，如Ames测试，虽然可靠，但耗时且昂贵。随着成像和机器学习技术的进步，像Cell Painting这样的高含量分析为预测毒理学提供了新的机会。细胞绘画捕捉细胞的广泛形态特征，这可能与化学生物活性相关。在这项研究中，我们利用Cell Painting数据开发了用于预测突变性的机器学习模型，并将其性能与基于结构的模型进行了比较。我们使用了两个数据集：一个是Broad研究所的数据集，包含超过30,000个分子的概况，另一个是美国环保署的数据集，包含1200种化学物质在多种浓度下的测试图像。通过整合这些数据集，我们旨在提高模型的稳健性。在测试的三种算法中——随机森林、支持向量机和极端梯度增强——第三种算法在两个数据集上都表现出最好的性能。值得注意的是，选择每种化合物最相关的浓度，即表型改变浓度，显著提高了预测精度。对于大多数化合物，我们的模型优于传统的QSAR工具，如VEGA和CompTox Dashboard，证明了Cell Painting功能的实用性。基于细胞绘画的模型揭示了与DNA/RNA和内质网扰动相关的形态学变化，特别是在线粒体和细胞核中，与致突变性机制一致。尽管如此，由于固有的数据集限制和细胞绘画技术的实验室间可变性，某些化合物的预测仍然具有挑战性。这些发现突出了细胞绘画在突变性预测中的潜力，为基于化学结构的模型提供了补充视角。未来的工作可能包括协调跨数据集的细胞绘画方法，并探索深度学习技术，以提高预测的准确性。最终，将Cell Painting数据与混合模型中的QSAR描述符整合在一起，可能会解开对化学致突变性的新见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Mutagenesis 生物-毒理学

CiteScore

5.90

自引率

3.70%

发文量

审稿时长

6-12 weeks

期刊介绍： Mutagenesis is an international multi-disciplinary journal designed to bring together research aimed at the identification, characterization and elucidation of the mechanisms of action of physical, chemical and biological agents capable of producing genetic change in living organisms and the study of the consequences of such changes.