{"title":"Cancer Classification Through p53 Hotspot Mutations: An Ensemble Learning Approach.","authors":"Manisha R Patil, Anand Bihari","doi":"10.1002/cbin.70041","DOIUrl":null,"url":null,"abstract":"<p><p>Tumor suppressor protein p53 is attracting a lot of attention in cancer research because of its role in both tumor cell survival and apoptosis. The most frequently altered tumor suppressor gene in human cancer is TP53. TP53 mutations affecting residues in the protein's DNA binding domain (102-292) account for 80% of the alterations detected in tumors. These are called hotspot mutations because they lose their wild-type function and acquire oncogenic functions that accelerate cancer progression. These functions include promoting the growth, migration, invasion, and initiation of cancer cells and granting drug resistance to cancer cells. Six residues of the p53 protein (Arg175, Gly245, Arg249, Arg248, Arg273, and Arg282) are often altered in human cancer, known as hotspot mutations. Based on these hotspot codons, we identified the cancer types and stability of protein p53 in this study. This study aims to classify cancer types with a high degree of accuracy and precision. The main contribution of this study is that our work presented mutation data (clinically and biologically meaningful features and the role of hotspot codon of protein p53) to classify types of cancer by learning from the labeled data using an ensemble approach. Our research on the classification of cancer types outperformed using the Extreme Gradient boosting classifier (XGBoost) with an accuracy of 99.85%, precision of 99.80%, area under the curve of 100%, MCC of 99. 85%, and F1 of 99.80%.</p>","PeriodicalId":9806,"journal":{"name":"Cell Biology International","volume":" ","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cell Biology International","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/cbin.70041","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Tumor suppressor protein p53 is attracting a lot of attention in cancer research because of its role in both tumor cell survival and apoptosis. The most frequently altered tumor suppressor gene in human cancer is TP53. TP53 mutations affecting residues in the protein's DNA binding domain (102-292) account for 80% of the alterations detected in tumors. These are called hotspot mutations because they lose their wild-type function and acquire oncogenic functions that accelerate cancer progression. These functions include promoting the growth, migration, invasion, and initiation of cancer cells and granting drug resistance to cancer cells. Six residues of the p53 protein (Arg175, Gly245, Arg249, Arg248, Arg273, and Arg282) are often altered in human cancer, known as hotspot mutations. Based on these hotspot codons, we identified the cancer types and stability of protein p53 in this study. This study aims to classify cancer types with a high degree of accuracy and precision. The main contribution of this study is that our work presented mutation data (clinically and biologically meaningful features and the role of hotspot codon of protein p53) to classify types of cancer by learning from the labeled data using an ensemble approach. Our research on the classification of cancer types outperformed using the Extreme Gradient boosting classifier (XGBoost) with an accuracy of 99.85%, precision of 99.80%, area under the curve of 100%, MCC of 99. 85%, and F1 of 99.80%.
期刊介绍:
Each month, the journal publishes easy-to-assimilate, up-to-the minute reports of experimental findings by researchers using a wide range of the latest techniques. Promoting the aims of cell biologists worldwide, papers reporting on structure and function - especially where they relate to the physiology of the whole cell - are strongly encouraged. Molecular biology is welcome, as long as articles report findings that are seen in the wider context of cell biology. In covering all areas of the cell, the journal is both appealing and accessible to a broad audience. Authors whose papers do not appeal to cell biologists in general because their topic is too specialized (e.g. infectious microbes, protozoology) are recommended to send them to more relevant journals. Papers reporting whole animal studies or work more suited to a medical journal, e.g. histopathological studies or clinical immunology, are unlikely to be accepted, unless they are fully focused on some important cellular aspect.
These last remarks extend particularly to papers on cancer. Unless firmly based on some deeper cellular or molecular biological principle, papers that are highly specialized in this field, with limited appeal to cell biologists at large, should be directed towards journals devoted to cancer, there being very many from which to choose.