R. Vignesh, D. Deepa, Suja Cherukullapurath Mana, B. Samhitha, A. T
{"title":"癌症数据集的基因表达分析","authors":"R. Vignesh, D. Deepa, Suja Cherukullapurath Mana, B. Samhitha, A. T","doi":"10.1109/ICOEI51242.2021.9452965","DOIUrl":null,"url":null,"abstract":"Genes are the basis of tumor formations around the body, which is better known as cancer. They inhibit basic processes such as cell death (apoptosis) and promote cell division to an unhealthy extent. The expression of every gene provides a baseline to know the progress of cancer from the organ or tissue it originated from along with its approximated course of action. The analysis of such gene expression values using traditional machine learning methods provide a higher efficiency and accuracy in finding relationships between genes and also it may serve as a future for diagnosing the cancer by using these values. The main challenge is to use the bases that are created to efficiently compute the highly effective genes to treat specific types of cancer by using their expression values and thus, raise the question of a potential relationship between them for each type. A Random Forest Model has been used to perform Feature Selection over the dataset in order to extract the important features (i.e.) the most influential genes. They are then visualized by using traditional packages in Python (i.e. Scikit-plot, Matplotlib, Seaborn) and using a data visualization tool called Tableau to project the result of the analysis.","PeriodicalId":420826,"journal":{"name":"2021 5th International Conference on Trends in Electronics and Informatics (ICOEI)","volume":"130 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Gene Expression Analysis on Cancer Dataset\",\"authors\":\"R. Vignesh, D. Deepa, Suja Cherukullapurath Mana, B. Samhitha, A. T\",\"doi\":\"10.1109/ICOEI51242.2021.9452965\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Genes are the basis of tumor formations around the body, which is better known as cancer. They inhibit basic processes such as cell death (apoptosis) and promote cell division to an unhealthy extent. The expression of every gene provides a baseline to know the progress of cancer from the organ or tissue it originated from along with its approximated course of action. The analysis of such gene expression values using traditional machine learning methods provide a higher efficiency and accuracy in finding relationships between genes and also it may serve as a future for diagnosing the cancer by using these values. The main challenge is to use the bases that are created to efficiently compute the highly effective genes to treat specific types of cancer by using their expression values and thus, raise the question of a potential relationship between them for each type. A Random Forest Model has been used to perform Feature Selection over the dataset in order to extract the important features (i.e.) the most influential genes. They are then visualized by using traditional packages in Python (i.e. Scikit-plot, Matplotlib, Seaborn) and using a data visualization tool called Tableau to project the result of the analysis.\",\"PeriodicalId\":420826,\"journal\":{\"name\":\"2021 5th International Conference on Trends in Electronics and Informatics (ICOEI)\",\"volume\":\"130 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 5th International Conference on Trends in Electronics and Informatics (ICOEI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOEI51242.2021.9452965\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th International Conference on Trends in Electronics and Informatics (ICOEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOEI51242.2021.9452965","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Genes are the basis of tumor formations around the body, which is better known as cancer. They inhibit basic processes such as cell death (apoptosis) and promote cell division to an unhealthy extent. The expression of every gene provides a baseline to know the progress of cancer from the organ or tissue it originated from along with its approximated course of action. The analysis of such gene expression values using traditional machine learning methods provide a higher efficiency and accuracy in finding relationships between genes and also it may serve as a future for diagnosing the cancer by using these values. The main challenge is to use the bases that are created to efficiently compute the highly effective genes to treat specific types of cancer by using their expression values and thus, raise the question of a potential relationship between them for each type. A Random Forest Model has been used to perform Feature Selection over the dataset in order to extract the important features (i.e.) the most influential genes. They are then visualized by using traditional packages in Python (i.e. Scikit-plot, Matplotlib, Seaborn) and using a data visualization tool called Tableau to project the result of the analysis.