{"title":"文本分类技术综述","authors":"M. M. Evangeline, K. Shyamala","doi":"10.1109/ICIPTM52218.2021.9388332","DOIUrl":null,"url":null,"abstract":"The amount of data being generated during recent times has been exponentially huge. The data mainly comprises of unstructured data in the form of textual information like emails, tweets, articles etc. To gain information from these textual data, traditional way of analyzing cannot be used. There is a need for efficient techniques for analyzing these data. Text mining is defined as the process of transforming this unstructured data into understandable and meaningful information. Text Mining is a subfield of Artificial Intelligence which aims to automatically process the data and gain insights from the huge voluminous data. In this paper, several techniques used for classifying the data have been discussed. An overview about the dimensionality reduction methodology and how it can enhance the categorization process has been highlighted. It also aims with a future research scope in extending this categorization process along with dimensionality reduction procedures.","PeriodicalId":315265,"journal":{"name":"2021 International Conference on Innovative Practices in Technology and Management (ICIPTM)","volume":"40 12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Text Categorization Techniques: A Survey\",\"authors\":\"M. M. Evangeline, K. Shyamala\",\"doi\":\"10.1109/ICIPTM52218.2021.9388332\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The amount of data being generated during recent times has been exponentially huge. The data mainly comprises of unstructured data in the form of textual information like emails, tweets, articles etc. To gain information from these textual data, traditional way of analyzing cannot be used. There is a need for efficient techniques for analyzing these data. Text mining is defined as the process of transforming this unstructured data into understandable and meaningful information. Text Mining is a subfield of Artificial Intelligence which aims to automatically process the data and gain insights from the huge voluminous data. In this paper, several techniques used for classifying the data have been discussed. An overview about the dimensionality reduction methodology and how it can enhance the categorization process has been highlighted. It also aims with a future research scope in extending this categorization process along with dimensionality reduction procedures.\",\"PeriodicalId\":315265,\"journal\":{\"name\":\"2021 International Conference on Innovative Practices in Technology and Management (ICIPTM)\",\"volume\":\"40 12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-02-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Innovative Practices in Technology and Management (ICIPTM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIPTM52218.2021.9388332\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Innovative Practices in Technology and Management (ICIPTM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIPTM52218.2021.9388332","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The amount of data being generated during recent times has been exponentially huge. The data mainly comprises of unstructured data in the form of textual information like emails, tweets, articles etc. To gain information from these textual data, traditional way of analyzing cannot be used. There is a need for efficient techniques for analyzing these data. Text mining is defined as the process of transforming this unstructured data into understandable and meaningful information. Text Mining is a subfield of Artificial Intelligence which aims to automatically process the data and gain insights from the huge voluminous data. In this paper, several techniques used for classifying the data have been discussed. An overview about the dimensionality reduction methodology and how it can enhance the categorization process has been highlighted. It also aims with a future research scope in extending this categorization process along with dimensionality reduction procedures.