{"title":"基于案例推理方法的文本分类任务","authors":"Igor Nikonov, I. Kurilenko","doi":"10.1109/REEPE49198.2020.9059249","DOIUrl":null,"url":null,"abstract":"This paper discusses the method of solving the text classification task by using case-based reasoning. As a part of the text classification module in the implemented system, modified TF-IDF (Term Frequency - Inverse Document Frequency) measure of the text content based on the known information on the distribution of documents between different categories is proposed. This modification allows to improve the quality of the classification by considering the information on the distribution of the words in the entire case base. Implementation of the prototype of text classification module used in the interactive voice response system is presented. This module allows to get away from button menus to automated voice interaction between the user and the system. The result of computational experiments confirming the effectiveness of the developed system is proposed.","PeriodicalId":142369,"journal":{"name":"2020 International Youth Conference on Radio Electronics, Electrical and Power Engineering (REEPE)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Using Case-Based Reasoning Approach for Text Classification Task\",\"authors\":\"Igor Nikonov, I. Kurilenko\",\"doi\":\"10.1109/REEPE49198.2020.9059249\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper discusses the method of solving the text classification task by using case-based reasoning. As a part of the text classification module in the implemented system, modified TF-IDF (Term Frequency - Inverse Document Frequency) measure of the text content based on the known information on the distribution of documents between different categories is proposed. This modification allows to improve the quality of the classification by considering the information on the distribution of the words in the entire case base. Implementation of the prototype of text classification module used in the interactive voice response system is presented. This module allows to get away from button menus to automated voice interaction between the user and the system. The result of computational experiments confirming the effectiveness of the developed system is proposed.\",\"PeriodicalId\":142369,\"journal\":{\"name\":\"2020 International Youth Conference on Radio Electronics, Electrical and Power Engineering (REEPE)\",\"volume\":\"8 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Youth Conference on Radio Electronics, Electrical and Power Engineering (REEPE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/REEPE49198.2020.9059249\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Youth Conference on Radio Electronics, Electrical and Power Engineering (REEPE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/REEPE49198.2020.9059249","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本文讨论了基于案例推理解决文本分类任务的方法。作为实现系统中文本分类模块的一部分,基于已知的文档在不同类别之间分布的信息,提出了对文本内容进行改进的TF-IDF (Term Frequency - Inverse Document Frequency)度量。这种修改允许通过考虑整个案例库中单词分布的信息来提高分类的质量。介绍了交互式语音应答系统中文本分类模块原型的实现。该模块允许从按钮菜单到用户和系统之间的自动语音交互。计算实验结果证实了所开发系统的有效性。
Using Case-Based Reasoning Approach for Text Classification Task
This paper discusses the method of solving the text classification task by using case-based reasoning. As a part of the text classification module in the implemented system, modified TF-IDF (Term Frequency - Inverse Document Frequency) measure of the text content based on the known information on the distribution of documents between different categories is proposed. This modification allows to improve the quality of the classification by considering the information on the distribution of the words in the entire case base. Implementation of the prototype of text classification module used in the interactive voice response system is presented. This module allows to get away from button menus to automated voice interaction between the user and the system. The result of computational experiments confirming the effectiveness of the developed system is proposed.