{"title":"Using TF-IDF on Kisan Call Centre Dataset for Obtaining Query Answers","authors":"S. K. Mohapatra, Anamika Upadhyay","doi":"10.1109/IC3IOT.2018.8668134","DOIUrl":null,"url":null,"abstract":"Getting semantic similarity in short texts plays an important role for many tasks in the field of information retrieval. This helps in getting search results, fetching answers to queries, building summary of documents etc. We present an approach for manually and automatically getting answers to the different problems and queries of the farmers for their day to day agricultural work. Using this approach, we can provide a query to the model, to find relevant questions asked for that query and their possible answers. We have first preprocessed the data and converted to a similarity matrix which we save in a database using mongoDb. By taking the saved data from database we trained the model to get the information of the query based on similarity between the sentences of the queries, and then the application will find the best possible answer according to the similarity. We will be using Term-frequency-inverse document frequency (TF-IDF) to find the similar queries. With TF-IDF, every word is given weight, the TF-IDF is measured by frequency the relevance is not taken into consideration for this model. This information can be used in trainings to boost call agent effectiveness and improve the customer experience. As an added bonus, more effective communication reduces handle time and operating costs.","PeriodicalId":155587,"journal":{"name":"2018 International Conference on Communication, Computing and Internet of Things (IC3IoT)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Communication, Computing and Internet of Things (IC3IoT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3IOT.2018.8668134","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Getting semantic similarity in short texts plays an important role for many tasks in the field of information retrieval. This helps in getting search results, fetching answers to queries, building summary of documents etc. We present an approach for manually and automatically getting answers to the different problems and queries of the farmers for their day to day agricultural work. Using this approach, we can provide a query to the model, to find relevant questions asked for that query and their possible answers. We have first preprocessed the data and converted to a similarity matrix which we save in a database using mongoDb. By taking the saved data from database we trained the model to get the information of the query based on similarity between the sentences of the queries, and then the application will find the best possible answer according to the similarity. We will be using Term-frequency-inverse document frequency (TF-IDF) to find the similar queries. With TF-IDF, every word is given weight, the TF-IDF is measured by frequency the relevance is not taken into consideration for this model. This information can be used in trainings to boost call agent effectiveness and improve the customer experience. As an added bonus, more effective communication reduces handle time and operating costs.