{"title":"印度语言的深层释义检测","authors":"Rupal Bhargava, Gargi Sharma, Yashvardhan Sharma","doi":"10.1145/3110025.3122119","DOIUrl":null,"url":null,"abstract":"This paper presents an approach to the problem of paraphrase identification in English and Indian languages using Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). Traditional machine learning approaches used features that involved using resources such as POS taggers, dependency parsers, etc. for English. The lack of similar resources for Indian languages has been a deterrent to the advancement of paraphrase detection task in Indian languages. Deep learning helps in overcoming the shortcomings of traditional machine Learning techniques. In this paper, three approaches have been proposed, a simple CNN that uses word embeddings as input, a CNN that uses WordNet scores as input and RNN based approach with both LSTM and bi-directional LSTM.","PeriodicalId":399660,"journal":{"name":"Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Deep Paraphrase Detection in Indian Languages\",\"authors\":\"Rupal Bhargava, Gargi Sharma, Yashvardhan Sharma\",\"doi\":\"10.1145/3110025.3122119\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents an approach to the problem of paraphrase identification in English and Indian languages using Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). Traditional machine learning approaches used features that involved using resources such as POS taggers, dependency parsers, etc. for English. The lack of similar resources for Indian languages has been a deterrent to the advancement of paraphrase detection task in Indian languages. Deep learning helps in overcoming the shortcomings of traditional machine Learning techniques. In this paper, three approaches have been proposed, a simple CNN that uses word embeddings as input, a CNN that uses WordNet scores as input and RNN based approach with both LSTM and bi-directional LSTM.\",\"PeriodicalId\":399660,\"journal\":{\"name\":\"Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3110025.3122119\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3110025.3122119","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
This paper presents an approach to the problem of paraphrase identification in English and Indian languages using Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). Traditional machine learning approaches used features that involved using resources such as POS taggers, dependency parsers, etc. for English. The lack of similar resources for Indian languages has been a deterrent to the advancement of paraphrase detection task in Indian languages. Deep learning helps in overcoming the shortcomings of traditional machine Learning techniques. In this paper, three approaches have been proposed, a simple CNN that uses word embeddings as input, a CNN that uses WordNet scores as input and RNN based approach with both LSTM and bi-directional LSTM.