{"title":"使词嵌入适应可追溯性恢复","authors":"Qingsong Tian, Qi-Wei Cao, Qing Sun","doi":"10.1109/ICISCAE.2018.8666883","DOIUrl":null,"url":null,"abstract":"Maintaining the traceability links of a software is tedious, error-prone task, but an essential requirement. Information retrieval has been approached to help to generate traceability links. Traceability links are usually determined by the similarity between two artifacts. However, methods are put forward mainly based on vector space model, topic model etc. which ignored the word semantic. According to that, this paper adapts the popular word embedding technique to traceability recovery tasks, and handle the out-of-vocabulary words at test time. In the end, a machine learning method is used (learning to rank) to improve our final result. Several contrast experiments are conducted on five public datasets, and the baseline methods are outperformed under the same condition.","PeriodicalId":129861,"journal":{"name":"2018 International Conference on Information Systems and Computer Aided Education (ICISCAE)","volume":"147 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Adapting Word Embeddings to Traceability Recovery\",\"authors\":\"Qingsong Tian, Qi-Wei Cao, Qing Sun\",\"doi\":\"10.1109/ICISCAE.2018.8666883\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Maintaining the traceability links of a software is tedious, error-prone task, but an essential requirement. Information retrieval has been approached to help to generate traceability links. Traceability links are usually determined by the similarity between two artifacts. However, methods are put forward mainly based on vector space model, topic model etc. which ignored the word semantic. According to that, this paper adapts the popular word embedding technique to traceability recovery tasks, and handle the out-of-vocabulary words at test time. In the end, a machine learning method is used (learning to rank) to improve our final result. Several contrast experiments are conducted on five public datasets, and the baseline methods are outperformed under the same condition.\",\"PeriodicalId\":129861,\"journal\":{\"name\":\"2018 International Conference on Information Systems and Computer Aided Education (ICISCAE)\",\"volume\":\"147 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 International Conference on Information Systems and Computer Aided Education (ICISCAE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICISCAE.2018.8666883\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Information Systems and Computer Aided Education (ICISCAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISCAE.2018.8666883","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Maintaining the traceability links of a software is tedious, error-prone task, but an essential requirement. Information retrieval has been approached to help to generate traceability links. Traceability links are usually determined by the similarity between two artifacts. However, methods are put forward mainly based on vector space model, topic model etc. which ignored the word semantic. According to that, this paper adapts the popular word embedding technique to traceability recovery tasks, and handle the out-of-vocabulary words at test time. In the end, a machine learning method is used (learning to rank) to improve our final result. Several contrast experiments are conducted on five public datasets, and the baseline methods are outperformed under the same condition.