A Summary of Studies on Bilingual Comparable Corpus
Fei Lv, Zede Zhu
2019 International Conference on Smart Grid and Electrical Automation (ICSGEA), published 2019-08-01
DOI: 10.1109/ICSGEA.2019.00138 (https://doi.org/10.1109/ICSGEA.2019.00138)
Constructing bilingual comparable corpora and mining fine-grained bilingual resources from them is an important way to address the shortage of bilingual resources in cross-language information processing. First, this paper explains the value of bilingual comparable corpus construction. Second, it discusses methods for evaluating the quality of bilingual comparable corpora. Third, it analyzes the state of research on comparable corpus construction, covering lexicon-based features, document-oriented representations, and non-content-based approaches. It then reviews progress in the application of bilingual comparable corpora, which mainly involves bilingual lexicon extraction, bilingual multiword expression extraction, parallel sentence extraction, and cross-language information processing. Finally, it identifies existing problems and proposes directions for future research.