{"title":"多语言社交媒体信息整合中日益自动化的比较研究","authors":"Qixuan Hou, A. Musaev, Yang Yang, C. Pu","doi":"10.1109/CIC.2017.00049","DOIUrl":null,"url":null,"abstract":"Multilingual support in global applications that integrate and filter social media data is a significant challenge due to the cost of manually developing such social media filters for each language. Using LITMUS landslide information system as an experimental platform, we compared six design alternatives with varied combinations of manually developed filters and automatically translated filters for integrating and filtering social media data. Our experiments on Japanese tweets show that automatically translated filtering produces comparable or better results than manually created filters, achieving similar result quality: in false positives, false negatives, and F-1 scores. Compared to manual development, our results suggest automated translation may be a faster and cheaper approach to the integration of non-English languages, without sacrificing data quality. These results are encouraging for reducing the development cost of localized applications that integrate social media, by using automated translation tools.","PeriodicalId":156843,"journal":{"name":"2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Comparative Study of Increasing Automation in the Integration of Multilingual Social Media Information\",\"authors\":\"Qixuan Hou, A. Musaev, Yang Yang, C. Pu\",\"doi\":\"10.1109/CIC.2017.00049\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multilingual support in global applications that integrate and filter social media data is a significant challenge due to the cost of manually developing such social media filters for each language. Using LITMUS landslide information system as an experimental platform, we compared six design alternatives with varied combinations of manually developed filters and automatically translated filters for integrating and filtering social media data. Our experiments on Japanese tweets show that automatically translated filtering produces comparable or better results than manually created filters, achieving similar result quality: in false positives, false negatives, and F-1 scores. Compared to manual development, our results suggest automated translation may be a faster and cheaper approach to the integration of non-English languages, without sacrificing data quality. These results are encouraging for reducing the development cost of localized applications that integrate social media, by using automated translation tools.\",\"PeriodicalId\":156843,\"journal\":{\"name\":\"2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIC.2017.00049\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIC.2017.00049","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Comparative Study of Increasing Automation in the Integration of Multilingual Social Media Information
Multilingual support in global applications that integrate and filter social media data is a significant challenge due to the cost of manually developing such social media filters for each language. Using LITMUS landslide information system as an experimental platform, we compared six design alternatives with varied combinations of manually developed filters and automatically translated filters for integrating and filtering social media data. Our experiments on Japanese tweets show that automatically translated filtering produces comparable or better results than manually created filters, achieving similar result quality: in false positives, false negatives, and F-1 scores. Compared to manual development, our results suggest automated translation may be a faster and cheaper approach to the integration of non-English languages, without sacrificing data quality. These results are encouraging for reducing the development cost of localized applications that integrate social media, by using automated translation tools.