{"title":"大数据,小决策:加强数据处理和人类专业知识之间的循环","authors":"Zack Bennett, Marc G. L'Heureux","doi":"10.1109/CTS.2013.6567205","DOIUrl":null,"url":null,"abstract":"This presentation is a case study examining how LexisNexis uses scaled active learning on the HPCC Systems environment to focus manual topical annotations on critical documents pulled from a large corpus. The active learning system uses natural language processing and machine learning techniques to identify and present “next best” training set candidates to legal editors, combining massive parallel processing with expert human analysis to improve classifier accuracy while minimizing human effort.","PeriodicalId":256633,"journal":{"name":"2013 International Conference on Collaboration Technologies and Systems (CTS)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Big data, little decisions: Tightening the loop between data crunching and human expertise\",\"authors\":\"Zack Bennett, Marc G. L'Heureux\",\"doi\":\"10.1109/CTS.2013.6567205\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This presentation is a case study examining how LexisNexis uses scaled active learning on the HPCC Systems environment to focus manual topical annotations on critical documents pulled from a large corpus. The active learning system uses natural language processing and machine learning techniques to identify and present “next best” training set candidates to legal editors, combining massive parallel processing with expert human analysis to improve classifier accuracy while minimizing human effort.\",\"PeriodicalId\":256633,\"journal\":{\"name\":\"2013 International Conference on Collaboration Technologies and Systems (CTS)\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-05-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 International Conference on Collaboration Technologies and Systems (CTS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CTS.2013.6567205\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Collaboration Technologies and Systems (CTS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CTS.2013.6567205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Big data, little decisions: Tightening the loop between data crunching and human expertise
This presentation is a case study examining how LexisNexis uses scaled active learning on the HPCC Systems environment to focus manual topical annotations on critical documents pulled from a large corpus. The active learning system uses natural language processing and machine learning techniques to identify and present “next best” training set candidates to legal editors, combining massive parallel processing with expert human analysis to improve classifier accuracy while minimizing human effort.