{"title":"基于混合样本选择策略的主动学习框架","authors":"Longfei Pan, Xiaojun Wang","doi":"10.1145/3318299.3318362","DOIUrl":null,"url":null,"abstract":"Machine learning has achieved outstanding performance in many fields, but its success heavily relies on the large number of annotated training samples. However, for many professional fields, data annotation is not only tedious and time consuming, but also demanding specialty-oriented knowledge and skills, which are not easily accessible. To significantly reduce the cost of annotation, we propose a novel active learning framework called ALBS. ALBS uses the syncretic strategy which incorporates \"most discriminative\" and \"most representative\" to seek \"worthy\" samples from unlabeled dataset and update the model incrementally to enhance the performance continuously. We have evaluated our method on two different audio datasets, demonstrating that the syncretic strategy can makes the promotion of model model's performance more robust and faster than the other strategies, and subsampling the historical labeled dataset can reduce unnecessary computing costs and storage space.","PeriodicalId":164987,"journal":{"name":"International Conference on Machine Learning and Computing","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ALBS: An Active Learning Framework Based on Syncretic Sample Selection Strategy\",\"authors\":\"Longfei Pan, Xiaojun Wang\",\"doi\":\"10.1145/3318299.3318362\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Machine learning has achieved outstanding performance in many fields, but its success heavily relies on the large number of annotated training samples. However, for many professional fields, data annotation is not only tedious and time consuming, but also demanding specialty-oriented knowledge and skills, which are not easily accessible. To significantly reduce the cost of annotation, we propose a novel active learning framework called ALBS. ALBS uses the syncretic strategy which incorporates \\\"most discriminative\\\" and \\\"most representative\\\" to seek \\\"worthy\\\" samples from unlabeled dataset and update the model incrementally to enhance the performance continuously. We have evaluated our method on two different audio datasets, demonstrating that the syncretic strategy can makes the promotion of model model's performance more robust and faster than the other strategies, and subsampling the historical labeled dataset can reduce unnecessary computing costs and storage space.\",\"PeriodicalId\":164987,\"journal\":{\"name\":\"International Conference on Machine Learning and Computing\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-02-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Machine Learning and Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3318299.3318362\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Machine Learning and Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3318299.3318362","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
ALBS: An Active Learning Framework Based on Syncretic Sample Selection Strategy
Machine learning has achieved outstanding performance in many fields, but its success heavily relies on the large number of annotated training samples. However, for many professional fields, data annotation is not only tedious and time consuming, but also demanding specialty-oriented knowledge and skills, which are not easily accessible. To significantly reduce the cost of annotation, we propose a novel active learning framework called ALBS. ALBS uses the syncretic strategy which incorporates "most discriminative" and "most representative" to seek "worthy" samples from unlabeled dataset and update the model incrementally to enhance the performance continuously. We have evaluated our method on two different audio datasets, demonstrating that the syncretic strategy can makes the promotion of model model's performance more robust and faster than the other strategies, and subsampling the historical labeled dataset can reduce unnecessary computing costs and storage space.