{"title":"焦损提高了模型在数据不平衡的多标签图像分类中的性能","authors":"Jianxiang Dong","doi":"10.1145/3411016.3411020","DOIUrl":null,"url":null,"abstract":"Dealing with imbalanced data has always been a great challenge in statistical learning. In classification problems, instances for different classes are unequal in an imbalanced dataset. Therefore, traditional machine learning classification algorithms are usually sensitive to this imbalance both in the training and inference processes. In this paper, by addressing the class imbalance on the basis of Focal Loss, we introduce an approach to improve the performance of convolutional neural networks (CNNs) on the multi-label image classification with an extremely imbalanced dataset. As focal loss puts more focus on hard and misclassified examples when comparing with classic cross-entropy loss, our results demonstrate that such loss function indeed achieves a significant improvement of CNN models (ResNet-50, ResNet-101, and SE-ResNeXt-101) regarding the mean F2 scores on the test set.","PeriodicalId":251897,"journal":{"name":"Proceedings of the 2nd International Conference on Industrial Control Network And System Engineering Research","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Focal Loss Improves the Model Performance on Multi-Label Image Classifications with Imbalanced Data\",\"authors\":\"Jianxiang Dong\",\"doi\":\"10.1145/3411016.3411020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Dealing with imbalanced data has always been a great challenge in statistical learning. In classification problems, instances for different classes are unequal in an imbalanced dataset. Therefore, traditional machine learning classification algorithms are usually sensitive to this imbalance both in the training and inference processes. In this paper, by addressing the class imbalance on the basis of Focal Loss, we introduce an approach to improve the performance of convolutional neural networks (CNNs) on the multi-label image classification with an extremely imbalanced dataset. As focal loss puts more focus on hard and misclassified examples when comparing with classic cross-entropy loss, our results demonstrate that such loss function indeed achieves a significant improvement of CNN models (ResNet-50, ResNet-101, and SE-ResNeXt-101) regarding the mean F2 scores on the test set.\",\"PeriodicalId\":251897,\"journal\":{\"name\":\"Proceedings of the 2nd International Conference on Industrial Control Network And System Engineering Research\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-06-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2nd International Conference on Industrial Control Network And System Engineering Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3411016.3411020\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2nd International Conference on Industrial Control Network And System Engineering Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3411016.3411020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Focal Loss Improves the Model Performance on Multi-Label Image Classifications with Imbalanced Data
Dealing with imbalanced data has always been a great challenge in statistical learning. In classification problems, instances for different classes are unequal in an imbalanced dataset. Therefore, traditional machine learning classification algorithms are usually sensitive to this imbalance both in the training and inference processes. In this paper, by addressing the class imbalance on the basis of Focal Loss, we introduce an approach to improve the performance of convolutional neural networks (CNNs) on the multi-label image classification with an extremely imbalanced dataset. As focal loss puts more focus on hard and misclassified examples when comparing with classic cross-entropy loss, our results demonstrate that such loss function indeed achieves a significant improvement of CNN models (ResNet-50, ResNet-101, and SE-ResNeXt-101) regarding the mean F2 scores on the test set.