{"title":"跨模态人再识别的两阶段度量学习","authors":"Jiabao Wang, Shanshan Jiao, Yang Li, Zhuang Miao","doi":"10.1145/3381271.3381285","DOIUrl":null,"url":null,"abstract":"Cross-modality person re-identification faces big challenges as the different characteristics of images collected by visible and thermal cameras. The existing deep learning methods always use metric learning to learn the discriminative features. However, the existing metric learning is executed based on batch examples, the solution is local optimal. In order to learn a global solution, we propose a two-stage metric learning (TML) method, which uses local and global metric learning successively. In the first stage, a local metric learning is used based on mini-batch images via triplet loss. A new mixed-modality triplet loss is proposed to train more valid triplet examples. It supervises to learn more efficient features for the next stage. In the second stage, a global metric learning is adopted based on the features of all training images. Experiments are conducted on the public SYSU-MM01 dataset. The TML achieved 39.75% in Rank-1 and 42.73% in mAP, which surpass the state-of-the-art performance.","PeriodicalId":124651,"journal":{"name":"Proceedings of the 5th International Conference on Multimedia and Image Processing","volume":"110 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Two-stage metric learning for cross-modality person re-identification\",\"authors\":\"Jiabao Wang, Shanshan Jiao, Yang Li, Zhuang Miao\",\"doi\":\"10.1145/3381271.3381285\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cross-modality person re-identification faces big challenges as the different characteristics of images collected by visible and thermal cameras. The existing deep learning methods always use metric learning to learn the discriminative features. However, the existing metric learning is executed based on batch examples, the solution is local optimal. In order to learn a global solution, we propose a two-stage metric learning (TML) method, which uses local and global metric learning successively. In the first stage, a local metric learning is used based on mini-batch images via triplet loss. A new mixed-modality triplet loss is proposed to train more valid triplet examples. It supervises to learn more efficient features for the next stage. In the second stage, a global metric learning is adopted based on the features of all training images. Experiments are conducted on the public SYSU-MM01 dataset. The TML achieved 39.75% in Rank-1 and 42.73% in mAP, which surpass the state-of-the-art performance.\",\"PeriodicalId\":124651,\"journal\":{\"name\":\"Proceedings of the 5th International Conference on Multimedia and Image Processing\",\"volume\":\"110 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-01-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 5th International Conference on Multimedia and Image Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3381271.3381285\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 5th International Conference on Multimedia and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3381271.3381285","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Two-stage metric learning for cross-modality person re-identification
Cross-modality person re-identification faces big challenges as the different characteristics of images collected by visible and thermal cameras. The existing deep learning methods always use metric learning to learn the discriminative features. However, the existing metric learning is executed based on batch examples, the solution is local optimal. In order to learn a global solution, we propose a two-stage metric learning (TML) method, which uses local and global metric learning successively. In the first stage, a local metric learning is used based on mini-batch images via triplet loss. A new mixed-modality triplet loss is proposed to train more valid triplet examples. It supervises to learn more efficient features for the next stage. In the second stage, a global metric learning is adopted based on the features of all training images. Experiments are conducted on the public SYSU-MM01 dataset. The TML achieved 39.75% in Rank-1 and 42.73% in mAP, which surpass the state-of-the-art performance.