State of the art on the Market-1501+500k dataset
In this page, will summarize the state-of-the-art methods on Market-1501+500k dataset. We report both mAP and rank-1 accuracy under different gallery sizes. Note that this may not be the only performance measurement. Other metrics, such as retrieval time, are also important. Should you have any inquiries please let me know at liangzheng06@gmail.com.
Reference | Gallery size | Notes | ||||||
metric | 19,732 | 119,732 | 219,732 | 519,732 | ||||
"Scalable person re-identification: a benchmark", Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, Qi Tian, ICCV 2015 | mAP | 13.94 | 11.44 | 10.52 | 8.66 | BoW, Euclidean distance, single query | ||
mAP | 13.85 | 10.88 | 9.75 | 7.56 | BoW+ANN [1], single query | |||
mAP | 18.38 | 15.95 | 14.88 | 12.60 | BoW, Euclidean distance, multiple query | |||
mAP | 18.26 | 15.09 | 13.75 | 10.92 | BoW+ANN [1], multiple query | |||
"Person re-identification: Past, Present and Future", Liang Zheng, Yi Yang, Alexander Hauptmann, Arxiv 2016 | rank-1 | 73.69 | 72.15 | 71.55 | 70.67 | ResNet50 baseline. The 2,048-dim feature from pool5 is used under Euclidean distance. Code can be accessed here. | ||
mAP | 51.48 | 48.72 | 47.57 | 46.05 | ||||
Current state of the art | ||||||||
"A Discriminatively Learned CNN Embedding for Person Re-identification", Zhedong Zheng, Liang Zheng and Yi Yang, Arxiv 2017. | rank-1 | 79.51 | 73.78 | 71.50 | 68.26 | A two-stream network based on ResNet50, single query. Code is available upon request. | ||
mAP | 59.87 | 52.28 | 49.11 | 45.24 | ||||
"Improving Person Re-identification by Attribute and Identity Learning", Yutian Lin, Liang Zheng, Zhedong Zheng, Yu Wu and Yi Yang, Arxiv 2017. | rank-1 | 83.99 | 79.89 | 78.20 | 75.44 | Attribute and ID classification is jointly learned. ResNet50 is used as backbone. Pool5 feature is used under Euclidean distance, single query. Attribute labels can be accessed here. | ||
mAP | 62.83 | 56.46 | 53.58 | 49.78 | ||||
"In Defense of the Triplet Loss for Person Re-Identification", Alexander Hermans, Lucas Beyer and Bastian Leibe, Arxiv 2017. | rank-1 | 84.92 | 79.69 | 77.88 | 74.70 | single query. The triplet-loss based network is fine-tuned. Image size: 256x128. The last layer in ResNet is replaced with one 1,024-dim layer and one 128-dim layer. Batch normalization. | ||
mAP | 69.14 | 61.93 | 58.74 | 53.63 |
References
[1] J. Wang and S. Li. Query-driven iterated neighborhood graph search for large scale indexing. In ACM MM, 2012.