Xiu-Shen Wei4* Quan Cui1,2 Lei Yang1 Peng Wang3 Lingqiao Liu3 Jian Yang4
1Megvii Research Nanjing, Megvii Technology Ltd., Nanjing, China
2Graduate School of IPS, Waseda University, Fukuoka, Japan
3School of Computer Science, The University of Adelaide, Adelaide, Australia
4School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, China
Over recent years, there has been growing interest in integrating computer vision technology into the retail industry. Automatic checkout (ACO) is one of the critical problems in this area, which aims to automatically generate the shopping list from images of the products being purchased. The main challenges of this problem are the large scale and fine-grained nature of the product categories, as well as the difficulty of collecting training images that reflect realistic checkout scenarios, due to the continuous update of products. Despite its significant practical and research value, this problem has not been extensively studied in the computer vision community, largely due to the lack of a high-quality dataset. To fill this gap, in this work we propose a new dataset to facilitate relevant research. Our dataset enjoys the following characteristics: (1) It is by far the largest dataset in terms of both product image quantity and product categories. (2) It includes single-product images taken in a controlled environment and multi-product images taken by the checkout system. (3) It provides different levels of annotations for the checkout images. Compared with existing datasets, ours is closer to the realistic setting and can derive a variety of research problems. Besides the dataset, we also benchmark the performance of various approaches on it.
*Notice: If Kaggle is not accessible for downloading, you can alternatively download the dataset via Baidu Drive.
| Split | # images | # objects | # objects/image | # categories/image |
|---|---|---|---|---|
| Training set (Exemplar images) | 53,739 | 53,739 | 1 | 1 |
| Validation set (Checkout images) | 6,000 | 73,602 | 12.27 | 6.33 |
| Test set (Checkout images) | 24,000 | 294,333 | 12.26 | 6.31 |
| Clutter mode | # images | # objects | # objects/image | # categories/image |
|---|---|---|---|---|
| Easy | 10,000 | 71,496 | 7.15 | 3.81 |
| Medium | 10,000 | 122,961 | 12.30 | 6.27 |
| Hard | 10,000 | 173,478 | 17.35 | 8.87 |
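To sanity-check a local copy of the dataset against the statistics above, the per-image object and category counts can be recomputed from the annotations. The sketch below assumes the checkout images ship with COCO-style JSON annotations; the file name `instances_val2019.json` and the field names are assumptions, so consult the downloaded archive for the exact layout.

```python
import json
from collections import defaultdict

# File name is an assumption -- point this at the annotation file in your download.
with open("instances_val2019.json") as f:
    ann = json.load(f)

# Group object annotations by the checkout image they belong to (COCO-style keys assumed).
objects_per_image = defaultdict(list)
for obj in ann["annotations"]:
    objects_per_image[obj["image_id"]].append(obj["category_id"])

num_images = len(ann["images"])
num_objects = sum(len(v) for v in objects_per_image.values())
avg_objects = num_objects / num_images
avg_categories = sum(len(set(v)) for v in objects_per_image.values()) / num_images

print(f"# images: {num_images}")
print(f"# objects: {num_objects}")
print(f"# objects/image: {avg_objects:.2f}")
print(f"# categories/image: {avg_categories:.2f}")
```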
If you have created a model based on the training set and it performs well on the validation set, we encourage you to run it on the test set. The rpctool package (described in the next section of this project page) will compute the corresponding evaluation metrics. You can submit your results to the RPC leaderboard by creating a new issue; your results will be ranked on the leaderboard so that you can benchmark your approach against those of other researchers. We are looking forward to your submission. Please click here to submit.
`rpctool`: A Python package for evaluating your methods on the RPC dataset. It returns several evaluation metrics (listed in the table in Sec. 4.2 of the paper). More information can be found in rpctool.
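For a rough sense of what the headline metric measures before running the official tool: checkout accuracy (cAcc) counts a checkout image as correct only when the predicted shopping list (product category to count) matches the ground truth exactly. The sketch below is an illustrative reimplementation under that reading, not rpctool's code; the prediction format (a dict of per-image category counts) is a hypothetical example, and the official numbers should always come from rpctool.

```python
from collections import Counter

def checkout_accuracy(predictions, ground_truth):
    """predictions / ground_truth map image_id -> {category_id: count}."""
    correct = sum(
        1
        for image_id, gt_counts in ground_truth.items()
        if Counter(predictions.get(image_id, {})) == Counter(gt_counts)
    )
    return correct / len(ground_truth)

# Toy example with two checkout images.
gt = {1: {"cat_3": 2, "cat_7": 1}, 2: {"cat_5": 4}}
pred = {1: {"cat_3": 2, "cat_7": 1}, 2: {"cat_5": 3}}
print(checkout_accuracy(pred, gt))  # 0.5 -- only image 1 matches exactly
```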
The dataset and code packages are free for academic use; you run them at your own risk. For other purposes, please contact the corresponding author, Dr. Xiu-Shen Wei (weixs.gm [at] gmail.com).