
[Datasets] Missing/Incorrect client data mapping in OpenImage Detection dataset

Open justinpan0 opened this issue 9 months ago • 0 comments

What happened + What you expected to happen

The official data downloaded from The PASCAL Visual Object Classes Challenge 2007 names its raw image files with zero-padded numeric IDs:

$ ls VOCdevkit/VOC2007/JPEGImages/ -U | head -n 10
002588.jpg
001540.jpg
005556.jpg
008047.jpg
000188.jpg
002007.jpg
003111.jpg
000421.jpg
005766.jpg
005575.jpg

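These file names do not match the data_path entries in client_data_mapping from the archive on the umich server (https://fedscale.eecs.umich.edu/dataset/openimage_detection.tar.gz), which look like 16-character hexadecimal IDs and have an empty label_name and a label_id of -1: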

$ cat detection/client_data_mapping/trainval.csv | head -n 10
client_id,data_path,label_name,label_id
0,4fa8054781a4c382.jpg,,-1
1,b37f763ae67d0888.jpg,,-1
2,7e8584b0f487cb9e.jpg,,-1
3,249086e72671397d.jpg,,-1
4,3c2fad96113ca97e.jpg,,-1
5,c9e409f562248252.jpg,,-1
6,4ce684871c147d4c.jpg,,-1
7,8d0cbf68276262bb.jpg,,-1
8,51c5d8d5d9cd87ca.jpg,,-1
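
To quantify the mismatch rather than eyeball the first ten lines, something like the following could be used. It is only a minimal sketch: `find_missing_images` is a hypothetical helper and not part of FedScale, and it assumes the mapping CSV has the `data_path` column shown above and that all images sit directly in one folder.

```python
import csv
from pathlib import Path


def find_missing_images(mapping_csv, image_dir):
    """Return the data_path values in mapping_csv that have no file in image_dir.

    Hypothetical helper for illustration; not part of FedScale.
    """
    # File names actually present in the image folder (non-recursive).
    available = {p.name for p in Path(image_dir).iterdir() if p.is_file()}

    # Keep every mapped file name that is absent from the folder.
    with open(mapping_csv, newline="") as f:
        return [
            row["data_path"]
            for row in csv.DictReader(f)
            if row["data_path"] not in available
        ]
```

With the paths from this report, it would be called as `find_missing_images("detection/client_data_mapping/trainval.csv", "VOCdevkit/VOC2007/JPEGImages")`; a result as long as the CSV itself would mean that none of the mapped files exist in the downloaded image folder.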

Versions / Dependencies

N/A

Reproduction script

See above.

Issue Severity

Low: It annoys or frustrates me.

justinpan0 · Oct 01 '23 19:10