zero123 icon indicating copy to clipboard operation
zero123 copied to clipboard

Question about `total_view` used for dataset

Open Xiashangning opened this issue 2 years ago • 3 comments

Hi, first thanks for your great project! I am trying to reproduce the results with diffusers so I have examined the code quite thoroughly. But it seems that the hyper-parameter total_view in the config file provided is inconsistent with the paper. If I understand the logic of getitem in ObjaverseData correctly, it will select randomly two images of the same object in the range [0, total_view[ but the value in the config file is just 4 instead of 12... Do you only use the first 4 camera poses for training or is it a mistake when uploading the config file?

Xiashangning avatar Sep 08 '23 19:09 Xiashangning

Hello, I am also trying to reproduce the results of the paper. I was wondering if you have downloaded the complete data file views_release.tar.gz and how much storage space it takes up approximately after decompression?Thanks

Luh1124 avatar Sep 11 '23 10:09 Luh1124

Around 1.7T

Xiashangning avatar Sep 11 '23 13:09 Xiashangning

@Xiashangning Hello, I was wondering if you have figured the problem out. But I think it may be alright to just change the value in the config file from 4 to 12 since there are 12 images for each object. And I believe this may have something to do with https://github.com/cvlab-columbia/zero123/issues/60#issue-1800595706

Jason-aplp avatar Dec 26 '23 07:12 Jason-aplp