beyond-image-to-depth icon indicating copy to clipboard operation
beyond-image-to-depth copied to clipboard

run.sh

Open catherine-qian opened this issue 4 years ago • 3 comments

For the run.sh script: CUDA_VISIBLE_DEVICES=2 python train.py
--validation_on
--dataset mp3d
--img_path /data1/kranti/audio-visual-depth/dataset/visual_echoes/images/mp3d_split_wise
--metadatapath /data1/kranti/audio-visual-depth/dataset/visual_echoes/metadata/mp3d
--audio_path /data1/kranti/audio-visual-depth/dataset/visual_echoes/echoes/mp3d/echoes_navigable
--checkpoints_dir /data1/kranti/audio-visual-depth/checkpoints
--init_material_weight ./checkpoints/material_pre_trained_minc.pth

Could you please let me know how to get the images/ and the metadata/?

catherine-qian avatar Mar 16 '22 05:03 catherine-qian

The metadata is available inside the dataset folder of this repo. For the images, we cannot share the extracted frames directly due to the restriction from Matterport. However, you can use the steps mentioned in the SoundSpaces repo to generate those frames.

krantiparida avatar Mar 16 '22 18:03 krantiparida

thanks for the reply.

Could you please kindly share the scripts to generate those image frames? I have checked soundspaces but cannot successfully generate them.

Best

catherine-qian avatar Mar 17 '22 02:03 catherine-qian

It's been a while but could I ask for the same script? I checked the SoundSpaces repo but it only shows how to create one pkl file for each scene but not how to create your train.pkl file which the code seems to search?

SaschaHornauer avatar Jan 19 '23 18:01 SaschaHornauer