CrevNet-Traffic4cast
CrevNet-Traffic4cast copied to clipboard
Want bair code
This is related to the former question "Want moving_mnist code"(#1) by ManWingloeng.
Could you please offer us bair.py as well? We would like to test using RobotPush dataset.
This code is from data_utils.py,
from data.moving_mnist import MovingMNIST from data.bair import RobotPush from data.kth128 import KTH
You can find the dataloader and data preprocessing code for BAIR in the following link https://github.com/edenton/svg/tree/master/data Note that the architectures used for BAIR is different from the one used for Moving MNIST.
Thank you very much for sharing the bair code. I just want to confirm, when you say the architecture is different, you mean the dataset (and not the algorithm)?
Thank you very much for sharing the bair code. I just want to confirm, when you say the architecture is different, you mean the dataset (and not the algorithm)?
No. For BAIR, the task is stochastic video generation instead of video prediction. So the model architecture is a combination of SVG and CrevNet which was not reported in the paper. The pipeline looks like below.
Thank you for the response. We are currently running into a problem when using bair dataset. We get an error when trying to train using --channels=3. This is the steps we took.
① Using download_bair.sh from the link you sent us, we got the tar file and decompressed it. ② Using convert_bair.py, we extracted the images from tfrecords. ③After changing some arguments on model_mnist.py (--dataset and --data_root), we ran model_mnist.py ④ We got images like the ones below (in black and white) ⑤ If we change the number of channels to 3 (--channels=3), we get this error.
Traceback (most recent call last):
File "model_mnist.py", line 264, in
How can we solve this? Thank you very much for helping us.

@rastaman7 , I get same problem with you, when i set channel to 3. Do you resolve this problem?Thanks.
@rrxi , No, we haven't been able to figure this out yet...
@rastaman7 , @rrxi You can set the number of hidden layers - rnn_size as 96, to match the output feature size from the encoder.