Bilinear-CNN-TensorFlow icon indicating copy to clipboard operation
Bilinear-CNN-TensorFlow copied to clipboard

Can you expleain please, why training image size is 488 * 488 ?

Open mahfuj9346449 opened this issue 7 years ago • 5 comments

build_hdf5_image_dataset(new_train, image_shape=(488, 488), mode='file', output_path='new_train_488.h5', categorical_labels=True, normalize=False)

In your code :

sess = tf.Session()     ## Start session to create training graph

imgs = tf.placeholder(tf.float32, [None, 448, 448, 3])
target = tf.placeholder("float", [None, 100])

Can you explain please why 488,488 and 448,448

mahfuj9346449 avatar Aug 11 '17 17:08 mahfuj9346449

As we are looking at features which are useful for fine grained visual recognition, scaling of images is likely to improve results for the classification.

abhaydoke09 avatar Aug 14 '17 14:08 abhaydoke09

The resolution of image in train.h5 is 488, while the input of the placeholder is in 448. There is an error in create_h5 file.

alg-leon avatar Mar 24 '18 03:03 alg-leon

@thkinglee It's not an error. I am keeping the resolution to 488 in create_h5, because at the training time I am cropping an image of 448x448 randomly from the original image of 488x488. This is called data augmentation. It is used for better generalisation and expanding the dataset.

abhaydoke09 avatar Apr 02 '18 19:04 abhaydoke09

hello, I still have a problem. After running the second part of the whole model, I will finish training. It seems that the final model is not saved in the code. Why is this done in the absence of the training model? Can you give me some details?

JUSTDODoDo avatar Oct 28 '18 10:10 JUSTDODoDo

In the first step,I set the breaking epoch at x then I got the last_layers_epoch_x.npz .That .npz was loaded to do the second part for finetuning the whole model.So the training model was not absent.I don't know whether this can solve your problem.

xiaomiHuang avatar Apr 14 '19 01:04 xiaomiHuang