recurrent-visual-attention issues

Results 18 recurrent-visual-attention issues

Can this model support multiple labels per image?

Hi @clvcooke @kevinzakka @malashinroman, can this model support multiple labels per image? I need a classification model for images that contain multiple classes. I wonder ... Thanks in advance....

bemoregt
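
For the multi-label question above, here is a minimal sketch of one common approach, assuming the classifier's final softmax is swapped for independent per-class sigmoids. Nothing in this listing confirms that the repository supports this; `multilabel_predict`, `binary_cross_entropy`, and the 0.5 threshold are hypothetical choices made for illustration.

```python
import math

def sigmoid(z):
    """Logistic function mapping a raw score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def multilabel_predict(logits, threshold=0.5):
    """Return indices of every class whose sigmoid probability exceeds threshold.

    Hypothetical helper: unlike softmax + argmax, each class is scored
    independently, so zero, one, or several labels can be active at once.
    """
    return [i for i, z in enumerate(logits) if sigmoid(z) > threshold]

def binary_cross_entropy(logits, targets):
    """Mean per-class binary cross-entropy for one image (targets are 0 or 1)."""
    eps = 1e-12  # guards against log(0)
    total = 0.0
    for z, t in zip(logits, targets):
        p = sigmoid(z)
        total += -(t * math.log(p + eps) + (1 - t) * math.log(1 - p + eps))
    return total / len(logits)

# Example: raw scores for 4 classes; classes 0 and 2 are both present.
logits = [2.0, -1.5, 0.7, -3.0]
predicted = multilabel_predict(logits)
loss = binary_cross_entropy(logits, [1, 0, 1, 0])
```

The design point is that the loss sums one binary decision per class instead of forcing the scores to compete, which is what allows more than one label per image.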

Mistake in a comment

The comment in module.py (line 21) says "x" is a 4D tensor of shape (B, H, W, C), but it is actually a 4D tensor of shape (B, C, H, W)...

cyccchs

How negative numbers affect gradient descent

The loss may be a negative number in this model. The reason is that the REINFORCE loss is often negative, since the reward is the larger the...

yxiao54
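
On the question above: gradient descent uses only the gradient of the loss, so the sign of the loss value itself does not change the update. A minimal sketch with a toy objective whose minimum is negative; the function `(x - 3)^2 - 10` is an illustrative assumption, not the model's REINFORCE loss.

```python
def loss(x):
    # Toy objective: minimum value is -10, reached at x = 3.
    return (x - 3.0) ** 2 - 10.0

def grad(x):
    # Analytic derivative of the toy objective.
    return 2.0 * (x - 3.0)

x = 0.0
lr = 0.1
history = []
for _ in range(100):
    history.append(loss(x))
    x -= lr * grad(x)  # the update depends on the gradient only

final_loss = loss(x)
```

Here the loss starts at -1 and keeps decreasing toward -10 while `x` converges to 3. Adding any constant to the loss would shift every value in `history` but leave the trajectory of `x` unchanged.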

PyCharm step debugging freezes, bug fixed

python3 main.py --use_gpu False --is_train True

~~~~~~~~~~~~~~~~~~main.py~~~~~~~~~~~~~~~
#kwargs = {}
if config.use_gpu:
    torch.cuda.manual_seed(config.random_seed)
    kwargs = {"num_workers": 1, "pin_memory": True}
else:
    kwargs = {}

# instantiate data loaders
'''if config.is_train: dloader...

fatalfeel

I tried to increase the hidden units size from 128 to 256, but I am getting the following error.

Epoch: 1/1500 - LR: 0.000300 0%| | 0/11044 [00:04

DeccanKing1

Performance is not good when using my dataset.

Hi @kevinzakka, I fed in my own data in MNIST format (256x256, grayscale images, 5000 images/class), but performance is not good. What am I doing wrong? ------------------------------------------------ Epoch: 196/500 - LR: 0.000300 0.8s...

bemoregt

How to use our own dataset with this library?

I am trying to understand the code so I can use my own data. How do I provide this data to the repo in PyTorch data format?

DeccanKing1

Start with random initial location

Does anyone know how I can start with random initial coordinates for the first square patch?

GKalliatakis
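
For the question above, here is a minimal sketch of sampling a random starting location. It assumes glimpse centers are expressed as normalized (x, y) coordinates in [-1, 1] with (0, 0) at the image center; whether this repository uses that convention is not stated in this listing. `random_initial_location` and `to_pixels` are hypothetical helpers, not functions from the repository.

```python
import random

def random_initial_location(batch_size, rng=random):
    """Sample one (x, y) pair per image, uniformly in [-1, 1]."""
    return [(rng.uniform(-1.0, 1.0), rng.uniform(-1.0, 1.0))
            for _ in range(batch_size)]

def to_pixels(loc, height, width):
    """Map normalized coordinates in [-1, 1] to (row, col) pixel coordinates."""
    x, y = loc
    col = int(0.5 * (x + 1.0) * (width - 1))
    row = int(0.5 * (y + 1.0) * (height - 1))
    return row, col

rng = random.Random(0)  # seeded so runs are reproducible
locations = random_initial_location(batch_size=4, rng=rng)
pixel_centers = [to_pixels(loc, height=28, width=28) for loc in locations]
```

In PyTorch the same sampling could be written as `torch.empty(batch_size, 2).uniform_(-1, 1)`, again under the assumption that locations live in [-1, 1].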

Detaching l_t

At the moment the location tensor _l_t_ is never detached from the computational graph in spite of both being produced by and 'consumed' by trainable modules. As far as I...

Pozimek

Accuracy calculation is a bit confusing

How do you calculate the final accuracy? If you have 8 steps with one glimpse, do you only consider last step as final prediction for accuracy calculations or do you...

DeccanKing1
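
One possible convention for the question above, assumed here rather than confirmed for this repository, is to score only the prediction made after the final glimpse. A minimal sketch; `step_log_probs` and `last_step_accuracy` are hypothetical names.

```python
def argmax(values):
    """Index of the largest value in a list."""
    return max(range(len(values)), key=lambda i: values[i])

def last_step_accuracy(step_log_probs, labels):
    """Accuracy using only the final step's class scores.

    step_log_probs[b][t] is the list of per-class scores for image b
    after glimpse t; earlier steps are ignored.
    """
    correct = 0
    for per_step, label in zip(step_log_probs, labels):
        final_scores = per_step[-1]
        if argmax(final_scores) == label:
            correct += 1
    return correct / len(labels)

# Two images, three glimpses each, three classes.
step_log_probs = [
    [[-1.0, -1.2, -1.1], [-0.9, -1.5, -1.0], [-0.2, -2.0, -2.5]],  # ends on class 0
    [[-1.1, -1.0, -1.2], [-1.3, -0.8, -1.4], [-1.6, -1.9, -0.4]],  # ends on class 2
]
accuracy = last_step_accuracy(step_log_probs, labels=[0, 1])
```

An alternative would be to average the scores over all steps before taking the argmax; which of the two the repository uses is exactly what the issue asks.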
