Johann Diep comments

Results 22 comments of Johann Diep

Noisy output & "text_use_bert_cls" error

That looks amazing!

Noisy output & "text_use_bert_cls" error

That's very interesting, I have never trained it for that long, only around 6 hours at most! Will give it a go!

Noisy output & "text_use_bert_cls" error

Btw, it looks like you have more than 5 frames per video. Did you also increase the number of frames accepted by the model?

Noisy output & "text_use_bert_cls" error

Alright, let me increase the number of frames as well and give it a go. I'll report the results in a couple of days!

Noisy output & "text_use_bert_cls" error

@DaddyWesker How did you plot those little GIFs of the results actually?

Noisy output & "text_use_bert_cls" error

@DaddyWesker And have you tried testing it on a more sophisticated dataset, e.g. Kinetics-600 with its text annotations? Would be very interesting to see how the results are conditioned on...

Noisy output & "text_use_bert_cls" error

@DaddyWesker Have to admit, your results look far better than mine: ![Screenshot 2022-08-09 050924](https://user-images.githubusercontent.com/105214231/183555335-db765257-c029-47ad-a66e-e29c3f3a9768.png) This took me 3 days to train, and I only got 1000 epochs. How were you...

Johann Diep

Noisy output & "text_use_bert_cls" error

Duplicate dividing in relative positional encoding

Conditioning on image + text embedding

the commitment: one more residual