Shamane Siri issues

Results 58 issues of


                                            Shamane Siri

Application of dropout when calculating the consistency loss

Hi , I am using BERT based UDA. When I calculated the KL loss between augmented text and unaugmented text, it is similar to when I send the same text...

Where is the entropy minimization loss ?

Did you use this loss ?

would be nice to add an input mask, so we can use arbitrary length input during the training.

The forward function of the TransformerEncoderLayer can have **src_key_padding_mask**. Maybe we can update it too.

can you please help me to understand the retrival part?

What kind of search method are you using? do you use faiss library?

What is the maximum input sequence length that the T5 model can handle?

Can we increase the sequence length?

Nice work! Can you please provide link to download the pretrained models.

how to save the checkpoints here?

Can you illiterate more in the initial state of the LSTM cell

Here, after a given number of episodes(Bath Size) we train the A3C agent with calculating the return. So we need to feed states, return, advantage function as a batch to...

Give an error saying " No module named 'spatial_correlation_sampler'"

DeepMind Retro

# 🌟 New model addition Basically a retrieval augmented model like RAG, but without expensive retriever end2end training ## Open source status * [ ] the model implementation is available:...

New model