SUNIL KUMAR SAHU

Results 4 issues of SUNIL KUMAR SAHU

Hi I was trying to understand the code. I found that you are feeding an abstract separately for each candidate pair of the abstract in the model. However, in the...

**Describe the bug** We are in the process of fine-tuning Mixtral-8x22b using an instruction fine-tuning dataset. The model is divided using PP=8 and TP=4. Our experiments are conducted on DGX...

bug
stale