SUNIL KUMAR SAHU
Results
4
issues of
SUNIL KUMAR SAHU
Hi I was trying to understand the code. I found that you are feeding an abstract separately for each candidate pair of the abstract in the model. However, in the...
**Describe the bug** We are in the process of fine-tuning Mixtral-8x22b using an instruction fine-tuning dataset. The model is divided using PP=8 and TP=4. Our experiments are conducted on DGX...
bug
stale