Muhammad Khalifa

Results 4 issues of Muhammad Khalifa

Thank you for the amazing repo. I am curious why are some titles missing from the tfidf index. It seems that during evaluation we get multiple such warnings: ``` Oranjegekte_0...

Hello, Thank you for this tool. I would like to add the possibility of training using Reinforcement Learning using a reward such as ROUGE or BLEU, for seq2seq tasks. I...

I want my reward function to depend on the prompt used. Mainly, I want to fine-tune an LM for a conditional generation task e.g., summarization. It seems that the reward...

Could you release the batch code of calculator-based sampling with activation caching?