Muhammad Khalifa
Muhammad Khalifa
Thank you for the amazing repo. I am curious why are some titles missing from the tfidf index. It seems that during evaluation we get multiple such warnings: ``` Oranjegekte_0...
Hello, Thank you for this tool. I would like to add the possibility of training using Reinforcement Learning using a reward such as ROUGE or BLEU, for seq2seq tasks. I...
I want my reward function to depend on the prompt used. Mainly, I want to fine-tune an LM for a conditional generation task e.g., summarization. It seems that the reward...
Could you release the batch code of calculator-based sampling with activation caching?