rotobart
rotobart copied to clipboard
Pre-training BART in Flax on The Pile dataset
Results
1
rotobart issues
Sort by
recently updated
recently updated
newest added
First, thank you for your public code. But I have a questeion, In "DataCollatorForDenoisingTasks" code, I find batch["decoder_input_ids"] = self.shift_tokens_right(batch["input_ids"]) is original order data, but the batch["labels"] = self.add_whole_word_mask(batch["input_ids"], do_permutate)...