german-gpt2
german-gpt2 copied to clipboard
Evaluation metric for generated text
Hello, thank you for your great project! I tried to fine-tune the model with my german text and noticed, that generated sentences are mostly copied from my training dataset. Could you please advice which metric to use to evaluate the generated sentences? Is there a possibility to check, how much of the generated sentence was "copied" from dataset? Thanks in advance!