depccg
Customising tokenization
Hello author,
Greetings. I found that there is a config_en.jsonnet, which includes several en.jsonnet files specifying many tokens and CCG rules. May I ask:
- If I modify these files to customise the tokenizer, do I need to retrain the model?
- Is the number of tokens in tokens.en.jsonnet related to the number of targets in targets.en.json?
Thanks and Best Regards, Chriss IT. Leong