RWKV-LM
Paper covering additional tokens idea
Hi there. You mention in the readme that you're interested in potentially adding some special tokens/markers to represent stuff like capitalisation. Just wanted to let you know we tried that in the ULMFiT paper, and it worked pretty well. You can read the details here: https://arxiv.org/abs/1801.06146 . We went beyond capitalisation and added some other tokens too.
Anyhoo this is just an FYI in case it's helpful to you.
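For anyone skimming, here is a rough sketch of what a capitalisation marker could look like in a preprocessing step. It is only an illustration: the token names `<cap>` and `<upper>` and the helper `mark_case` are made up for this example and are not taken from the paper or from this repo. The idea is to lower-case the text so the vocabulary stays small, and to emit a marker token in front of any word that was capitalised so that information isn't lost.

```python
import re

# Hypothetical marker tokens, chosen for illustration only.
CAP = "<cap>"      # the next word had a capitalised first letter
UPPER = "<upper>"  # the next word was written entirely in upper case


def mark_case(text: str) -> list:
    """Lower-case each word and put a marker token before words that were capitalised."""
    out = []
    # Split into runs of word characters and single punctuation characters.
    for tok in re.findall(r"\w+|[^\w\s]", text):
        if tok.isupper() and len(tok) > 1:
            out += [UPPER, tok.lower()]
        elif tok[0].isupper():
            out += [CAP, tok.lower()]
        else:
            out.append(tok)
    return out


print(mark_case("Hello WORLD, this is fine."))
```

A single upper-case letter such as "I" is treated as capitalised rather than all-caps here, which is a design choice of this sketch, not something the paper prescribes.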