python-bpe
How to generate a GPT-2 (OpenAI) encoder.json?
How can I use this library to generate an encoder.json file like the one used by the GPT-2 model?
I'm not familiar with the encoder.json that GPT-2 uses, or with the tokenization it uses. Could you link me to the relevant documentation?
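For context on what the file contains: GPT-2's encoder.json is a JSON object mapping each token string to an integer id, and it is shipped alongside a vocab.bpe file listing the BPE merges. The sketch below is only an illustration of that layout, not this library's API. It assumes you already have an ordered list of merges from some BPE trainer; `build_encoder`, `save_encoder`, and `toy_merges` are hypothetical names, and the three merges are made up. `bytes_to_unicode` follows the byte-to-character mapping used in GPT-2's encoder.py.

```python
import json


def bytes_to_unicode():
    """Map each of the 256 byte values to a printable unicode character."""
    # Bytes that are already printable keep their own character.
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("¡"), ord("¬") + 1))
          + list(range(ord("®"), ord("ÿ") + 1)))
    cs = bs[:]
    n = 0
    # Remaining bytes (control characters, space, ...) are shifted past 255.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


def build_encoder(merges, special_tokens=("<|endoftext|>",)):
    """Hypothetical helper: assign ids to byte symbols, merges, then specials."""
    vocab = list(bytes_to_unicode().values())   # 256 base symbols
    vocab += [a + b for a, b in merges]         # one new token per merge, in order
    vocab += list(special_tokens)               # special tokens come last
    return {token: idx for idx, token in enumerate(vocab)}


def save_encoder(encoder, path="encoder.json"):
    """Hypothetical helper: write the token-to-id mapping as JSON."""
    with open(path, "w", encoding="utf-8") as f:
        json.dump(encoder, f)


# Made-up merges for illustration; "Ġ" is how a leading space is represented.
toy_merges = [("Ġ", "t"), ("Ġt", "h"), ("h", "e")]
encoder = build_encoder(toy_merges)
```

Calling `save_encoder(encoder)` would write the mapping to encoder.json. A matching vocab.bpe would list the same merges, one space-separated pair per line, so that a tokenizer can replay them in order.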