TextRL issues

Results 4 TextRL issues

Sort by recently updated

Errors may occur after changing the batchsize and update interval of the agent

I follow the example: https://voidful.dev/jupyter/2021/07/25/textrl-elon-musk.html I wonder why batchsize is larger than update_ Interval, so I modify as follows: **before:** `agent = actor.agent_ppo(update_interval=10, minibatch_size=2000, epochs=20)` **after:** `agent = actor.agent_ppo(update_interval=100, minibatch_size=10,...

rongaoli

unfreeze_layer_from_past parameter

Nice repo!!! it seems that the default parameter for the policy will freeze all the layers of the language model we are using and just update the lm_head I tried...

JhonDan1999

Problems in the inference process

Nice repo!! I completed the training using code examples and now make predictions on the test set. But I found that using ```actor. predict``` to obtain the output of the...

ignorejjj

token classification test

Tried to create notebook in examples folder for token classification problem. Please help me develop this.

hemangjoshi37a

TextRL
TextRL copied to clipboard

Metadata

Errors may occur after changing the batchsize and update interval of the agent

unfreeze_layer_from_past parameter

Problems in the inference process

token classification test

← Metadata

Owner

Metadata

TextRL TextRL copied to clipboard

Metadata

Errors may occur after changing the batchsize and update interval of the agent

unfreeze_layer_from_past parameter

Problems in the inference process

token classification test

← Metadata

Owner

Metadata

TextRL
TextRL copied to clipboard