TextRL icon indicating copy to clipboard operation
TextRL copied to clipboard

token classification test

Open hemangjoshi37a opened this issue 1 year ago • 3 comments

Tried to create notebook in examples folder for token classification problem. Please help me develop this.

hemangjoshi37a avatar Mar 29 '23 13:03 hemangjoshi37a

Can you explain how reinforcement learning can be used for token classification? I would appreciate more information on this topic.

voidful avatar Mar 30 '23 18:03 voidful

I tried doing it but failed. But you can find here the failed attempt and have some leads. Actually I want to have RLHF for as many tasks as possible from only token classification to image classification, image segmentation, etc etc

hemangjoshi37a avatar Mar 31 '23 05:03 hemangjoshi37a

If you want to collaborate we can work together on this and make a adaptor type model that adds RLHF extension on the output side of any model and instead of human labbeled data it does RLHF for supervised learning

hemangjoshi37a avatar Mar 31 '23 05:03 hemangjoshi37a