RLHF How to fix the following errors?

Open missflash opened this issue 1 year ago • 1 comments

The following error occurs when I run cell 10 in section "6. Tune language model using PPO with our preference model". I added an __init__.py file to /content/trlx/examples/summarize_rlhf/reward_model/, but I still get the same error. How can I fix it?

     10 import torch
     11 from datasets import load_dataset
---> 12 from reward_model.reward_model import GPTRewardModel
     13 from tqdm import tqdm
     14 from transformers import AutoTokenizer

ModuleNotFoundError: No module named 'reward_model.reward_model'; 'reward_model' is not a package

Sep 11 '23 00:09 missflash
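
A possible way to narrow this down, offered as a sketch rather than a confirmed fix: the message "'reward_model' is not a package" generally means Python resolved the name reward_model to a plain module (for example a reward_model.py file found first on sys.path, or one already cached from an earlier cell) instead of the reward_model/ directory. The helpers below, describe_module and prefer_package_dir, are hypothetical names written for this illustration and are not part of trlx. They assume the package's parent directory is /content/trlx/examples/summarize_rlhf, as the path in the report suggests.

```python
import importlib
import importlib.util
import sys


def describe_module(name):
    """Report where `name` would be imported from and whether it is a package."""
    spec = importlib.util.find_spec(name)
    if spec is None:
        return {"found": False, "origin": None, "is_package": False}
    # Packages carry submodule search locations; plain .py modules do not.
    return {
        "found": True,
        "origin": spec.origin,
        "is_package": spec.submodule_search_locations is not None,
    }


def prefer_package_dir(parent_dir, name):
    """Put `parent_dir` first on sys.path and forget cached imports of `name`."""
    if parent_dir in sys.path:
        sys.path.remove(parent_dir)
    sys.path.insert(0, parent_dir)
    # Drop the cached module and any cached submodules so the next import
    # is resolved again from the updated sys.path.
    stale = [m for m in sys.modules if m == name or m.startswith(name + ".")]
    for cached in stale:
        del sys.modules[cached]
    importlib.invalidate_caches()


# Usage in the notebook (paths assumed from the report):
#   print(describe_module("reward_model"))
#   prefer_package_dir("/content/trlx/examples/summarize_rlhf", "reward_model")
#   from reward_model.reward_model import GPTRewardModel
```

If describe_module reports an origin ending in reward_model/reward_model.py with is_package set to False, the notebook's working directory is probably the reward_model folder itself, so changing to its parent directory (or calling prefer_package_dir as above) before the import may resolve the error.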
