RLHF
RLHF copied to clipboard
How to fix the following errors?
The following error occurred while running cell 10 in 6. Tune language model using PPO with our preference model.
After adding __init__.py
to /content/trlx/examples/summarize_rlhf/reward_model/
, I still get the same error.
How can I fix it?
10 import torch
11 from datasets import load_dataset
---> 12 from reward_model.reward_model import GPTRewardModel
13 from tqdm import tqdm
14 from transformers import AutoTokenizer
ModuleNotFoundError: No module named 'reward_model.reward_model'; 'reward_model' is not a package