Converting an HF model to NeMo produces an error
Describe the bug
I am trying to convert the OPT-350m Hugging Face model to NeMo format. How can I map the following keys?
- model.decoder.project_out.weight
- model.decoder.project_in.weight
I can successfully map all other weight names to NeMo format.
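To make the question concrete, here is a minimal sketch of how the unmapped keys show up when a rename table is applied to the Hugging Face parameter names. The helper `split_by_mapping`, the key subset, and the `target.*` names are all hypothetical placeholders for illustration, not real NeMo parameter names or converter code. As far as I can tell, the Hugging Face OPT implementation only creates `project_in` and `project_out` when `word_embed_proj_dim` differs from `hidden_size`, which would explain why these two keys have no obvious counterpart.

```python
# Hypothetical helper: split Hugging Face parameter names into those that a
# prefix-based rename table covers and those it does not.
def split_by_mapping(hf_keys, rename_prefixes):
    mapped, unmapped = {}, []
    for key in hf_keys:
        for hf_prefix, target_prefix in rename_prefixes.items():
            if key.startswith(hf_prefix):
                # Swap the matching prefix and keep the rest of the name.
                mapped[key] = target_prefix + key[len(hf_prefix):]
                break
        else:
            # No prefix in the table matched this key.
            unmapped.append(key)
    return mapped, unmapped


# Illustrative subset of OPT parameter names (not the full state dict).
hf_keys = [
    "model.decoder.embed_tokens.weight",
    "model.decoder.embed_positions.weight",
    "model.decoder.project_in.weight",
    "model.decoder.project_out.weight",
]

# Placeholder target names; these are assumptions, not NeMo's actual names.
rename_prefixes = {
    "model.decoder.embed_tokens.": "target.word_embeddings.",
    "model.decoder.embed_positions.": "target.position_embeddings.",
}

mapped, unmapped = split_by_mapping(hf_keys, rename_prefixes)
print("unmapped:", unmapped)
```

With this subset, only the two projection weights are left without a target name, which is the situation described above.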
Environment overview
- Environment location: Docker
- Method of NeMo install: Docker container nemo:24.03.01.framework
Environment details
- OS version : Ubuntu 20.04
- PyTorch version : nemo:24.03.01.framework
- Python version : nemo:24.03.01.framework
Additional context
- GPU model: H100