
RuntimeError: Unsupported architecture OPTForCausalLM

Open varunlmxd opened this issue 10 months ago • 3 comments

I am trying to merge a model with the OPT architecture with the Mistral 7B model and got the error below. Is there any way to merge OPT models with models that use the Mistral or Llama architecture?

Error:

    Traceback (most recent call last):
      File "/usr/local/bin/mergekit-yaml", line 8, in <module>
        sys.exit(main())
      File "/usr/local/lib/python3.10/dist-packages/click/core.py", line 1157, in __call__
        return self.main(*args, **kwargs)
      File "/usr/local/lib/python3.10/dist-packages/click/core.py", line 1078, in main
        rv = self.invoke(ctx)
      File "/usr/local/lib/python3.10/dist-packages/click/core.py", line 1434, in invoke
        return ctx.invoke(self.callback, **ctx.params)
      File "/usr/local/lib/python3.10/dist-packages/click/core.py", line 783, in invoke
        return __callback(*args, **kwargs)
      File "/content/mergekit/mergekit/options.py", line 78, in wrapper
        f(*args, **kwargs)
      File "/content/mergekit/mergekit/scripts/run_yaml.py", line 47, in main
        run_merge(
      File "/content/mergekit/mergekit/merge.py", line 45, in run_merge
        model_arch_info = [
      File "/content/mergekit/mergekit/merge.py", line 46, in <listcomp>
        get_architecture_info(m.config(trust_remote_code=options.trust_remote_code))
      File "/content/mergekit/mergekit/architecture.py", line 362, in get_architecture_info
        raise RuntimeError(f"Unsupported architecture {arch_name}")
    RuntimeError: Unsupported architecture OPTForCausalLM
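For anyone reproducing this: the traceback shows `get_architecture_info` rejecting the model by its architecture name, so it may help to check what each model declares before starting a merge. The following is only a rough sketch. It assumes each model directory has a Hugging Face style `config.json` with an `architectures` list, and the helper names `read_architecture` and `find_unsupported` are hypothetical, not part of mergekit. The `supported` set must be supplied by the caller; this sketch does not know which architectures mergekit actually accepts.

```python
import json
from pathlib import Path


def read_architecture(config_path):
    """Return the single architecture name declared in a config.json file.

    Assumption: the file has an "architectures" list, as Hugging Face
    model configs usually do.
    """
    config = json.loads(Path(config_path).read_text())
    architectures = config.get("architectures") or []
    if len(architectures) != 1:
        raise ValueError(
            f"Expected exactly one architecture, got {architectures!r}"
        )
    return architectures[0]


def find_unsupported(config_paths, supported):
    """Return {path: architecture} for configs whose architecture is not in `supported`.

    `supported` is a caller-provided set of names (hypothetical here),
    not a list taken from mergekit.
    """
    unsupported = {}
    for path in config_paths:
        arch = read_architecture(path)
        if arch not in supported:
            unsupported[str(path)] = arch
    return unsupported
```

Calling `find_unsupported` with the paths to both models' `config.json` files and a placeholder set of names would flag the OPT model before the merge is attempted, assuming its config declares `OPTForCausalLM` as the error message suggests.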

varunlmxd, Apr 01 '24 13:04