Medusa checkpoint conversion doesn't work with the safetensors version

Open nivibilla opened this issue 11 months ago • 3 comments

System Info

N/A

Who can help?

No response

Information

  • [x] The official example scripts
  • [ ] My own modified scripts

Tasks

  • [x] An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • [ ] My own task or dataset (give details below)

Reproduction

Try converting a Medusa checkpoint using text-generation-inference/Mistral-7B-Instruct-v0.2-medusa.

Expected behavior

The script should load the weights from safetensors.

Actual behavior

The script expects a "medusa_lm_head.pt" file in the folder.

Additional notes

N/A

nivibilla, Feb 27 '24 16:02

Thanks for reporting this, @nivibilla. You are correct: the script needs to be updated so that it can load safetensors as well.
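To illustrate the kind of change being discussed, here is a minimal sketch of a loader that accepts either format. It is not the code from the conversion script or from any PR. The helper names `find_medusa_weights` and `load_medusa_state_dict` are hypothetical, and the file name `medusa_lm_head.safetensors` is an assumption (only `medusa_lm_head.pt` is named in this issue). The sketch relies on `safetensors.torch.load_file` and `torch.load`, which are imported lazily so the file lookup works without either library installed.

```python
from pathlib import Path

# Candidate file names, tried in order. The .pt name is the one the script
# reportedly expects; the .safetensors name is an assumption and may differ.
CANDIDATES = ("medusa_lm_head.safetensors", "medusa_lm_head.pt")


def find_medusa_weights(model_dir):
    """Return the path of the first candidate weights file in model_dir."""
    for name in CANDIDATES:
        path = Path(model_dir) / name
        if path.is_file():
            return path
    raise FileNotFoundError(
        f"none of {CANDIDATES} found in {model_dir}"
    )


def load_medusa_state_dict(model_dir):
    """Load the Medusa head weights as a dict of tensors on the CPU."""
    path = find_medusa_weights(model_dir)
    if path.suffix == ".safetensors":
        # Lazy import: only needed when a safetensors file is present.
        from safetensors.torch import load_file
        return load_file(str(path), device="cpu")
    # Fall back to the pickled PyTorch checkpoint.
    import torch
    return torch.load(str(path), map_location="cpu")
```

Preferring the safetensors file when both exist is a design choice in this sketch, since that format avoids unpickling arbitrary objects; a real fix might choose differently.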

rakib-hasan, Feb 29 '24 02:02

Opened PR #1535 for this.

Tushar-ml, May 02 '24 08:05

@ncomly-nvidia kindly review this PR

Tushar-ml, May 04 '24 08:05

This has been merged, so we can close this issue.

Tushar-ml, Jun 03 '24 16:06