llx comments

Results 5 comments of

llx

出现问题 Setting `pad_token_id` to `eos_token_id`:106068 for open-end generation.

我是用多卡推理时也出现了这个问题

`NotImplementedError: Cannot copy out of meta tensor; no data!` with `functorch.make_functional`

Maybe you could try using huggingface/accelerate to load the actual weight to your module [https://github.com/huggingface/accelerate](url) ``` from accelerate.utils import named_module_tensors, set_module_tensor_to_device ... for name, _ in named_module_tensors(module): set_module_tensor_to_device(module, name, your_execution_device)...

fatal error: cuda_fp8.h: No such file or directory

Hi, I encountered same problem when building [flashinfer](https://github.com/flashinfer-ai/flashinfer), have you guys figured out how to fix that?

Activation Channel Scales and Calibration

You can find it in Huggingface: [Link](https://huggingface.co/datasets/mit-han-lab/pile-val-backup)，but after i downloaded the dataset and use it, it output that dataset is corrupt and unusable

A demo without gradio

Hi, you can try extracting gradio's inference operations manually, as in the following code ``` if args.model_type == 'vicuna': chat_state = default_conversation.copy() else: chat_state = conv_llava_llama_2.copy() video_path = "your_path" chat_state.system...