blofn comments

Results: 4 comments of blofn

Try using the original adversarial losses for the 2D LDM tutorial

I have the same problem: the 3dvqgan's results are good, but the LDM can't generate the correct latent vectors.

Running inference on InternVL3-8B-hf with vllm returns ValueError: `limit_mm_per_prompt` is only supported for multimodal models.

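@Kuangdd01 ![Image](https://github.com/user-attachments/assets/de4b58f6-999d-44b9-958d-79ecd41f9caa) Could you explain in detail how to replace these 5 json files? I added extra tokens during training. When I replace the json files in the original chat model with the ones from the fully fine-tuned checkpoint and then run inference with the official vllm, the results get worse, whereas inference through the huggingface framework API in llamafactory works normally.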

Running inference on InternVL3-8B-hf with vllm returns ValueError: `limit_mm_per_prompt` is only supported for multimodal models.

@Kuangdd01 I tried running inference on the saved checkpoint using the template provided on Hugging Face: from transformers import AutoProcessor, AutoModelForImageTextToText import torch torch_device = "cuda" model_checkpoint = "OpenGVLab/InternVL3-1B-hf" processor = AutoProcessor.from_pretrained(model_checkpoint) model = AutoModelForImageTextToText.from_pretrained(model_checkpoint, device_map=torch_device, torch_dtype=torch.bfloat16) messages = [ { "role": "user",...

OOM and slow tokenization after upgrading LLaMA-Factory

same problem @hiyouga