FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
WARNING:torch.distributed.elastic.multiprocessing.api:Sending process 3765 closing signal SIGTERM
WARNING:torch.distributed.elastic.multiprocessing.api:Sending process 3766 closing signal SIGTERM
ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: -7) local_rank: 2 (pid: 3767) of binary: /usr/bin/python3
Traceback (most recent call last): File "/usr/local/bin/torchrun", line...
I've been working on a ProgrammaticChatIO class to enable control of FastChat by other programs through standard streams (stdin/stdout/stderr). This was motivated by a desire to use it in JavaScript...
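For context, a minimal sketch of what such a class could look like, assuming the ChatIO interface exposed by fastchat.serve.cli (prompt_for_input / prompt_for_output / stream_output) and assuming the streamed chunks are dicts with a "text" key; both assumptions should be checked against the installed FastChat version:

```python
import sys

from fastchat.serve.cli import ChatIO  # ChatIO base class; its location/interface may differ by version


class ProgrammaticChatIO(ChatIO):
    """Drive FastChat from another process over plain stdin/stdout."""

    def prompt_for_input(self, role: str) -> str:
        # Read one line of user input from stdin; EOF ends the session.
        line = sys.stdin.readline()
        if not line:
            raise EOFError("parent process closed stdin")
        return line.rstrip("\n")

    def prompt_for_output(self, role: str):
        # Announce the responding role so the parent process can parse the stream.
        print(f"{role}: ", end="", flush=True)

    def stream_output(self, output_stream):
        # Forward incremental chunks to stdout as they arrive and return the full text.
        printed = 0
        text = ""
        for chunk in output_stream:
            text = chunk["text"]
            print(text[printed:], end="", flush=True)
            printed = len(text)
        print(flush=True)
        return text
```

A parent process (for example a Node.js script) could then spawn the Python side as a child process and exchange newline-delimited messages over the pipes.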
I want to fine-tune Vicuna via train_mem.py. It requires the flash_attn module, and flash_attn requires nvcc. Based on that, I assume train_mem.py can only run on a GPU. Is my understanding right?...
While tuning, I am getting the following error: AssertionError: No inf checks were recorded for this optimizer. Can anyone help me with this? Here are my training arguments: per_device_train_batch_size=2, warmup_steps=100,...
The controller and model_worker print nothing: I waited for an hour with no output and no errors. Can someone tell me what's going on?
Now the prompt is cut to `max_src_len`, so that prompt + new tokens stays below `context_len`. But then a lot of the prompt may be excluded, such as the system prompt. Could it use a sliding...
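A rough sketch of the alternative being asked for: always keep the system prompt and drop the oldest turns first, instead of cutting the head of the concatenated prompt. The helper name, the Hugging Face-style tokenizer call, and the newline joining are illustrative assumptions, not FastChat's actual prompt-building code:

```python
def truncate_keep_system(system_prompt: str, turns: list[str], tokenizer, max_src_len: int) -> str:
    """Keep the system prompt and as many of the most recent turns as fit into max_src_len."""

    def n_tokens(text: str) -> int:
        return len(tokenizer(text).input_ids)

    budget = max_src_len - n_tokens(system_prompt)
    kept = []
    # Walk backwards from the newest turn, so the oldest turns are dropped first.
    for turn in reversed(turns):
        cost = n_tokens(turn)
        if cost > budget:
            break
        kept.append(turn)
        budget -= cost
    return "\n".join([system_prompt] + list(reversed(kept)))
```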
execute command: python -m fastchat.serve.cli --model-path ./model_weights/lmsys/vicuna-7b-delta-v1.1 --load-8bit
error content: Loading checkpoint shards: 100% | 2/2 [00:04...], followed by a traceback into D:\python\Lib\site-packages\torch\cuda\__init__.py:239 in _lazy_init (around line 236)...
When running apply_delta.py with weight version v1.1, I find that the shape of the parameter lm_head.weight is [32000, 5120] in the base weights and [32001, 5120] in the delta weights. However,...
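A small diagnostic sketch that can make such a mismatch explicit by comparing every tensor shape between the two checkpoints before the delta is applied (the paths are placeholders, and loading both models needs enough CPU RAM):

```python
import torch
from transformers import AutoModelForCausalLM

# Diagnostic sketch: list every tensor whose shape differs between the base
# model and the delta weights before running apply_delta.py.
base = AutoModelForCausalLM.from_pretrained("path/to/llama-13b", torch_dtype=torch.float16)
delta = AutoModelForCausalLM.from_pretrained("path/to/vicuna-13b-delta-v1.1", torch_dtype=torch.float16)

base_sd, delta_sd = base.state_dict(), delta.state_dict()
for name, delta_param in delta_sd.items():
    base_param = base_sd.get(name)
    if base_param is None or base_param.shape != delta_param.shape:
        base_shape = None if base_param is None else tuple(base_param.shape)
        print(name, base_shape, tuple(delta_param.shape))
```

If the only discrepancy is the extra vocabulary row, `base.resize_token_embeddings(delta.config.vocab_size)` would align the embedding and lm_head shapes, though whether resizing the base or re-converting it with a matching tokenizer is the right fix depends on which checkpoint is out of date.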
Kubuntu 22.10, NVIDIA driver version 530.30.02, CUDA version 12.1, RTX 3080 12G. When the Gradio webpage is left open for a few seconds after the response, the bot does not...