
Official repository for LongChat and LongEval

24 LongChat issues

Hi, I can't understand the printed message. I set:

```python
def __init__(
    self, dim, ratio, max_position_embeddings=2048, base=10000, device=None
):
```

with ratio=2 and max_position_embeddings=1024, since my GPU cannot fit the minimal...
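For reference, the condensed rotary embedding divides position indices by `ratio`, so that `ratio * max_position_embeddings` tokens map into the position range the base model was trained on. A minimal sketch reconstructed from the signature quoted above, not the repository's exact code:

```python
import torch

class CondensedRotaryEmbedding(torch.nn.Module):
    """Sketch of a condensed rotary embedding, reconstructed from the
    signature in the issue above; not the repo's exact implementation."""

    def __init__(self, dim, ratio, max_position_embeddings=2048, base=10000, device=None):
        super().__init__()
        inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2, device=device).float() / dim))
        self.register_buffer("inv_freq", inv_freq)
        # Condensing: ratio * max_position_embeddings tokens share the
        # original position range, so each index is scaled down by ratio.
        max_len = max_position_embeddings * ratio
        t = torch.arange(max_len, device=device).float() / ratio
        freqs = torch.einsum("i,j->ij", t, self.inv_freq)
        emb = torch.cat((freqs, freqs), dim=-1)
        self.register_buffer("cos_cached", emb.cos()[None, None, :, :])
        self.register_buffer("sin_cached", emb.sin()[None, None, :, :])

    def forward(self, x, seq_len=None):
        # Return the cached cos/sin tables truncated to the current length.
        return (
            self.cos_cached[:, :, :seq_len, ...].to(x.dtype),
            self.sin_cached[:, :, :seq_len, ...].to(x.dtype),
        )
```

With ratio=2 and max_position_embeddings=1024 as in the issue, the cache covers 2048 token positions condensed into the original 0-1023 range.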

From the script provided, I think LongChat is full SFT rather than LoRA, but the effective total batch size (batch_size * gradient_accum * num_gpus) is just 1. But Vicuna's original...
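For concreteness, this is the arithmetic the issue is doing; the per-device and accumulation values match the script flags quoted later in this thread, and `num_gpus = 1` is assumed for illustration:

```python
# Flags quoted from the training command in this thread.
per_device_train_batch_size = 1
gradient_accumulation_steps = 1
num_gpus = 1  # assumed for illustration

effective_batch_size = (
    per_device_train_batch_size * gradient_accumulation_steps * num_gpus
)
print(effective_batch_size)  # -> 1
```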

Is your xformers monkey patch compatible with the LLaMA model at its default context length (without the condensed rotary embedding)? You also said in another issue that xformers can be used on non-A100 GPUs,...
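For context, the xformers call such a monkey patch typically wraps is `memory_efficient_attention`, which dispatches to a kernel suited to the current GPU (which is why it also runs on non-A100 cards). A standalone sketch of that API, not the repository's patch itself; the shapes are illustrative:

```python
import torch
import xformers.ops as xops

# Illustrative shapes: (batch, seq_len, num_heads, head_dim), the layout
# memory_efficient_attention expects.
q = torch.randn(1, 2048, 32, 128, device="cuda", dtype=torch.float16)
k = torch.randn(1, 2048, 32, 128, device="cuda", dtype=torch.float16)
v = torch.randn(1, 2048, 32, 128, device="cuda", dtype=torch.float16)

# Causal (lower-triangular) masking, as used for autoregressive decoding.
out = xops.memory_efficient_attention(q, k, v, attn_bias=xops.LowerTriangularMask())
```

Since rotary embeddings are applied to q and k before attention, the attention kernel is in principle independent of whether the condensed variant is used.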

What do you recommend as an inference configuration: temperature, repetition penalty, etc.?
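No recommended values are given in this thread; as a hypothetical starting point (common chat-model settings, not an official LongChat recommendation), a Hugging Face generation config might look like:

```python
from transformers import GenerationConfig

# Hypothetical values, not an official LongChat recommendation.
gen_config = GenerationConfig(
    do_sample=True,
    temperature=0.7,         # lower -> more deterministic
    top_p=0.9,               # nucleus sampling cutoff
    repetition_penalty=1.1,  # >1.0 discourages repeated tokens
    max_new_tokens=512,
)
# Usage (model and inputs assumed to exist):
# output = model.generate(**inputs, generation_config=gen_config)
```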

It replies like this: ![xxxxxxxxxx](https://github.com/DachengLi1/LongChat/assets/207163/15d36ccd-db82-40c9-8636-32731f4742f2)

AMD? CPU? Single GPU? Is this all possible via FastChat?

I have 9 V100 16 GB GPUs, but training runs out of CUDA memory. The specific errors are as follows:

```
Formatting inputs...Skip in lazy mode
/home/chat_glm6b/anaconda3/envs/longeval/lib/python3.10/site-packages/torch/distributed/fsdp/_init_utils.py:295: UserWarning: FSDP is switching to use `NO_SHARD`...
```
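That warning is emitted when FSDP's process group has a world size of 1, in which case it behaves like plain DDP and does not shard parameters across the 9 GPUs, so per-GPU memory is not reduced. An illustrative sketch (with a stand-in layer rather than the real model) of requesting full sharding under a multi-process launch:

```python
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP, ShardingStrategy

# Launch one process per GPU, e.g.: torchrun --nproc_per_node=9 this_script.py
dist.init_process_group("nccl")
torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())

model = torch.nn.Linear(4096, 4096).cuda()  # stand-in for the real model

# FULL_SHARD splits parameters, gradients, and optimizer state across ranks;
# with a single-rank group, FSDP falls back to NO_SHARD, as the warning says.
fsdp_model = FSDP(model, sharding_strategy=ShardingStrategy.FULL_SHARD)
```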

When I ran:

```
python -m torch.distributed.run --nproc_per_node=2 longchat/train/fine_tune/train.py \
  --model_name_or_path /mnt/yuchao/open_model/longchat/longchat-13b-16k \
  --data_path /mnt/workspace/sft_data.json \
  --bf16 \
  --output_dir /mnt/yuchao/yuchao/longchat-13b-16k \
  --num_train_epochs 3 \
  --per_device_train_batch_size 1 \
  --per_device_eval_batch_size 4 \
  --gradient_accumulation_steps 1 \
  --evaluation_strategy no \
  --save_strategy steps \
  --save_steps 1000 ...
```

Hi, thanks for the great work! I'd like to know whether there is a web GUI for this LongChat model; it seems that the model worker in Vicuna does not...

Hoping for support for training models that are customized outside of transformers, such as Baichuan.