Daniel
Could you share which model is used for TTS? Can it be converted to ONNX or NCNN?
yes, you do.
> In the paper, it was mentioned that 48G of graphics memory can train 65B of LLaMA
>
> ```
> We present QLORA, an efficient finetuning approach that reduces...
> ```
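For context, the memory reduction in that quote comes from keeping the frozen base weights in 4-bit NF4 and training only LoRA adapters on top. A minimal sketch of that recipe with the `transformers`/`peft` APIs (model name and hyperparameters below are illustrative, not taken from this thread; see qlora.py for the full setup):

```python
# Minimal sketch of the QLoRA recipe: 4-bit NF4 base weights + LoRA adapters.
# Values are illustrative only.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",           # NormalFloat4 quantization
    bnb_4bit_use_double_quant=True,      # double quantization of the quant constants
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "huggyllama/llama-65b",              # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)
model = get_peft_model(model, LoraConfig(r=64, lora_alpha=16, task_type="CAUSAL_LM"))
```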
Is there an example for training on multiple GPUs?
Set `ddp_find_unused_parameters` to `False`.
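For anyone landing here, a minimal sketch of where that flag goes when training with the Hugging Face `Trainer` (all other values are placeholders, not from this thread):

```python
# Sketch: multi-GPU DDP fine-tuning. With gradient checkpointing enabled,
# DDP's unused-parameter search should be disabled, hence the flag below.
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="./output",
    per_device_train_batch_size=1,
    gradient_checkpointing=True,
    ddp_find_unused_parameters=False,   # the setting suggested above
)

# Launch across GPUs with, e.g.:
#   torchrun --nproc_per_node=8 qlora.py --ddp_find_unused_parameters False ...
```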
When I load another model I get:

```
ValueError: Target modules ['dense_h_to_4h', 'dense', 'dense_4h_to_h', 'query_key_value'] not found in the base model. Please check the target modules and try again.
```

Command:

```
python qlora.py --learning_rate 0.0001 --model_name_or_path huggyllama/llama-7b
```

What's wrong with my setting?
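Those module names (`query_key_value`, `dense`, `dense_h_to_4h`, `dense_4h_to_h`) belong to a BLOOM/Falcon-style architecture; LLaMA exposes different linear-layer names, so the same `target_modules` list will not match `huggyllama/llama-7b`. A hedged sketch of a LLaMA-compatible `LoraConfig` (hyperparameter values are illustrative):

```python
# Illustrative only: LoRA target modules must match the loaded architecture.
from peft import LoraConfig

llama_lora = LoraConfig(
    r=64,
    lora_alpha=16,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    # Standard LLaMA projection names in transformers:
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)
```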
Yes. I tested it with 8× A100 GPUs; it occupies 12G on each GPU and is still not fast, especially after the input length exceeds 500.
thanks:)
> @internlm-team You told us number of parameters at least :)

We present InternLM, a multilingual foundational language model with 104B parameters. InternLM is pre-trained on a large corpora with...