disperaller
> How to manually deploy the model to the GPU? I think you need to modify the code to move everything to the GPU.
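A minimal sketch of what "move everything to the GPU" usually means in PyTorch (the model and tensor names here are illustrative, not from the repo):

```python
import torch

# Pick the GPU if one is available, otherwise fall back to CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Moving the model moves all of its parameters and buffers to that device.
model = torch.nn.Linear(4, 2).to(device)

# Inputs must be moved too; a CPU tensor fed to a GPU model raises an error.
inputs = torch.randn(3, 4).to(device)

outputs = model(inputs)  # the forward pass now runs on the chosen device
```

The common pitfall is moving only the model (or only the inputs); both must end up on the same device.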
If this is how LLaMA 3 was pretrained, then in the SFT process, should we include these special tokens (, , etc.), i.e. unmask them in the attention_mask?
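For what it's worth, the usual SFT convention (a sketch with made-up token ids, not the actual LLaMA 3 vocabulary) is to keep special tokens *visible* in `attention_mask` and instead control the loss through `labels`, where `-100` is ignored by the cross-entropy loss:

```python
# Illustrative token ids: these are placeholders, not real LLaMA 3 ids.
pad_id, bos_id, eot_id = 0, 1, 2
input_ids = [bos_id, 11, 12, 13, eot_id, pad_id, pad_id]

# Attend to every real token, special tokens included; mask only padding.
attention_mask = [0 if t == pad_id else 1 for t in input_ids]

# Supervise content tokens (and typically the end-of-turn token, so the
# model learns to stop); ignore BOS and padding with the -100 label.
labels = [t if t not in (pad_id, bos_id) else -100 for t in input_ids]
```

So the question of "unmasking" is usually not about `attention_mask` at all; special tokens normally stay attended-to, and only the loss mask differs between setups.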
This issue is addressed by https://github.com/thunlp/OpenNRE/issues/312; however, when I tried installing transformers==3.4.0, a problem with Rust came up saying no Rust compiler was found. So, after a little searching, the following...
> @Arian-Akbari could you post some example output? I'm not too familiar with modelfusion, but I'm taking a look at the repo. > > One thing that may also help is...
Indeed, I ran into the same issue when running `from elmoformanylangs import Embedder`. Could someone help with this?
> [qwen-v1.5-14b-hf/LongBench_vcsum, qwen-v1.5-14b-hf/LongBench_narrativeqa, qwen-v1.5-14b-hf/LongBench_multifieldqa_zh, qwen-v1.5-14b-hf/LongBench_lsht, qwen-v1.5-14b-hf/LongBench_dureader, qwen-v1.5-14b-hf/LongBench_passage_retrieval_zh] Given this task information, it seems that you use the partitioner to allocate 4 tasks across 4 GPUs, so if you want to use 4 GPUs only for one...
When using vLLM, it keeps reporting a timeout and I don't know why. The upper part is the model settings and the lower part is the error. What is going on?
Same here with ZeRO-2: the -1 gets passed on to the next computation step, which expects a float tensor, and that causes the error.
> Same here with ZeRO-2: the -1 gets passed on to the next computation step, which expects a float tensor, and that causes the error. Setting overlap_comm to False resolved the issue for me.
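For anyone else hitting this, `overlap_comm` is a standard DeepSpeed ZeRO option; in a ZeRO-2 JSON config the relevant fragment would look like this (other fields omitted):

```json
{
  "zero_optimization": {
    "stage": 2,
    "overlap_comm": false
  }
}
```

Disabling it trades some communication/computation overlap for stability, so expect a possible slowdown.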
> This is a weird issue. I'll try it myself very soon. Could you please share your training log? Is the loss for more than 1 epoch normal? Hi, I...