wangshuai09 comments

Results 21 comments of


                                            wangshuai09

Training loss does not converge

Thanks for your reply! Here is the training loss of weight_cross_entropy where pos_weight = beta/(1-beta) passing to F.binary_cross_entropy_with_logits as weight param， ---------------------------------------------------------------------- ``` Fri Apr 16 17:54:21 2021 Epoch: 0...

Add training support and change lspci for Ascend NPU

If there are no video card and NPU, the `torch_command` will not change. https://github.com/AUTOMATIC1111/stable-diffusion-webui/blob/cf2772fab0af5573da775e7437e6acdca424f26e/modules/launch_utils.py#L318 It will use cpu as backend and here is screen shoot running with a downloaded embedding,...

Add training support and change lspci for Ascend NPU

Sorry for my misunderstanding. It will print error on screen using `elif eval "npu-smi info`. Your advise is so great.

[BUG] RuntimeError: NPU out of memory. Tried to allocate 268.00 MiB

This problem is that torch_npu don't support multi-card communication within a single process. And this will be fixed in the official torch-npu&cann release at the end of April. FYI https://github.com/huggingface/accelerate/issues/2368....

How can I use Multiple NPUs ?

FYI https://github.com/lm-sys/FastChat/issues/3237

[CANN] Add Ascend NPU backend

`Issue 6: slow!!` has been initially fixed by memory reuse, almost speedup 10x. Also, there are other space for optimization. Current inference speed: ![llama](https://github.com/ggerganov/llama.cpp/assets/25071151/6f1575b6-fea4-4bcb-babf-f22c1b9260c8)

wangshuai09

Training loss does not converge

Add training support and change lspci for Ascend NPU

Add training support and change lspci for Ascend NPU

[BUG] RuntimeError: NPU out of memory. Tried to allocate 268.00 MiB

How can I use Multiple NPUs ?

[CANN] Add Ascend NPU backend

[CANN] Add Ascend NPU backend

[CANN] Add Ascend NPU backend

train on own dataset

[CANN] Add Ascend NPU support