Wenxuan Tan comments

Results: 46 comments of Wenxuan Tan

Question about the test set of the GLUE benchmark

It's using exactly the evaluation set.

Module 'accuracy' doesn't exist on the Hugging Face Hub either.

It's most likely a network issue. The download failed when I tried inside Docker, but I successfully downloaded the metric outside it.
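If the metric cannot be downloaded because of the network, accuracy is simple enough to compute locally as a stopgap. A minimal sketch, assuming predictions and labels are equal-length sequences of class ids; `local_accuracy` is a hypothetical helper and not part of any Hugging Face library:

```python
from typing import Sequence


def local_accuracy(predictions: Sequence[int], references: Sequence[int]) -> float:
    """Fraction of predictions that equal their reference label.

    Hypothetical stand-in for a downloaded accuracy metric.
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if len(predictions) == 0:
        raise ValueError("cannot compute accuracy on empty input")
    # Count positions where the predicted label matches the reference label.
    correct = sum(1 for p, r in zip(predictions, references) if p == r)
    return correct / len(predictions)


if __name__ == "__main__":
    print(local_accuracy([1, 0, 1, 1], [1, 0, 0, 1]))
```

This only covers plain accuracy; other GLUE metrics would still need the real metric module.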

Does Colossal-AI support ROCm?

Not currently, but some collaboration with AMD is underway.

Jitting Error: can't pass bfloat16 as a tl.dtype to my kernel!

> To resolve this issue, you can try converting the Triton data type (tl.dtype) to a native Python data type before passing it to the kernel. Here's how you can...

Could not reproduce the results listed in your paper using a single 3090 card.

@Forence1999 could you share how you reproduced it? I only got 32.1 with the original hyperparameters. Thanks!

```
python qlora.py \
    --model_name_or_path huggyllama/llama-7b \
    --use_auth \
    --output_dir /fly/results/qlora \
    --logging_steps...
```

RuntimeError: CUDA error: CUBLAS_STATUS_NOT_INITIALIZED when calling `cublasCreate(handle)`

This is typically an out-of-memory error.
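One way to check this is to look at how much GPU memory is free before the first cuBLAS call. A minimal sketch, assuming PyTorch with CUDA support is installed; `has_headroom`, `report_gpu_memory`, and the 10% threshold are hypothetical and only illustrate the idea:

```python
def has_headroom(free_bytes: int, total_bytes: int,
                 min_free_fraction: float = 0.10) -> bool:
    """Return True if at least `min_free_fraction` of device memory is free.

    The 10% default is an arbitrary, illustrative threshold.
    """
    if total_bytes <= 0:
        raise ValueError("total_bytes must be positive")
    return free_bytes / total_bytes >= min_free_fraction


def report_gpu_memory() -> None:
    """Print free/total memory of the current CUDA device, if one is usable."""
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed; skipping GPU check")
        return
    if not torch.cuda.is_available():
        print("No CUDA device available")
        return
    # mem_get_info returns (free, total) in bytes for the current device.
    free_bytes, total_bytes = torch.cuda.mem_get_info()
    gib = 2 ** 30
    print(f"free: {free_bytes / gib:.2f} GiB / total: {total_bytes / gib:.2f} GiB")
    if not has_headroom(free_bytes, total_bytes):
        print("Very little free memory; cublasCreate may fail for lack of memory")


if __name__ == "__main__":
    report_gpu_memory()
```

If free memory is already close to zero when this runs, reducing the batch size or stopping other processes on the same GPU is worth trying first.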