Tianxing Wang

Results 6 issues of Tianxing Wang

I have a very strange problem for my clangd. It can provide intellgence for my cpp files correctly, but when it comes to .cu files, it don't work. Here's my...

bug

I am configuring the project on wsl2. when I load the CMakeList.txt in the fbgemm_gpu folder, It gives the following error. ``` Cannot get compiler information: Compiler exited with error...

Hello, I am reading your paper and code and I have some problems about the merge-path SpMM. (1) In my understanding, `limit[i]` meaning paritition i should start from row limit[i]...

hello, I' reading your paper and I wonder what is stored in nPtr[i]? Can you explain to me what target is in alg1 line13 `target = min(nPtr[i] +ePerGPU,nPtr[numNodes])` ? ![image](https://github.com/YukeWang96/MGG_OSDI23/assets/55205022/b55e6cb8-9d56-424f-ad6a-6e3bd96fad54)

Hi, thanks for your awesome demo of speculative sample. Some of your code maybe outdated in new version of transformer. In the `KVCacheModel` class, Bloom model' k cache shape is...

**Describe the bug** I'm running the BingBertSquad example in `DeepSpeedExamples/training/BingBertSquad`, I use the `google-bert/bert-large-uncased` model from hugging face. I use `bash run_squad_deepspeed.sh 4 ckpt/bert_large_uncased/pytorch_model.bin /dataset /output` to finetune model. In...

bug
training