Li Shuai issues

Results 5 issues of


                                            Li Shuai

About the performance

Thanks for your owesome work! Thanks for your opened code, it's very helpful for me! I reproduced the released code on the S3DIS dataset, but I only got 0.552240 test...

源码编译cmake时遇到Something Wrong here, this backward op (global_gather_grad)'s forward op (global_gather) does not exist.的问题

### 问题描述 Issue Description 源码编译过程中使用命令`time cmake .. -DPY_VERSION=3.10 -DWITH_GPU=ON -DWITH_TESTING=ON`遇到如下问题： ```bash -- commit: f8a40a7d3e -- branch: develop /home/sun/anaconda3/envs/paddle-dev/lib/python3.10/site-packages/yaml /home/sun/anaconda3/envs/paddle-dev/lib/python3.10/site-packages/jinja2 parse op yamls: - /home/sun/Desktop/ShuaiLi/LLM/Paddle/paddle/phi/ops/yaml/ops.yaml - /home/sun/Desktop/ShuaiLi/LLM/Paddle/paddle/phi/ops/yaml/inconsistent/dygraph_ops.yaml - /home/sun/Desktop/ShuaiLi/LLM/Paddle/paddle/phi/ops/yaml/backward.yaml - /home/sun/Desktop/ShuaiLi/LLM/Paddle/paddle/phi/ops/yaml/inconsistent/dygraph_backward.yaml...

status/new-issue

type/build

硕士研究生论文模板

硕士研究生学位论文模板：https://github.com/lishuai-97/BJFU_MasterThesis

[QUESTION] `--adam-beta2` in Mixtral 8x7B pretraining script

**Your question** The original implementation of Mixtral 8x7B sets `--adam-beta2` to 0.999 by default during pretraining. However, empirical observations reveal an apparent trade-off: using `β₂=0.999` consistently induces training instability (loss...

Multi-GPU Training

Hello, I have a question regarding multi-GPU training with the latest version of CityGS V2. When I used the `scripts/run_citygs_mc_aerial.sh` script to train the aerial view of MatrixCity, the training...

stale