Li Shuai
Li Shuai
Thanks for your owesome work! Thanks for your opened code, it's very helpful for me! I reproduced the released code on the S3DIS dataset, but I only got 0.552240 test...
### 问题描述 Issue Description 源码编译过程中使用命令`time cmake .. -DPY_VERSION=3.10 -DWITH_GPU=ON -DWITH_TESTING=ON`遇到如下问题: ```bash -- commit: f8a40a7d3e -- branch: develop /home/sun/anaconda3/envs/paddle-dev/lib/python3.10/site-packages/yaml /home/sun/anaconda3/envs/paddle-dev/lib/python3.10/site-packages/jinja2 parse op yamls: - /home/sun/Desktop/ShuaiLi/LLM/Paddle/paddle/phi/ops/yaml/ops.yaml - /home/sun/Desktop/ShuaiLi/LLM/Paddle/paddle/phi/ops/yaml/inconsistent/dygraph_ops.yaml - /home/sun/Desktop/ShuaiLi/LLM/Paddle/paddle/phi/ops/yaml/backward.yaml - /home/sun/Desktop/ShuaiLi/LLM/Paddle/paddle/phi/ops/yaml/inconsistent/dygraph_backward.yaml...
硕士研究生学位论文模板:https://github.com/lishuai-97/BJFU_MasterThesis
**Your question** The original implementation of Mixtral 8x7B sets `--adam-beta2` to 0.999 by default during pretraining. However, empirical observations reveal an apparent trade-off: using `β₂=0.999` consistently induces training instability (loss...
Hello, I have a question regarding multi-GPU training with the latest version of CityGS V2. When I used the `scripts/run_citygs_mc_aerial.sh` script to train the aerial view of MatrixCity, the training...