Ethan He
same problem
@gengwb the differences are listed in the [readme](https://github.com/yihui-he/caffe-pro/tree/a4f0a8735fee43a5b59f20ad4de4135467256e67)
Use `./caffe/build/tools/caffe train -solver temp/solver.prototxt -weights temp/3c_vgg.caffemodel -gpu $1` instead of `finetune.sh`. Does it work?
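If you want to script that call (for example, to try several GPUs), here is a minimal Python sketch. The helper name `build_train_command` is hypothetical; the binary path, solver, weights, and flags are the ones from the command above, with the GPU id taking the place of `$1`.

```python
import shlex


def build_train_command(gpu_id):
    """Return the caffe train invocation as an argument list.

    Hypothetical helper: the paths and flags are copied from the
    command in the comment above; only the GPU id is a parameter.
    """
    return [
        "./caffe/build/tools/caffe", "train",
        "-solver", "temp/solver.prototxt",
        "-weights", "temp/3c_vgg.caffemodel",
        "-gpu", str(gpu_id),
    ]


if __name__ == "__main__":
    # Print the command for GPU 0 so it can be inspected before running.
    print(shlex.join(build_train_command(0)))
```

The list form can be passed to `subprocess.run(...)` directly, which avoids shell quoting issues.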
I downloaded my code and ran it again, but I could not reproduce these problems.
They are the same. (Reply by email to lanka, October 26, 2017, on yihui-he/channel-pruning issue #42, "How can set cfgs for resnet?")
@Slyne could you move the doc to `docs/source/multimodal/mllm/video_neva.rst`?
@yaoyu-33
You can't use the interleaved schedule without pipeline parallelism. ![image](https://github.com/NVIDIA/Megatron-LM/assets/10027339/6e945b62-4a23-4779-829a-fe36c01c47ff)
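To make the constraint concrete, here is a rough Python sketch of the kind of check involved. This is not Megatron-LM's actual code: the function `check_schedule` and its parameter names are hypothetical, and it assumes that "interleaved" means more than one virtual pipeline stage per rank.

```python
def check_schedule(pipeline_parallel_size, virtual_pipeline_size=None):
    """Return the schedule name, or raise if the combination is invalid.

    Hypothetical illustration: an interleaved schedule is assumed to be
    requested when virtual_pipeline_size is greater than 1.
    """
    interleaved = virtual_pipeline_size is not None and virtual_pipeline_size > 1
    if interleaved and pipeline_parallel_size <= 1:
        # Interleaving splits each pipeline stage further, so there must
        # be a pipeline (more than one stage) to begin with.
        raise ValueError(
            "interleaved schedule requires pipeline parallel size > 1"
        )
    return "interleaved" if interleaved else "non-interleaved"
```

With this sketch, `check_schedule(4, 2)` is accepted, while `check_schedule(1, 2)` raises a `ValueError`.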
When you use `--use-mcore-models`, you cannot use local. `--use-flash-attn` decides whether to use the OSS flash attention implementation or the cuDNN implementation.