Ethan He

Results 36 comments of Ethan He

@gengwb the differences are listed in the [readme](https://github.com/yihui-he/caffe-pro/tree/a4f0a8735fee43a5b59f20ad4de4135467256e67)

Use `./caffe/build/tools/caffe train -solver temp/solver.prototxt -weights temp/3c_vgg.caffemodel -gpu $1` instead of `finetune.sh`. Does it work?

I downloaded my code and ran it again, but I could not reproduce these problems.

They are the same. (Reply by email to lanka on yihui-he/channel-pruning #42, "How can set cfgs for resnet?")...

@Slyne could you move the doc to `docs/source/multimodal/mllm/video_neva.rst`?

You can't use the interleaved schedule without pipeline parallelism. ![image](https://github.com/NVIDIA/Megatron-LM/assets/10027339/6e945b62-4a23-4779-829a-fe36c01c47ff)
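
To make the constraint concrete, here is a rough sketch of the kind of check involved. It is not Megatron-LM's actual code: `ParallelConfig`, its field names, and `validate_schedule` are hypothetical names made up for illustration, and the exact threshold a given Megatron-LM version enforces may differ.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ParallelConfig:
    # Hypothetical container; field names are illustrative, not Megatron-LM's API.
    pipeline_parallel_size: int = 1
    # Number of virtual (interleaved) stages per pipeline rank; None disables interleaving.
    virtual_pipeline_stages: Optional[int] = None


def validate_schedule(cfg: ParallelConfig) -> None:
    """Raise ValueError if interleaving is requested without pipeline parallelism."""
    interleaved = cfg.virtual_pipeline_stages is not None
    if interleaved and cfg.pipeline_parallel_size <= 1:
        raise ValueError(
            "interleaved schedule requires pipeline parallelism "
            f"(pipeline_parallel_size={cfg.pipeline_parallel_size})"
        )


# Accepted: interleaving on top of 4 pipeline stages.
validate_schedule(ParallelConfig(pipeline_parallel_size=4, virtual_pipeline_stages=2))
# Accepted: no pipeline parallelism and no interleaving.
validate_schedule(ParallelConfig())
```

Requesting interleaving with `pipeline_parallel_size=1` raises `ValueError` in this sketch, which mirrors the situation described above.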

When you use `--use-mcore-models`, you cannot use local. `--use-flash-attn` decides whether to use the OSS flash attention implementation or the cuDNN implementation.