zzningxp
zzningxp
SoftmaxWithLossLayer::Forward_cpu in src/caffe/layers/softmax_loss_layer.cpp is NOT_IMPLEMENTED, while ./src/caffe/layers/box_annotator_ohem_layer.cpp ./src/caffe/layers/psroi_pooling_layer.cpp ./src/caffe/layers/smooth_L1_loss_layer.cpp are the same. Is there any plan to realize these CPU part?
The bottleneck of the first block is 1/4 of the channels number after channels concat. Or, one of results: 0.25x shufflenet will get a wrong 13.5 bottleneck.
in MXShuffleNet/image-classification/symbols/shufflenet.py (https://github.com/ZiyueHuang/MXShuffleNet/blob/master/image-classification/symbols/shufflenet.py ) "bottleneck_channels = out_channels // 4" should be at the place before "out_channels -= in_channels"
``` 2023:11:28-16:35:45:(35980) |CCL_WARN| did not find MPI-launcher specific variables, switch to ATL/OFI, to force enable ATL/MPI set CCL_ATL_TRANSPORT=mpi 2023:11:28-16:35:45:(35980) |CCL_WARN| could not get local_idx/count from environment variables, trying to get...
phimoe? ``` PhimoeModel( (embed_tokens): Embedding(51200, 768, padding_idx=50258) (layers): ModuleList( (0-31): 32 x PhimoeDecoderLayer( (self_attn): PhimoeSdpaAttention( (q_proj): Linear(in_features=768, out_features=768, bias=False) (k_proj): Linear(in_features=768, out_features=192, bias=False) (v_proj): Linear(in_features=768, out_features=192, bias=False) (o_proj): Linear(in_features=768, out_features=768,...
I configured the environment according to the configuration method in the readme and executed `scripts/gpt/merge-by-usage-frequency-weighted.sh` `scripts/gpt/permute-moe.sh` The error is as follows: It seems that there is a lack of the...
### Checklist - [x] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest version....