piano_123 comments

Results 15 comments of


                                            piano_123

[FEA] LinearCombinationSilu epilogue

using ThreadblockShape0 = cutlass::gemm::GemmShape; using WarpShape0 = cutlass::gemm::GemmShape; using InstructionShape = cutlass::gemm::GemmShape; I have a doubt how to determine the values of these three variables（ThreadblockShape，WarpShape， InstructionShape） and what are they...

[FEA] LinearCombinationSilu epilogue

@hwu36 it works，thanks. And I have a doubt: for input Activation=f16:nhwc --Filter=f16:nhwc --Output=f16 --accumulator-type=f16, the input and output channel requirements are multiples of 8, and for channels that are not...

[FEA] LinearCombinationSilu epilogue

For the situation:input Activation=f16:nhwc --Filter=f16:nhwc --Output=f16 --accumulator-type=f16 , Does setting accumulator-type=f16 cause overflow?

[FEA] LinearCombinationSilu epilogue

Hi @hwu36 ,I still have a doubt about Silu. In Example 17 ,under the same configuration except EpilogueOp, I just replaced EpilogueOp from LinearCombination to LinearCombinationSilu, and the run time...

[FEA] LinearCombinationSilu epilogue

@hwu36 Hi，does cutlass support dethpthwise convlution? I haven't seen the introduction of dethwise in the documentation, sample and header files. I set group=inchannel=outchannel in the conv problem size in example...

[FEA] LinearCombinationSilu epilogue

@hwu36 “You can set the channel number as 1 just like what you did” means set group=inchannel=outchannel ? Do you plan to do efficient depthwise conv in the future?

evaluate_model and train_model functions not available in the video tutorials

@quic-ssiddego How can I run successfully the script in the video and where I can find the script?

where is the temporal fusion code part in petr_v2?

@exiawsh Hi ， would you like to explain more detail? In PETRV2, the loaded data camera=12, after backbone processing, is equivalent to the 2d feature map concat of the t-1...

where is the temporal fusion code part in petr_v2?

@exiawsh When i run test.py,https://github.com/megvii-research/PETR/blob/e48faec8aa24bdd14f95692428ddc4982f2f71cb/projects/mmdet3d_plugin/datasets/pipelines/loading.py#L114 choices=14,and this is not key frame , so I wondering if it can directly use the backbone results of the previous key frame, saving half...

where is the temporal fusion code part in petr_v2?

@exiawsh Thanks a lot！