Zero Zeng comments

Results 575 comments of


                                            Zero Zeng

Could not find any implementation for node /model.0/conv/Conv + PWN(PWN(/model.0/act/Sigmoid), /model.0/act/Mul)

Could you please share a reproduce? Thanks!

Could not find any implementation for node /model.0/conv/Conv + PWN(PWN(/model.0/act/Sigmoid), /model.0/act/Mul)

I can also reproduce the issue on TRT 8.6 on x86. But in my test it's been fixed in TRT 10.

Could not find any implementation for node /model.0/conv/Conv + PWN(PWN(/model.0/act/Sigmoid), /model.0/act/Mul)

9.2 also pass.

Could not find any implementation for node /model.0/conv/Conv + PWN(PWN(/model.0/act/Sigmoid), /model.0/act/Mul)

Sorry for the late reply, I'm checking internally.

Could not find any implementation for node /model.0/conv/Conv + PWN(PWN(/model.0/act/Sigmoid), /model.0/act/Mul)

Could you please try mark `/model.0/conv/Conv ` or `/model.0/act/Sigmoid` as network output so that the layer fusion can be break? this can be done quickly with `polygraphy run model.onnx --mark...

Do tensorrt 9.2 support flash attention v2

I believe it's supported OOTB, you can export the model to onnx and check.

Do tensorrt 9.2 support flash attention v2

yes. even speed-up

Do tensorrt 9.2 support flash attention v2

OOTB should be supported. You can also use https://github.com/NVIDIA/TensorRT-LLM

Do tensorrt 9.2 support flash attention v2

@nvpohanh I'm kind of not sure about what the best practice for such issue after TRT-LLM released, could you please shed some light here :-D

Customized TensorRT operator Col2Im, but parsing failed of TensorRT9.2

Could you please try recompile the plugin with TRT 9.2? Thanks!