Fridah-nv comments

Results 20 comments of


                                            Fridah-nv

feat:[AutoDeploy] E2E build example for llama4 VLM

/bot run --disable-fail-fast --stage-list "DGX_H100-4_GPUs-PyTorch-[Post-Merge]"

feat: [AutoDeploy] DeepseekV3 e2e support with sdpa attention

> TODO: DeepseekV3 weights are in FP8. Need to handle this case to run e2e example with weights I think we currently don't have example support for quantized model not...

feat: [AutoDeploy] DeepseekV3 e2e support with sdpa attention

I wonder if this change enables `deepseek-ai/DeepSeek-R1` to run as well?

feat:[AutoDeploy] utilize torch._inductor.pattern_matcher to write pattern matcher

> when we have a pattern matched node from a previous pattern matcher and then have another pattern matcher that uses that pattern matched node as input, there is an...

feat:[AutoDeploy] utilize torch._inductor.pattern_matcher to write pattern matcher

> def _interleaved_rope_pattern2(q, k, cos, sin, unsqueeze_dim=1): b, h, s, d = q.shape q = q.view(b, h, s, d // 2, 2).transpose(4, 3).reshape(b, h, s, d) b, h, s, d...

feat:[AutoDeploy] utilize torch._inductor.pattern_matcher to write pattern matcher

merged in https://github.com/nv-auto-deploy/TensorRT-LLM/pull/7

fix: [AutoDeploy] Update README.md

/bot run

fix: [AutoDeploy] Update README.md

/bot skip

fix: [AutoDeploy] Update README.md

/bot skip

fix: [AutoDeploy] Update README.md

/bot skip --comment "minor document update"