Loki

Results: 13 issues from Loki

*GitHub Issue #, if available:* Note: If merging this PR should also close the associated Issue, please also add that Issue # to the Linked Issues section on the right....

build
huggingface
Size:XS

**Introduction**

I'm proposing a caching strategy for TRT-LLM to streamline the process of re-compiling engines after fine-tuning. This strategy aims to significantly reduce build times and improve overall efficiency. I...

## Overview

This PR adds 2 new flags to `trtllm-build` to support debugging. `--visualize-network` dumps the finalized TRT Network as SVG files for visual analysis. `--dry-run` runs through all the...