Martin Marciniszyn Mehringer
Note that the [BeakerX](https://github.com/twosigma/beakerx) Scala kernel supports this feature. Maybe some parts of their implementation could be reused. 
The assertion is already gone in the `main` branch.
We do not have support for that in the runtime at the moment. Is this something that could be handled inside the engine, @QiJune?
I am not in favor of having function parameter defaults that change depending on the environment. These should be compile-time constants. I suggest changing `run.py` instead so that it...
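A minimal sketch of the distinction, using hypothetical names (`generate`, `EXAMPLE_MAX_TOKENS`, `main`) rather than the actual `run.py` code. In Python, a default argument is evaluated once, when the function is defined, so a default read from the environment is silently baked into the signature. The alternative shown keeps the default constant and lets the calling script resolve the environment:

```python
import os

# Hypothetical names throughout; this is not the actual run.py code.

# Discouraged: the default is read from the environment once, when the
# function is defined, so the same signature behaves differently per machine.
def generate_env_default(max_tokens=int(os.environ.get("EXAMPLE_MAX_TOKENS", "32"))):
    return max_tokens


DEFAULT_MAX_TOKENS = 32  # fixed constant, identical in every environment


# Preferred: the library function has a constant default.
def generate(max_tokens=DEFAULT_MAX_TOKENS):
    return max_tokens


# The calling script (the role run.py would play here) resolves the
# environment and passes the value explicitly.
def main(env):
    max_tokens = int(env.get("EXAMPLE_MAX_TOKENS", DEFAULT_MAX_TOKENS))
    return generate(max_tokens=max_tokens)
```

With this split, `generate()` always means the same thing, and any environment-specific behavior is visible at the call site.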
@Funatiq, could you please take a look at the PR?
@Marks101, the logits processor is supported on `ModelRunnerCppExecutor`: https://github.com/NVIDIA/TensorRT-LLM/blob/main/tensorrt_llm/runtime/model_runner_cpp.py#L48 Could you try that please?
Thanks for the feedback, @shangshng. It should be supported in the Python bindings of the Executor API. @dcampora, could you please add support to `ModelRunnerCpp`? @Marks101, you can use the...
@trevor-m, could you please review @AlessioNetti's feedback?
/bot run --skip-test