Martin Marciniszyn Mehringer

Results 15 comments of Martin Marciniszyn Mehringer

Note that the [BeakerX](https://github.com/twosigma/beakerx) Scala kernel supports this feature. Maybe some parts of their implementation could be reused. ![TabCompletion](https://user-images.githubusercontent.com/11665257/60190413-e3d9cf80-9832-11e9-9fb4-7f88551390d1.PNG)

The assertion is already gone in the `main` branch.

We do not have support in the runtime for that at the moment. Is this something that could be handled inside the engine, @QiJune ?

I am not in favor of having function parameter defaults that change depending on the environment. These should be compile time constants. I suggest changing `run.py` instead so that it...

@Marks101, the logits processor is supported on `ModelRunnerCppExecutor`: https://github.com/NVIDIA/TensorRT-LLM/blob/main/tensorrt_llm/runtime/model_runner_cpp.py#L48 Could you try that please?

Thanks for the feedback @shangshng. It should be support in the Python bindings of the Executor API. @dcampora, could you please add support to `ModelRunnerCpp`? @Marks101, you can use the...

@trevor-m , could you please review @AlessioNetti's feedback?