Ed Henry
Using the [`inflight_batcher_llm`](https://github.com/triton-inference-server/tensorrtllm_backend/tree/main/all_models/inflight_batcher_llm) from [tensorrtllm_backend](https://github.com/triton-inference-server/tensorrtllm_backend/tree/main), along with some modifications to the `preprocessing` model and tokenizer configurations, I was able to get the model functional within the TensorRT-LLM backend. This...
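For reference, the tokenizer side of the change looked roughly like the sketch below. This is a minimal illustration based on the layout of the stock `inflight_batcher_llm` preprocessing model (a Triton Python backend model that reads a `tokenizer_dir` parameter from its `config.pbtxt`); it is not a verbatim copy of my modifications.

```python
# Sketch of the tokenizer-loading portion of the preprocessing model's model.py.
# Assumes the stock layout where tokenizer_dir is a config.pbtxt parameter.
import json

from transformers import AutoTokenizer


class TritonPythonModel:
    def initialize(self, args):
        model_config = json.loads(args["model_config"])
        tokenizer_dir = model_config["parameters"]["tokenizer_dir"]["string_value"]

        # Load the model's own tokenizer rather than the default one, and
        # allow custom tokenizer classes shipped with the checkpoint.
        self.tokenizer = AutoTokenizer.from_pretrained(
            tokenizer_dir,
            padding_side="left",
            trust_remote_code=True,
        )
        # Some checkpoints ship without a pad token; fall back to EOS.
        if self.tokenizer.pad_token is None:
            self.tokenizer.pad_token = self.tokenizer.eos_token
```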
I ran into this issue and modified my pipelines and the plugin to accommodate. Below is a summary of what I've done.

1. I ported my pipelines to use [modular_pipelines](https://docs.kedro.org/en/stable/nodes_and_pipelines/modular_pipelines.html)....
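As a rough illustration of the modular pipeline port mentioned in step 1, the change amounts to wrapping an existing pipeline with Kedro's `pipeline()` factory and giving it a namespace. The node function and dataset names here are hypothetical placeholders, not my actual pipeline:

```python
# Sketch of wrapping an existing pipeline as a modular (namespaced) pipeline.
from kedro.pipeline import Pipeline, node, pipeline


def preprocess(raw_data):
    # Placeholder transformation.
    return raw_data


def base_pipeline() -> Pipeline:
    return pipeline(
        [
            node(preprocess, inputs="raw_data", outputs="preprocessed_data"),
        ]
    )


def create_pipeline(**kwargs) -> Pipeline:
    # Namespacing isolates dataset names per pipeline instance, so the same
    # pipeline can be reused against different catalog entries.
    return pipeline(
        base_pipeline(),
        namespace="training",
        inputs={"raw_data"},  # keep the shared input un-namespaced
    )
```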
Would this also be the root cause of the issue I see here, where I can trace the entire set of calls end-to-end, but when using them as part...
I can confirm it was related to how I was structuring some of my objects. Apologies for jumping in on this issue as it isn't related!

> I've never seen...
Just ran into this issue again over the last few weeks and want to give this a +1. :)