Eero Tamminen comments

Results: 726 comments by Eero Tamminen

tgi gaudi fails with health test in ChatQnA

> Thanks a lot. I extended it to 40 min, but unfortunately shard preparation hasn't finished within this time if I deploy the TGI service as kata-qemu-tdx (with TDX protection). Any hint...

tgi gaudi fails with health test in ChatQnA

> The TD VM (kata-qemu-tdx) pod is created without persistent storage, so when deploying a new TGI pod, it has to download the model data from the network. I assume TDX is used for...

[ChatQnA] TGI Service fail on a system with only 1 Gaudi card.

I haven't tried using Gaudis (nor Docker-compose), but thought of a few possible issues... Based on your error output, sharding is enabled. By default, TGI tries to use all _available_ devices,...

[ChatQnA] TGI Service fail on a system with only 1 Gaudi card.

> The problem here is that TEI and TGI seem to compete with each other for the single Gaudi card, and TGI failed with the error message.

Ah,...

[ChatQnA] TGI Service fail on a system with only 1 Gaudi card.

@louie-tsai Please don't assign things to me as I'm not a developer in this project (just another user testing it).

Docker proxy settings

> If you could, please provide a PR for the README and describe the step. Thanks

Sorry, that thing is in so many files [1] that cleaning it up is way too large...

[SearchQnA] [ChatQnA] TEI service config for 1-card Gaudi scenario

> to move TEI embedding microservice to CPU

Why? Is TEI-embedding Gaudi utilization too low for it to make sense, or is there some other reason?

Empty/missing Kubernetes securityContexts

> Could you provide more information about what issue the empty securityContexts cause?

Such pods cannot be run in clusters with stricter pod security policies (see the "pod-security-standards" link)....

Empty/missing Kubernetes securityContexts

Thanks, the merged PR looks good, but there are a few things that could be improved:

* `/mnt` is not a good host mount point. Directories mounted from the host should be...

Empty/missing Kubernetes securityContexts

> PR [opea-project/GenAIInfra#153](https://github.com/opea-project/GenAIInfra/pull/153) should have resolved this

Yes, looks good! Any idea when those changes will also reach this (`GenAIExamples`) repository?