Eero Tamminen
> @eero-t please create a separate issue(feature request) in our GitHub so we can discuss it internally

Done: https://github.com/intel/compute-runtime/issues/787
> It can be enabled when OS is **Ubuntu 22.04**, but fails when we downgraded to 20.04.

That's expected. Ubuntu 20.04 kernel and compute driver versions:
* https://packages.ubuntu.com/source/focal/intel-compute-runtime
* https://github.com/intel/compute-runtime/releases/tag/20.13.16352...
Please file your issue with whatever project provides the `sysmon` tool. (This project is for the Level-Zero frontend API implementation, the NULL driver, and the validation + tracing layers.)
And close this bug (I cannot).
> `ValueError: CPU device only supports float32 dtype`

=> Check that the invoked container actually includes (writable) Habana devices.
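One way to run that check from inside the container is sketched below. It is a minimal sketch, not an official Habana tool: the device-node patterns (`/dev/accel/accel*` and `/dev/hl*`) are assumptions about where the driver exposes its nodes and may differ between driver versions, and `writable_devices` is a hypothetical helper name.

```python
import glob
import os

# ASSUMPTION: these glob patterns are guesses at where the accelerator driver
# exposes its device nodes; adjust them to match your driver version.
ASSUMED_PATTERNS = ("/dev/accel/accel*", "/dev/hl*")


def writable_devices(patterns=ASSUMED_PATTERNS):
    """Return the sorted paths matching `patterns` that the current user can read and write."""
    found = []
    for pattern in patterns:
        for path in glob.glob(pattern):
            # os.access checks permissions for the current (real) user ID.
            if os.access(path, os.R_OK | os.W_OK):
                found.append(path)
    return sorted(found)


if __name__ == "__main__":
    devices = writable_devices()
    if devices:
        print("Writable accelerator device nodes:", ", ".join(devices))
    else:
        print("No writable accelerator device nodes found in this container.")
```

If the script reports no writable nodes, the framework has most likely fallen back to the CPU device, which would be consistent with the `float32`-only error quoted above.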
> Useful resources:
>
> * LLM Trainer implementation in [the Kubeflow Training V1](https://github.com/kubeflow/training-operator/blob/master/sdk/python/kubeflow/trainer/hf_llm_training.py)

This link is broken: the whole `python` directory hierarchy is missing from the given repo.
> Do you have plan to submit PR to fix this?

@xiguiw No. (Fixing this could be a good "beginner" / "first time" task PR.)
> PS. All TEI image references are for 1.5.0 version, i.e. consistent. But somewhat out of date. `1.5.0` was released last summer, whereas latest `tei-gaudi` release is `1.5.3` (and "GenAIComps"...
GenAIComps CPU/rocm TGI is now at a consistent version, but this repo is not quite done yet; there are still a lot of discrepancies. While most are now on TGI 2.4.x, some references to...
> And due to known issues, ChatQnA and AvatarChatbot may not be updated, is it ok?

Those can be updated in a separate PR after their issues have been...