Keming
> To load a saved `my_profile.json` in the profiler with working symbols, you need to use `samply load my_profile.json`.
>
> You're not the first person to run into...
I cannot reproduce this on Gitpod. Can you provide your `build.envd` file?
You need to install the NVIDIA GPU Operator in the cluster.
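For reference, installing the operator with Helm typically looks like the following sketch. The repo and chart names follow NVIDIA's published instructions, but the exact flags and namespace are assumptions you should adjust for your cluster and operator version:

```shell
# Add NVIDIA's Helm repository and install the GPU Operator
# into its own namespace. Flags may vary between operator versions.
helm repo add nvidia https://helm.ngc.nvidia.com/nvidia
helm repo update
helm install --wait gpu-operator \
  -n gpu-operator --create-namespace \
  nvidia/gpu-operator
```

After the operator pods are running, GPU nodes should advertise the `nvidia.com/gpu` resource to the scheduler.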
I don't have experience with Azure AKS. You can contact their customer support.
No. It's a standard CUDA image. You can try the base CUDA image.
I guess it's related to the CUDA image version?
The main LLM inference code is in https://github.com/tensorchord/modelz-llm/blob/main/src/modelz_llm/model.py. To add a new model, you need to check https://github.com/tensorchord/llmspec/blob/main/llmspec/model_info.py and add the corresponding Docker image in this repo.
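Roughly, adding a model means registering its metadata in `model_info.py` and pointing it at a published image. This is only an illustrative sketch of the shape of that change: the `ModelSpec` class, its field names, and all values below are assumptions, not the actual llmspec schema — check the real `llmspec/model_info.py` for the true structure.

```python
from dataclasses import dataclass


# Hypothetical stand-in for an llmspec model entry; the class and
# field names are illustrative, not the real llmspec API.
@dataclass
class ModelSpec:
    name: str          # model identifier exposed by the API
    hf_repo: str       # Hugging Face repo to load weights from
    docker_image: str  # image published in the modelz-llm repo


# Registering a new model would then amount to adding one entry:
NEW_MODEL = ModelSpec(
    name="my-new-llm",                       # assumption: placeholder name
    hf_repo="org/my-new-llm",                # assumption: placeholder repo
    docker_image="modelzai/llm-my-new-llm",  # assumption: placeholder image
)
```

The corresponding Docker image then needs to exist in the modelz-llm repo so the entry resolves to something deployable.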
Check the issue https://github.com/huggingface/transformers/issues/22222
@xieydd I tried this example but it didn't work. I'm not sure how to make it work when the remote has the pgvector extension `vector` while the local only has the pgvecto.rs extension...
This requires https://github.com/tensorchord/pgvecto.rs-enterprise/pull/8. The latency is mainly dominated by network latency; when testing within AWS us-west-2a, it is a few milliseconds.

### remote

```sql
create extension vectors;
set...
```