guoyande
Physical deletion is under development.
This feature is not currently supported. We will evaluate whether it is necessary; if the value is high enough, we will schedule it for development.
string, int, long, float, and double are scalar field types; vector is the vector field type. Scalar fields support filtering, while vector fields are used for similarity search.
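To make the scalar/vector split concrete, here is a minimal sketch of a table schema mixing the two kinds of fields. The field names and the overall JSON-like layout are illustrative assumptions, not the exact engine API:

```python
# Hypothetical schema sketch: scalar fields are filterable,
# the vector field is used for similarity search.
# Names ("demo_space", "title", ...) are illustrative only.
schema = {
    "name": "demo_space",
    "fields": [
        {"name": "title", "type": "string", "index": True},        # scalar: filterable
        {"name": "price", "type": "float", "index": True},         # scalar: filterable
        {"name": "views", "type": "long", "index": True},          # scalar: filterable
        {"name": "embedding", "type": "vector", "dimension": 128}, # vector: similarity search
    ],
}

SCALAR_TYPES = {"string", "int", "long", "float", "double"}
scalar_fields = [f["name"] for f in schema["fields"] if f["type"] in SCALAR_TYPES]
vector_fields = [f["name"] for f in schema["fields"] if f["type"] == "vector"]
print(scalar_fields)  # usable in filter conditions
print(vector_fields)  # usable as similarity-search targets
```

A query would then filter on the scalar fields (e.g. price range) and rank the filtered candidates by vector distance on `embedding`.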
There is no good way around this; re-import the data.
You can build with the guoyande/gamma:scann image, which already has bazel, gcc9, and clang8 installed. You then need to manually install Python 3.7/3.8 and run `pip install tensorflow~=2.5.0`. Finally, build gamma as usual with the scann option enabled. Note that the scann build downloads code from GitHub, so make sure network access is working.
The documentation is unreadable.
I have the same problem. How did you solve it? Requests to the "localhost:8000/v2/models/tensorrt_llm_bls/generate" endpoint succeed. Docker image: nvcr.io/nvidia/tritonserver:24.08-trtllm-python-py3. GPU: V100-SXM2-32GB, Driver Version: 550.90.12, CUDA Version: 12.4.
I solved it. The root cause was a version mismatch between "tensorrt-llm" and "tensorrtllm_backend".
https://github.com/baidu/braft/pull/491 — brpc was not linked against pthread at link time; I have submitted a PR.
https://forums.developer.nvidia.com/t/tensorrt-inference-api-that-open-clip-vit-l-14-is-slowing-down/309551/3 My conversion succeeded: "ViT-L/14" ---> .onnx ---> .trt. But inference with the TensorRT framework is slower. Is this a normal phenomenon? The link above has more details.