Tamas Bela Feher comments

Results: 82 comments of Tamas Bela Feher

Add raft-ann-bench script to generate synthetic dataset

```bash
python -m raft-ann-bench.generate_dataset --rows 1000000 --cols 128 --dtype float32 dataset/base.fbin
# After the dataset is generated, you can create query and ground truth files
python -m raft-ann-bench.generate_groundtruth dataset/base.fbin --output=dataset...
```
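For readers who want to see what such a synthetic `.fbin` file might contain, here is a minimal sketch. It assumes the common `.fbin` layout (two little-endian `uint32` values for the row and column counts, followed by row-major `float32` data). The helpers `write_fbin` and `read_fbin_header` are hypothetical names used only for illustration and are not part of raft-ann-bench. The script in the PR may work differently.

```python
import os
import random
import struct
import sys
import tempfile
from array import array


def write_fbin(path, rows, cols, seed=0):
    """Write a rows x cols matrix of uniform random float32 values.

    Assumed layout: uint32 rows, uint32 cols (little-endian),
    then rows * cols float32 values in row-major order.
    """
    rng = random.Random(seed)
    with open(path, "wb") as f:
        f.write(struct.pack("<II", rows, cols))
        for _ in range(rows):
            row = array("f", (rng.random() for _ in range(cols)))
            if sys.byteorder != "little":
                row.byteswap()  # keep the file little-endian on any host
            row.tofile(f)


def read_fbin_header(path):
    """Return (rows, cols) read from the 8-byte header."""
    with open(path, "rb") as f:
        return struct.unpack("<II", f.read(8))


if __name__ == "__main__":
    out = os.path.join(tempfile.mkdtemp(), "base.fbin")
    write_fbin(out, rows=1000, cols=128)
    print(read_fbin_header(out), os.path.getsize(out))
```

Writing one row at a time keeps memory use small, which matters for large row counts such as the 1000000 used in the command above.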

Re-enable IVF random sampling

Note this will only compile once #2155 is merged.

Scaling workspace resources

Thanks Artem for proposing this solution. On one hand, it is nice to have a secondary workspace allocator to handle large allocations. I still need to think about this. An...

Scaling workspace resources

Thanks Artem for the update! It is a nice idea to have an extra memory resource that we can use for potentially host-memory-backed large temporary allocations. This can...

[FEA] IVF index building with pinned H2D transfer

Tagging @abc99lr who plans to work on this, and @achirkin for visibility.

[FEA] IVF-Flat optimize loading cluster data for large batch search

This issue shall be implemented as a follow-up to #2169, because that PR changes a few details in the IVF-Flat fine search.

[Bug/Feature Request] - TF-TensorRT to support “string” datatype

Hi @vilmara, thanks for reporting the error. Normally TF-TRT should exclude string nodes from the conversion, and convert the rest of the model. The error message suggests that there is...

[FEA] Support reduced precision in balanced k-means

The k-means clustering implementation calls [predict](https://github.com/rapidsai/raft/blob/branch-23.12/cpp/include/raft/cluster/detail/kmeans_balanced.cuh#L672) multiple times during its iteration. The [FusedL2NN](https://github.com/rapidsai/raft/blob/637019e676831c3d38fde4c6e3dfbcae066e1c23/cpp/include/raft/cluster/detail/kmeans_balanced.cuh#L117) operation is used in predict, which under the hood would call [CUTLASS](https://github.com/rapidsai/raft/blob/branch-23.12/cpp/include/raft/distance/detail/fused_l2_nn.cuh#L347) to enable 3xTF32 computations. This...
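To illustrate the precision trade-off behind 1xTF32 versus 3xTF32, here is a small CPU-side sketch. It assumes TF32 keeps 10 explicit mantissa bits of an FP32 value and emulates the conversion by truncation (real hardware conversion may round instead). The 3xTF32 variant splits each operand into a "big" and a "small" TF32 part and sums three partial products. All function names are hypothetical, and accumulation is done in Python doubles for simplicity; this is not the CUTLASS or RAFT implementation.

```python
import struct


def to_f32(x):
    """Round a Python float to the nearest float32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def to_tf32(x):
    """Emulate TF32 by clearing the low 13 of the 23 float32 mantissa bits."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    bits &= 0xFFFFE000
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def split_tf32(x):
    """Split x into a TF32 'big' part and a TF32 'small' remainder."""
    big = to_tf32(x)
    small = to_tf32(x - big)
    return big, small


def dot_1xtf32(a, b):
    """Dot product with one TF32 multiplication per element pair."""
    return sum(to_tf32(x) * to_tf32(y) for x, y in zip(a, b))


def dot_3xtf32(a, b):
    """Dot product with three TF32 multiplications per element pair."""
    total = 0.0
    for x, y in zip(a, b):
        xb, xs = split_tf32(x)
        yb, ys = split_tf32(y)
        # Add the small cross terms first, then the dominant term.
        total += xs * yb + xb * ys + xb * yb
    return total


if __name__ == "__main__":
    a = [to_f32(v) for v in (1 / 3, 2 / 3, 0.1, 0.7)]
    b = [to_f32(v) for v in (0.9, 1 / 7, 0.3, 0.6)]
    ref = sum(x * y for x, y in zip(a, b))
    print("1xTF32 abs error:", abs(dot_1xtf32(a, b) - ref))
    print("3xTF32 abs error:", abs(dot_3xtf32(a, b) - ref))
```

The 3xTF32 form recovers most of the FP32 accuracy at the cost of three multiplications per product, which is why a 1xTF32 path can be faster when its accuracy is sufficient.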

[FEA] Support reduced precision in balanced k-means

> If we find that the 1xTF32 path is enough in terms of accuracy and is better performing, we will keep 1xTF32 as a default path for FP32 ANN instead of current...

Running multiple TensorRT-optimized models in Tensorflow

Thanks for the detailed report. It is a valid use case to have multiple models with multiple pre-built engines. We seem to have a problem with the way the engines...