lance icon indicating copy to clipboard operation
lance copied to clipboard

perf: PQ performance

Open BubbleCal opened this issue 1 year ago • 2 comments

Now we are optimizing the PQ performance, here to track all potential bottlenecks:

  • [ ] 256bit/512bit SIMD for 4bit PQ
  • [x] transforming: now we handle 4bit PQ case while transforming, which requires to collect the intermediate results
  • [x] cache locality: the distance calculating can be optimized by better access pattern to distance table
  • [x] constructing distance table: now it's 4x slower than computing distances
  • [ ] find partitions: significant when nprobes is small

BubbleCal avatar Nov 21 '24 01:11 BubbleCal

Have we fixed either of these yet?

  • [x] https://github.com/lancedb/lance/issues/2838
  • [x] https://github.com/lancedb/lance/issues/2837

wjones127 avatar Nov 21 '24 01:11 wjones127

Have we fixed either of these yet?

not yet

BubbleCal avatar Nov 21 '24 01:11 BubbleCal