cstore_fdw icon indicating copy to clipboard operation
cstore_fdw copied to clipboard

Hash-based indexes

Open saosebastiao opened this issue 10 years ago • 2 comments

I'm not familiar enough with FDWs to know for sure if this could work with the query planner, but I think it would be beneficial to use hash based indexes. In this sense, you could get away with not storing the indexed columns as columns, but store the hash in the table data similar to how you store skiplists. I think this would be extremely beneficial for data size and its implicit performance. I believe this is the strategy that kdb+ uses for its database (which is pretty ubiquitous in HFT).

saosebastiao avatar Aug 22 '14 17:08 saosebastiao

For reference, here is an article about a hack using a similar idea to get similar storage efficiency and performance using MySQL tables: https://engineering.eventbrite.com/optimizing-a-table-with-composite-primary-keys/

saosebastiao avatar Aug 22 '14 18:08 saosebastiao

Thanks @saosebastiao. I agree that this can be beneficial for data size. Original ORC format uses something similar. We will evaluate this in our next releases.

pykello avatar Aug 25 '14 06:08 pykello