pgsphere Add BRIN support for spoint and sbox

As discussed in the mailing list, I opened the PR here. Support for searches within spherical circles will be added in the future.

Jan 15 '18 10:01 gbroccolo

In general looks very good, but please take a look to minor notes by my review.

Jan 16 '18 09:01 akorotkov

Thank you @akorotkov for the review. I'll fix the minor notes ASAP, and let you know when it will be done.

Jan 16 '18 10:01 gbroccolo

Hi @akorotkov I've then fixed what you reported: I've used pgindent for source code, and corrected the Makefile in order to execute regression test of BRIN code just for proper PostgreSQL versions. Sorry for my late reply, it was a busy period. It should be now ready to be merged, but let me know if there are further needed fixes.

Feb 27 '18 14:02 gbroccolo

You still have a tab on line 34 of the Makefile (where you added $(BRIN_REGRESS)) that needs to be changed to spaces, I think. Compare with line 33 which is indented with only spaces. Either that or line 33 should have been indented with a tab. Not sure.

This is a fantastic contribution!

Feb 27 '18 15:02 esabol

Hi @esabol indeed I passed to pgindent only source code, not the Makefile. It works, but I can fix the style also for this. Thank you for your review!

Feb 27 '18 19:02 gbroccolo

I've also fixed the Makefile as per @esabol suggestion: other lines where indented with spaces so I uniformly indented the added line (n. 34) with spaces too.

Mar 04 '18 16:03 gbroccolo

I think we also need to bump extension version, so that existing users can upgrade. Another issue is instances pg_upgrade'd from pre-9.5. PostgreSQL doesn't offer built-in mechanism which could add them a BRIN index support. But we can do that using manual SQL-script and describe its usage in README.

Mar 11 '18 18:03 akorotkov

Hi @akorotkov I've added the following changes:

Upgrade of the copyright to 2018
Upgrade version in control file from 1.0 to 1.1
Upgrade version in Makefile from 1.1.5 to 1.1.6
Included in utils/ a command to load BRIN support in case of upgrades from pre-9.5 releases through pg_upgrade
Add the above information both to the documentation and in the README file

Mar 13 '18 00:03 gbroccolo

It's been a while. Any progress?

May 10 '18 20:05 esabol

@esabol I've applied the fixes required by @akorotkov, adding a command to support upgrades from pre-9.5 releases through pg_upgrade (similarly to what I've done for BRINs in PostGIS) and improving the documentation too. This is ready for the final review.

In the meantime, I've also tried to perform some benchmark on an artificial sample of spherical points, varying the size of the sample itself. I'm attaching here one of the plot I've obtained: it reports the time needed to select 1 spherical point included in a particular spherical sector using both BRINs or GiST indices, as the size of the whole sample increase, keeping low values for the configured shared buffers in PostgreSQL in order to better see how BRINs are useful for big datasets (note the behaviours' change at ~10M points). select1

May 12 '18 12:05 gbroccolo

Nice work. The performance degradation for less than ~10M points is disappointing, if I am being honest.

May 12 '18 14:05 esabol

@esabol it is expected that BRINs performance is lower than a tree index. Block Range INdices map per range, than the retrieved ranges after the index scan have to be sequentially inspected to find exact matches. In this sense, BRINs are more "lossy" than GiST indices. But the cost to inspect ranges' map is as lower than the cost to inspect a GiST index as the sample size is larger. BRINs are specifically thought for big dataset, more than an alternative of GiST indices.

Furthermore, consider that the sample used to generate the plot is an artificial one, running on a small DB server. It would be good to see how the BRINs perform with a real dataset.

May 12 '18 19:05 gbroccolo

I guess I'd just hoped that BRINs would see a benefit at ~1M points instead of ~10M. Many of our largest catalogs are in the millions of rows, not tens of millions. Of course, that will change over time. Catalogs only keep getting bigger.

May 12 '18 20:05 esabol

@esabol Block Range INdexing is configurable, i.e. you can set the number of data pages included in each range (in the above example, I considered 16 pages per range) as lower as you are able to be close to the desired performance. This will increase the effort for the maintenance of the index (time needed for creation, final size of the index, impact during insertions, time needed for resummarization) that will be more similar to the one needed for the GiST indices, but also the performance will be more similar for small datasets, keeping the advantages for larger ones. The parameter that configure this behaviour is the pages_per_range one, tuning it you can start to see benefits also for smaller datasets than ~10M ones. Furthermore, this "breakeven" point strongly depend from the underlying hardware, the specific data content, and so on. You should check the behaviours in your specific case.

May 13 '18 14:05 gbroccolo

@gbroccolo, right. BRIN performance is strongly dependent on how keys are correlated with layout of tuples on heap pages. We can imagine two corner cases for BRIN performance:

Worst case: keys distribution is random. MBR for page is almost as wide as MBR for the whole dataset.
Heap tuples are perfectly clustered by their keys: tuples which lays near contains near located keys. This could be achieved artificially by clustering heap by GiST index.

So as I get, the experiment you did is close to the first scenario. In real-life, distribution could be different, and expected to be somewhere in the middle of these two corner cases.

I would note, that I found experiment you did quite surprising. BRIN is expected to be slower on selective queries (low fraction of tuples is selected), and faster on non-selective queries (high fraction of tuples is selected). So, for me most straight-forward benchmark would be following: for some large-enough dataset run queries with different selectivity. With fraction of selected tuples higher than some threshold, BRIN should outperform GiST.

As I get in your experiment, every query selects 1 point. And we can see serious degradation of GiST near 10^7 dataset size. I can explain it as following. Before 10^7 points, GiST index fits OS cache. After that, lookups in GiST index becomes more expensive, because of random disk seeks. While BRIN index performs better thanks to smaller size of sequential disk access pattern. In order to help me validate my thoughts, could you please provide more details on hardware you use during benchmark: amount of RAM, type of disk (hard disk or SSD), PostgreSQL settings etc?

May 17 '18 09:05 akorotkov

Regarding pull request itself, I've some more notes:

Upgrade from pre-9.5 versions needs ALTER EXTENSION ... ADD ... commands to make new objects to be linked to extension.
You deleted sscan.c. Keeping this file in git helps to support building on systems, which don't have flex installed. So, please don't remove it from repository, even despite make clean removes it.

May 17 '18 10:05 akorotkov

Please excuse me, regrettably sscan.c had been prematurely deleted by me on the master branch. I have now reverted the deletion.

May 17 '18 12:05 ghost

@mnullmei, no problem, thanks.

May 17 '18 12:05 akorotkov

Hi @akorotkov, let me give further details about my benchmarks: I considered an Ubuntu Xenial VM with 2GB RAM, the storage unit was based on SSD, but consider the driver virtualization. I considered PostgreSQL 9.6 + pgSphere built from this current branch, and default configuration for PostgreSQL.

The artificial dataset was inserted in the DB in order to keep spatially close objects as more as possible close in data page tuples on disk, so the benchmark was actually more similar to the second scenario you mentioned in your reply, than the first one.

Apart of the graph I attached in my previous reply, I've produced several graphs, in order to inspect data access to disk, how performance changes with the selectivity of the query, and so on. I attach here the graph you requested, reporting the time needed to select a variable number of points included in a spherical sector, from 1 single point to 1 million. BRINs start to perform closer to GiST indices at ~O(100k) points, so performance start to increase as the selectivity is lower and lower.

selectn

For a further validation of your thoughts, I'm also attaching the number of indices & data blocks accessed from the storage, as the selectivity of the query decreases. As you mentioned, the reduced size of the index allows it to be completely cached in the database's shared buffers in memory. As a consequence no index block needs to be read from disk in case of BRIN, and also the number of data blocks read from disk is quite lower than GiST case as the selectivity of the query decreases.

indexpages datapages

Regarding the PR: let me rebase on master in order to include the fix of @mnullmei, and have a look to the ALTER EXTENSION stuff.

May 21 '18 21:05 gbroccolo

@akorotkov I rebased the PR branch to the latest master, and added the ALTER EXTENSION commands to link the objects to the extension.

May 27 '18 15:05 gbroccolo

If anyone is interested in reviving this feature, I recommend submitting a PR to the currently active pgsphere repo: https://github.com/postgrespro/pgsphere

Aug 09 '23 18:08 esabol

Hi @esabol, to be fair I don't know anymore if the PR would be useful after 5 years. Maybe now catalogs with ~O(10M) start to be more common, anyhow it's a pity because the PR was ready with all the required fixes and changes came up already 5 years ago. Before starting working on this again, my question is: would the PR (considering the limitations of the Block Range INdexing) be still of interest for pgsphere?

Aug 14 '23 20:08 gbroccolo

@gbroccolo wrote:

Before starting working on this again, my question is: would the PR (considering the limitations of the Block Range INdexing) be still of interest for pgsphere?

I don't know that I can answer that for all pgsphere users, but I do agree that ~O(10M) datasets are more common these days. That said, I have opened an issue to discuss the matter in the new repo.

https://github.com/postgrespro/pgsphere/issues/52

Looking at the changes in this PR, I don't think it would be that difficult to get a similar PR working with the new pgsphere repo. It's kind of up to you, I think.

Aug 14 '23 23:08 esabol

Update: There's now a working PR for adding BRIN support to the https://github.com/postgrespro/pgsphere/ repo that is based on @gbroccolo's PR here with essentially just some updates to the Makefile.

https://github.com/postgrespro/pgsphere/pull/55

Aug 16 '23 19:08 esabol

pgsphere pgsphere copied to clipboard

Add BRIN support for spoint and sbox

pgsphere
pgsphere copied to clipboard