go-ethereum eth/filters: avoid block body retrieval when no matching logs

eth/filters: avoid block body retrieval when no matching logs

Open s1na opened this issue 2 years ago • 1 comments

This PR supersedes https://github.com/ethereum/go-ethereum/pull/23655. It applies the trick only to the eth backend.

It changes the logic of GetLogs, which doesn't fill in log metadata anymore. Both GetLogs callers check for this variant and fill in the details when necessary (One is in checkMatches, the other in here https://github.com/ethereum/go-ethereum/blob/d12b1a91cd9423f83bf77dbe363164797549ff15/eth/filters/filter_system.go#L421

NOTE: the field logIndex has changed its meaning

Jun 29 '22 16:06 s1na

I tested the result against master and the result seems correct. Benchmark results:

------------ master -----------
Requesting logs for block range [13000000, 13002000]
Filtering logs took 18713
Requesting logs for block range [13002000, 13004000]
Filtering logs took 15218
Requesting logs for block range [13004000, 13006000]
Filtering logs took 15882
Requesting logs for block range [13006000, 13008000]
Filtering logs took 18950
Requesting logs for block range [13008000, 13010000]
Filtering logs took 14798

------------------ PR. ----------------
Requesting logs for block range [13000000, 13002000]
Filtering logs took 15515
Requesting logs for block range [13002000, 13004000]
Filtering logs took 12520
Requesting logs for block range [13004000, 13006000]
Filtering logs took 13296
Requesting logs for block range [13006000, 13008000]
Filtering logs took 16063
Requesting logs for block range [13008000, 13010000]
Filtering logs took 12268

Jun 30 '22 11:06 s1na

This got bitrotted by the log filter cache PR, needs a rebase.

Aug 19 '22 10:08 holiman

This got bitrotted by the log filter cache PR, needs a rebase.

Ping @s1na , want to give this another push?

Nov 05 '22 14:11 holiman

Ping @s1na , want to give this another push?

Ping again

Nov 29 '22 10:11 holiman

~~@holiman @fjl PTAL~~

Edit: Ah wait, I think its better to cache the derived logs.

Dec 01 '22 10:12 s1na

Yeah this idea doesn't seem to play nice with the cache. I see 2 options, none good:

Cache underived logs: which means we have to fetch block every time
But we can't easily cache derived logs either after we've done filtering at which point we have the data to fill in

Dec 01 '22 10:12 s1na

We can put the *types.Block into the cache entry as well, along with the logs slice.

Dec 01 '22 11:12 fjl

We can put the *types.Block into the cache entry as well, along with the logs slice.

Now that you say it, blocks are cached in blockchain. It's fine.

Dec 04 '22 15:12 s1na

I think what needs to happen here, is that the cache should contain the un-derived logs along with the *types.Block pointer for the block the logs were created in. That way, we can always derive them later for any query.

We can also consider mutating the cache entry with the derived results, if it is possible.

Dec 05 '22 14:12 fjl

We discussed this, a plan of action:

Cache a flattened list of logs. In this case we need to partially derive fields (everyone except for txhash).
Also cache block body atomic.Pointer[types.Body] (note body not block). instantiating block has extra costs.
Upon seeing first matching log, fetch body and cache it.
Note: this PR is calculating LogIndex wrong, it should be index of log in block.

Dec 08 '22 18:12 s1na

Since light client can't follow the chain I couldn't test the log filters on a live node. But with the testcase I caught an issue that has been there for a while. Basically the block range of a filter wasn't enforced. That's now fixed in the PR.

Dec 15 '22 09:12 s1na

@fjl ping 👉

Jan 19 '23 16:01 s1na

go-ethereum go-ethereum copied to clipboard

eth/filters: avoid block body retrieval when no matching logs

go-ethereum
go-ethereum copied to clipboard