Improve `Semantics` construction performance
hir::Semantics is a general cache used to be able to resolve any syntax node to its semantic definition. This is generally slow given the kind of work it does, but quite often we do a lot more work than necessary, populating unnecessary DynMap tables and more importantly, we populate it via parsing, where the parsing happens through direct database calls, even though the Semantics itself caches these. That means for big enough files we can end up with thrashing the LRU cache a fair bit. We should fix that, a related problem is the HasSource trait going through the database instead of Semantics which means we can end up with nodes not cached by Semantics ultimately ending up in panics when it tries to lookup stuff (which has happened in the past).
So the things that need to be changed are:
- Change the signature of (or add a second version)
HasSource::sourceto take a reference to theSemanticsinstead that does the right thing https://github.com/rust-lang/rust-analyzer/blob/062e1b9b81542680afde1aa47fa5316ac5133461/crates/hir/src/has_source.rs#L29 - Change https://github.com/rust-lang/rust-analyzer/blob/062e1b9b81542680afde1aa47fa5316ac5133461/crates/hir-def/src/src.rs#L14-L21 to support this
- Change https://github.com/rust-lang/rust-analyzer/blob/062e1b9b81542680afde1aa47fa5316ac5133461/crates/hir-def/src/child_by_source.rs#L26-L33 to support this
This will most likely require adding callbacks to some of the hir-def crate APIs to abstract away the parsing (and then add / keep the current APIs as convenience APIs where Semantics won't be involved).
Mostly unrelated, but in the past we used to construct multiple Semantics during the execution of a single request. It might be worth checking whether that's still the case.
@rustbot claim
Semantics caches those things but the direction is opposite; the cache direction is Source -> Def, but the functions we need to modify have the signatures Def -> Source. Should this be done by making the caches bidirectional or doing some inefficient inverse search like HashMap::iter().find()?
edit) The former seems to contradict #17380 🤔
Hey @Veykril , is this up for grab?
@rustbot claim
Semanticscaches those things but the direction is opposite; the cache direction isSource -> Def, but the functions we need to modify have the signaturesDef -> Source. Should this be done by making the caches bidirectional or doing some inefficient inverse search likeHashMap::iter().find()?edit) The former seems to contradict #17380 🤔
Good question, LRU cache invalidation wont be an issue with the new salsa anymore as the LRU doesnt trigger mid query computations. It has been a while since ive though about this issue