Tom Hepworth
Tom Hepworth
Just to note, this has been logged with the `duckdb` guys [here](https://github.com/duckdb/duckdb/issues/3974).
Ah oops, I forgot to check if any of our tests are affected by this logic...
@RobinL could you let me know if you even want this change in Splink3. I can port the code to v4 and ignore this version if preferred.
I believe this is partially resolved in [here](https://github.com/moj-analytical-services/splink/pull/983). This should be relatively simple to resolve if we can reuse the code implemented above.
> This looks like a great thing to think about and write down. didn't read in depth, but I did notice a type of the filename `depdenency_management.md` Hahaha, thanks Nick....
Yah, really like this and it's essentially the direction I was going in. I was leaning more towards simply combining the confusion matrix with the accuracy chart to reduce the...
Sorry, a quick Q on this - should we set a default value for the match threshold selection? Initially, the chart currently looks like so (at least for me): where...
Thanks as ever, Nick. We really appreciate all of the comments, feedback and code you send our way. And yes, you have brought this up previously, but in the form...
Thanks so much. I've had a quick poke around your code, but I'm quite behind due to being sick for 2 of the last 3 weeks, so I haven't had...
[MAINT] `linker._intermediate_table_cache` logs multiple entries when creating term frequency tables
@RobinL, I've quickly drafted up a potential solution to address this issue, available for review [here](https://github.com/moj-analytical-services/splink/compare/remove_term_frequency_templated_names_from_cache). The proposed changes include: 1. **Removal of Templated Names from the Cache**: For example,...