Robin Linacre

Results 234 comments of Robin Linacre

Is this a better implementation? Details ``` from __future__ import annotations from collections import Counter from typing import Dict, List, Set, Tuple from sqlglot import exp, parse_one # --------------------------------------------------------------------------- #...

see https://github.com/moj-analytical-services/splink/pull/2738

I think you're mixing up Splink 3 and Splink 4 syntax. The example on the page you link is: ``` import splink.comparison_library as cl from splink import DuckDBAPI, Linker, SettingsCreator,...

All the docs relate to the latest version - Splink 4. The banner at the top should probably be removed now, it's just for users maintaining old Splink 3 code

Yeah, that's right. Upgrading the docs to Splink 4 was a huge tasks and I lost the motivation to do everything. Also there's better error messages now, so the tables...

Sorry you're having problems. We're probably not going to have time to look at this any time soon. If you can find the bug, feel free to PR a fix...

I've been mulling this over. I had forgotten about this, but you can actually achieve this functionality as follows: ``` SettingsCreator.from_path_or_dict(path).get_settings('duckdb').match_weights_chart() ``` This is a bit of a hangover from...

This could also be useful to identify any code that is no longer actually used in Splink

Thanks for the report. Yeah, the solution of materialising the sampled table sounds sensible to me and should have very little impact on performance

Thanks for the report - yeah, this def looks like something we should fix