Robin Linacre

Results 234 comments of Robin Linacre

Hi @vfrank66 i've started to have a look at this, and do some experiments. All working so far. Could I double check - are the implementations of the new similarity...

Thanks for the report The characters �[4m�[0m are ANSI escape codes for terminal text formatting. Specifically: - �[4m is the ANSI escape code for enabling underline. - �[0m is the...

Closing on the basis I think this may have been a copy and paste error - please repoen with a reprex if you're still having issues

I think you can do this already using this kind of syntax: ```python import splink.comparison_level_library as cll import splink.comparison_library as cl from splink import DuckDBAPI, Linker, SettingsCreator, block_on, splink_datasets df...

Yeah - you're right to highlight these challenges. It's typically best to try and order in terms of 'better matches higher' - start with the most precise matches and work...

Starting to experiment with this as follows: ```python import sqlglot from sqlglot import exp sql = """ SELECT CASE WHEN levenshtein(name_l, name_r) < 2 THEN 0 WHEN levenshtein(name_l, name_r) <...

TODO: [ ] - Check tf adjustments work if a tf table is manually registered [ ] - Double check tf adjustments work from tf nodes with concat [ ]...

I've some some performance testing for a complex model which shows that the time to generate the sql is longer than the time to execute. In these tests, it takes...

Here's a reprex of the current inability to use the function if you use date types: Click to expand ```python import datetime import pandas as pd import splink.comparison_library as cl...