Tom Hepworth

Results: 19 issues by Tom Hepworth

Code that allows users to evaluate their clusters in the context of the `linker.predict()` output. Specifically, this lets users view the match threshold associated with cluster links....

### What happens? When loading a pandas df with array columns into duckdb, the arrays seem to be converted to strings. So an array `[1234, 5678]` becomes `"[1234, 5678]"`....
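To make the symptom concrete, here is a minimal standard-library sketch. It does not call duckdb or pandas and does not reproduce the bug itself; it only mimics the reported outcome (a list turning into its string form) and shows one way such values could be detected and parsed back. The helper names `looks_stringified` and `restore_array` are hypothetical and are not part of Splink or duckdb.

```python
import json


def looks_stringified(value):
    """Return True if value is a str that parses as a JSON array (hypothetical helper)."""
    if not isinstance(value, str):
        return False
    try:
        return isinstance(json.loads(value), list)
    except json.JSONDecodeError:
        return False


def restore_array(value):
    """Parse a stringified array back into a list; leave other values unchanged."""
    return json.loads(value) if looks_stringified(value) else value


# Assumption: the stringified value has the same shape as Python's str() of a list.
original = [1234, 5678]
stringified = str(original)

restored = restore_array(stringified)
```

A check like this only covers arrays whose string form happens to be valid JSON (numbers, for example); arrays of strings rendered with single quotes would need a different parser.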

It's late, so I'll update this with some further information tomorrow morning as appropriate. This impacts anywhere that either requests nulls be removed/adjusted or that compares strings. This rather tragically...

bug

You can see part of the debate [here](https://github.com/moj-analytical-services/splink/pull/545#issuecomment-1167063184) and my temporary fix [here](https://github.com/moj-analytical-services/splink/blob/86ffa1c4b11ae5fcaac7ac2b919ae1cb60510bb9/splink/linker.py#L833). In short, when creating a column using `random()` in a CTE chain and then attempting to filter...

bug
duckdb

### Type of PR - [ ] BUG - [x] FEAT - [ ] MAINT - [ ] DOC ### Is your Pull Request linked to an existing Issue or...

### Type of PR - [ ] BUG - [ ] FEAT - [ ] MAINT - [x] DOC ### Is your Pull Request linked to an existing Issue or...

### What happens? See https://github.com/moj-analytical-services/splink/pull/1061#issuecomment-1546708437. In short, when generating our term frequency tables we are currently recording both the physical and templated names within `linker._intermediate_table_cache`. This in turn makes clean-up...

caching

### Is your proposal related to a problem? Tags onto this conversation about cleaning up our settings class - https://github.com/moj-analytical-services/splink/issues/1669. I think we should sit down and work out how...

enhancement
splink4
refactoring

### Is your proposal related to a problem? See this closed issue for some further background, though it's not necessary - https://github.com/moj-analytical-services/splink/issues/1670. ## The Problem Currently, we face the following...

enhancement
Interface/API improvement
backend_change
comparison levels
blocking
splink4

### Is your proposal related to a problem? This proposal builds upon the discussions in issue https://github.com/moj-analytical-services/splink/issues/1055. ### Describe the solution you'd like As Splink has expanded, some of our...

enhancement
splink4
settings