Forest Gregg

Results 192 issues of Forest Gregg

Right now, blocking and scoring are two distinct phases. All the information about how two records came to be blocked together is unused by the scorer. This is a bit...

### Description A great many C extensions depend upon numpy. If numpy does not have a binary wheel targeting that platform, version, and architecture, then almost always it is going...

```console > echo "a,b" > [minimal_example].csv > csvs-to-sqlite "[minimal_example].csv" out.db /opt/homebrew/lib/python3.10/site-packages/csvs_to_sqlite/utils.py:38: FutureWarning: The error_bad_lines argument has been deprecated and will be removed in a future version. Use on_bad_lines in the...

sqlite supports these, and it would be quite nice to be able to add them with sqlite-utils. https://www.sqlite.org/foreignkeys.html#fk_actions

enhancement

It's pretty common that i need to trim leading or trailing white space from lots of columns in a database a part of an initial ETL. I use the following...

Currently, if two rects do not share an x1 and x2, they will not stack, it would be great if they did! Here's an example of something i would like...

enhancement
question

It would be awesome when using pgfsweave-script.R to produce a pdf, if there was an option to have the script clean up the intermediate files (*.tex, *.map, *.tikz).

Feature

Hi @mmihaltz, Thanks for the great package. Unfortunately, it's too slow right now for my current uses, but I'd like to see if I can get it up the speed...

I'm hoping I can use this package in my [dedupe](https://github.com/dedupeio/dedupe) library. We are committed to supporting python 2.7 for a while longer, so all of our dependencies need to support...