Juan-Pablo Velez

Results 17 comments of Juan-Pablo Velez

@hunterowens, is this still a problem or was it resolved?

@PlusTheMonk, if I recall we both did this independently. Did we end up picking one? If so, please close the commit. =)

canonical dataset (subset of building footprints with addresses. all unique addresses.) - 481,709 rows messy dataset (supplemental building ages. all uniquified addresses, started off as unique PINs .) - 441,909...

And what am I looking for when I do that? More true duplicates to label? What additional blocking predicate might it learn besides matching the entire field, and how would...

Would also improve search usability to make the search bar even bigger and more central, and to have autocomplete.

Here's a simple, relatively low-maintenance way to flag and link: The databases in the the dictionary are the sources for all/most of the datasets on a city's data portal. To...

Come to think of it, there's probably some easyish way of pulling databases, data portal datsets, and apps together. If developers added [civic.json](https://github.com/open-city/open-gov-hack-night/issues/3#issuecomment-24679245) (h/t @ryanbriones) files to their repos that...

The simplest thing to do would be to have a many to many relationship between databases and Socrata datasets, and not get into the weeds of tables at all. Matching...

What are some use cases for such an API?

Implementation question: should the new `build_powerplants` rule go in a new `build_powerplants.smk` file in `workflow/rules`, should it go in an existing `.smk` file?