Paul Dix
Paul Dix
domainatrix
A cruel mistress that uses the public suffix domain list to dominate URLs by canonicalizing, finding the public suffix, and breaking them into their domain parts.
extractula
Extracts content like title, summary, and images from web pages like Dracula extracts blood: with care and finesse.
monkey-rust
My first foray into learning Rust: an implementation of Thorsten Ball's Monkey programming language
service-oriented-design-with-ruby
Code examples from my forthcoming book "Service Oriented Design in Ruby and Rails"
truffle-hog
Finds RSS and Atom feed urls in html like a hog finds truffles. Tasty, delicious feeds... er, truffles.
typhoeus
Like a modern code version of the mythical beast with 100 serpent heads, Typhoeus runs HTTP requests in parallel while cleanly encapsulating handling logic.
working-with-big-data
Slides, code, and supplemental materials for the LiveLesson: Working with Big Data: Infrastructure, Algorithms, and Visualizations