Paul Dix

Results 9 repositories owned by Paul Dix

basset

62
Stars
6
Forks
Watchers

Library for various machine learning tasks

domainatrix

312
Stars
58
Forks
Watchers

A cruel mistress that uses the public suffix domain list to dominate URLs by canonicalizing, finding the public suffix, and breaking them into their domain parts.

extractula

39
Stars
2
Forks
Watchers

Extracts content like title, summary, and images from web pages like Dracula extracts blood: with care and finesse.

monkey-rust

105
Stars
3
Forks
Watchers

My first foray into learning Rust: an implementation of Thorsten Ball's Monkey programming language

sax-machine

233
Stars
70
Forks
Watchers

A declarative sax parsing library backed by Nokogiri.

service-oriented-design-with-ruby

244
Stars
60
Forks
Watchers

Code examples from my forthcoming book "Service Oriented Design in Ruby and Rails"

truffle-hog

38
Stars
7
Forks
Watchers

Finds RSS and Atom feed urls in html like a hog finds truffles. Tasty, delicious feeds... er, truffles.

typhoeus

81
Stars
5
Forks
Watchers

Like a modern code version of the mythical beast with 100 serpent heads, Typhoeus runs HTTP requests in parallel while cleanly encapsulating handling logic.

working-with-big-data

55
Stars
33
Forks
Watchers

Slides, code, and supplemental materials for the LiveLesson: Working with Big Data: Infrastructure, Algorithms, and Visualizations