generalized-language-modeling-toolkit
Distribute calculation via a MapReduce cluster
At some point we will have to think about handling large data sets such as web crawls.
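
To make the idea concrete, here is a rough sketch of how n-gram counting could be split into a map step and a reduce step. This is not the toolkit's code: the function names `map_ngrams`, `reduce_counts`, and `count_ngrams` are hypothetical, whitespace tokenization is an assumption, and everything runs in a single process just to show the shape of the computation.

```python
from collections import Counter
from itertools import chain


def map_ngrams(line, n=2):
    """Map step (hypothetical): emit (ngram, 1) pairs for one line of text."""
    tokens = line.split()  # assumption: plain whitespace tokenization
    return [(tuple(tokens[i:i + n]), 1) for i in range(len(tokens) - n + 1)]


def reduce_counts(pairs):
    """Reduce step (hypothetical): sum the counts for each n-gram key."""
    totals = Counter()
    for ngram, count in pairs:
        totals[ngram] += count
    return totals


def count_ngrams(lines, n=2):
    """Run map and reduce locally; a cluster would shard both steps by key."""
    mapped = chain.from_iterable(map_ngrams(line, n) for line in lines)
    return reduce_counts(mapped)
```

On a real cluster the map step would run per input split and the framework would group pairs by n-gram before the reduce step, so no single machine needs to hold the whole crawl in memory.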