brown-cluster
brown-cluster copied to clipboard
C++ implementation of the Brown word clustering algorithm.
Hello, I would like to use your algorithm to categorize job titles. Do you still make updates and maintain the library ? Bets Regards, Evangelia
The clustering algorithm seg faults when running for large files ~12GB. This commit fixes it.
The code fails (with core dump: segmentation fault message) when I run it on a huge txt file (about 20M types and 14GB file size). I already used wcluster for...
I am using Cygwin on windows and trying to run this code. On the first step when running "make" command, getting following error. basic/prob-utils.cc: In function ‘double rand_gaussian(double, double)’: basic/prob-utils.cc:8:37:...
Does the package return (or write in the log file) the perplexity or any other goodness of fit measure? If yes, would it be a good idea to run a...
Hi, Can you please guide me how can I pass multiple text files to generate output files on them?
strdb.cc terminates with segmentation fault when run on large data files of say 5 GB in size. This commit has a fix for this issue. ``` Program received signal SIGSEGV,...
I'm not sure whether this can be called an issue or the matter of understanding, I ran the clustering on Persian text and after couple of hours I got these...
I'm trying to compile on Windows 7 using g++ 4.7.2 and GNU Make 3.8.1. When I do I get the following errors: C:\Users\ameasure\brown-cluster-master>make g++ -Wall -g -o wcluster.o -c wcluster.cc...