Aaron Jaech comments

Results 7 comments of


                                            Aaron Jaech

what are these results?

I think you are looking in the wrong file for the cluster output. Look for a file called "paths" for the cluster ids.

When size of data is large (over 100 MB), Brown-cluster program will be killed. How can I fix this error?

The size of the input data doesn't matter as much as the size of the vocabulary. How big is the vocabulary you are dealing with?

Is there any limit for the vocab size (#types)?

I'm not sure what the exact limit is but I'm not surprised that it failed with 20M types. You can try using the restrict command line option to restrict it...

Is there any limit for the vocab size (#types)?

Did you try using the flag to restrict the vocabulary? On Thursday, July 14, 2016, lavelli [email protected] wrote: > I have noticed that at the end of March a new...

error code ，

That looks like a mistake. Thanks for finding it. I don't think it effects anything though.

Data download and preprocess

Hi, can you email me at [email protected] and I will help you get started using the code and data.

Data download and preprocess

If you email me at [email protected] I can help you get started with the data.