Erich Schubert

Results 67 issues of Erich Schubert

I'm looking for a good tool to benchmark ELKI ( http://elki.dbs.ifi.lmu.de/ ) clustering performance across parameters. The problem is, that the parameters aren't as nicely uniform as in your examples,...

enhancement

This has been removed in Lucene 5, and apparently was "dead code". Also, if data is read from disk, it would be inserted in primary, to approximate LRU.

German (and some [other European languages](https://en.wikipedia.org/wiki/Date_and_time_notation_in_Europe)) use a dot to denote ordinals. I.e. instead of "1st place", German uses "1. Platz". Instead of "July 28th", German uses "28. Juli". Examples...

bug
tokenize
ssplit

Since Java 8 has a time API comparable to JodaTime, projects such as jollyday are switching to this newer API, and so should CoreNLP. These APIs are quite similar, but...

The PAM/k-medoids implementation appears to implement SWAP, but not the BUILD part for initializing PAM. Instead you have to provide good starting medoids. I tired benchmarking it on a larger...

Investigation
Optimization
Proposal

The latest version of umlgraph on Mavencentral appears to be 5.6.6: https://mvnrepository.com/artifact/org.umlgraph/umlgraph It would be good to have the latest version available as a managed dependency via maven, jcenter, etc....

The current apriori implementation is really really slow. It does not correctly implement the apriori-gen and pruning parts of Apriori. So it is not really apriori as it was published,...

The PAM algorithm is quite different from a k-means style approach you implemented. Any idea what reference you used? I am trying to figure out why so many use the...

The auto-generated diagrams in my (complex) project tend to become a bit cluttered. Hence I am looking at ways to reduce contents. V1 had `-umlIncludeMethodParamTypes false`, and it seems this...

But as we are currently targeting JDK 8, and a new API arrived in JDK 9, it does not make sense to do this yet. The next long-term Java version...

enhancement