Chase Geigle
There should be a way to save intermediate model files between iterations of inference.
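A minimal sketch of one way to support this: let the sampler accept a checkpoint callback that fires every few iterations so the caller decides where and how to write the model. The names `run_inference`, `perform_iteration`, and `on_checkpoint` below are illustrative, not existing MeTA API.

```cpp
#include <cstddef>
#include <functional>

// Illustrative only: run inference, invoking a user-supplied callback every
// `save_period` iterations so the caller can checkpoint the model to disk.
void run_inference(std::size_t num_iters, std::size_t save_period,
                   const std::function<void(std::size_t)>& on_checkpoint) {
    for (std::size_t iter = 0; iter < num_iters; ++iter) {
        // perform_iteration(iter); // existing per-iteration update (assumed)
        if (on_checkpoint && save_period > 0 && iter % save_period == 0)
            on_checkpoint(iter); // e.g. save to "model-iter-<iter>"
    }
}
```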
We should benchmark against existing LDA implementations (like Mallet) as a sanity check.
You should be able to load a model from a stream.
Model saving should support writing to streams instead of fixed files, and should use the binary format from `io::packed`.
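A rough sketch of what the stream-based save/load in the two items above could look like. The stub below uses raw binary stream writes where the real implementation would call `io::packed::write` / `io::packed::read`; the struct and member names are illustrative only.

```cpp
#include <cstdint>
#include <istream>
#include <ostream>
#include <vector>

// Hypothetical model stub: a real lda_model would serialize its counts the
// same way, but through io::packed rather than the raw writes shown here.
struct model_stub {
    std::uint64_t num_topics;
    std::vector<double> phi; // flattened topic-word distributions

    void save(std::ostream& out) const {
        out.write(reinterpret_cast<const char*>(&num_topics), sizeof(num_topics));
        std::uint64_t size = phi.size();
        out.write(reinterpret_cast<const char*>(&size), sizeof(size));
        out.write(reinterpret_cast<const char*>(phi.data()),
                  static_cast<std::streamsize>(size * sizeof(double)));
    }

    void load(std::istream& in) {
        in.read(reinterpret_cast<char*>(&num_topics), sizeof(num_topics));
        std::uint64_t size = 0;
        in.read(reinterpret_cast<char*>(&size), sizeof(size));
        phi.resize(size);
        in.read(reinterpret_cast<char*>(phi.data()),
                static_cast<std::streamsize>(size * sizeof(double)));
    }
};
```

Taking streams rather than filenames means the same code path works for files, in-memory buffers, and compressed streams.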
The SCVB0 implementation should have an interface that reflects its stochastic nature (allowing it to fit new documents in a streaming fashion), mirroring the online classifier interface.
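A hypothetical interface sketch for that; the method names are illustrative and do not reflect MeTA's actual online classifier API.

```cpp
#include <cstddef>
#include <vector>

// Illustrative only: an SCVB0 inferencer that consumes documents one at a
// time, mirroring an online classifier's per-instance training call.
class stochastic_inferencer {
  public:
    // Update the topic statistics from a single new document (a minibatch
    // of size one).
    virtual void partial_fit(const std::vector<std::size_t>& doc) = 0;

    // Infer a topic distribution for a document without updating the model.
    virtual std::vector<double> transform(const std::vector<std::size_t>& doc) const = 0;

    virtual ~stochastic_inferencer() = default;
};
```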
We should add a parallel implementation of CVB0, just like we have a parallel implementation of collapsed Gibbs sampling.
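One common way to parallelize this (as in approximate distributed LDA) is to split the documents across threads, update against thread-local copies of the topic-word counts, and merge afterwards. The sketch below only shows the thread scaffolding; `update_document` and the merge step are assumed.

```cpp
#include <algorithm>
#include <cstddef>
#include <thread>
#include <vector>

// Illustrative only: one CVB0 sweep with documents partitioned across threads.
void parallel_cvb0_iteration(std::size_t num_docs, std::size_t num_threads) {
    std::vector<std::thread> workers;
    std::size_t chunk = (num_docs + num_threads - 1) / num_threads;
    for (std::size_t t = 0; t < num_threads; ++t) {
        workers.emplace_back([=]() {
            std::size_t begin = t * chunk;
            std::size_t end = std::min(begin + chunk, num_docs);
            for (std::size_t d = begin; d < end; ++d) {
                // update_document(d): CVB0 variational update against
                // thread-local copies of the topic-word counts (assumed)
            }
        });
    }
    for (auto& w : workers)
        w.join();
    // merge thread-local count deltas into the global counts here (assumed)
}
```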
We should optimize the alpha and beta values during inference (see Hannah Wallach's thesis, chapter 2).
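For reference, a Minka-style fixed-point update for the per-topic alpha (the form used in that chapter) could look roughly like this; `digamma` is implemented inline since the standard library does not provide one, and all names are illustrative.

```cpp
#include <cmath>
#include <cstddef>
#include <numeric>
#include <vector>

// Simple digamma approximation (recurrence plus asymptotic expansion).
double digamma(double x) {
    double result = 0.0;
    while (x < 6.0) {
        result -= 1.0 / x;
        x += 1.0;
    }
    double inv2 = 1.0 / (x * x);
    return result + std::log(x) - 0.5 / x
           - inv2 * (1.0 / 12.0 - inv2 * (1.0 / 120.0 - inv2 / 252.0));
}

// One fixed-point update of the per-topic alpha. n_dk[d][k] is the number of
// tokens in document d assigned to topic k; n_d[d] is the length of document
// d. The analogous update applies to beta over the topic-word counts.
std::vector<double> update_alpha(const std::vector<double>& alpha,
                                 const std::vector<std::vector<double>>& n_dk,
                                 const std::vector<double>& n_d) {
    double alpha0 = std::accumulate(alpha.begin(), alpha.end(), 0.0);
    double denom = 0.0;
    for (double len : n_d)
        denom += digamma(len + alpha0) - digamma(alpha0);

    std::vector<double> next(alpha.size());
    for (std::size_t k = 0; k < alpha.size(); ++k) {
        double numer = 0.0;
        for (const auto& doc : n_dk)
            numer += digamma(doc[k] + alpha[k]) - digamma(alpha[k]);
        next[k] = alpha[k] * numer / denom;
    }
    return next;
}
```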
We should revisit our alpha and beta default values. 1.0 is _way_ too large.
The Gibbs sampling inference methods should allow taking multiple samples to estimate theta and phi.
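A sketch of the per-sample estimate that would be averaged: after burn-in, every `lag` iterations compute theta (and analogously phi) from the current counts and accumulate a running mean, rather than reading off only the final sample. Names below are illustrative and a symmetric alpha is assumed.

```cpp
#include <cstddef>
#include <vector>

// Illustrative only: theta estimate for one document from the current Gibbs
// sample. n_dk[k] is the count of tokens in the document assigned to topic k.
std::vector<double> theta_estimate(const std::vector<double>& n_dk, double alpha) {
    double total = 0.0;
    for (double c : n_dk)
        total += c + alpha;
    std::vector<double> theta(n_dk.size());
    for (std::size_t k = 0; k < n_dk.size(); ++k)
        theta[k] = (n_dk[k] + alpha) / total;
    return theta;
}
```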