Rolf Jagerman

25 comments by Rolf Jagerman

For training and evaluating with libsvm data, there is now a libsvm parser in the `tensorflow_datasets` package [here](https://github.com/tensorflow/datasets/blob/b221178c90c05115b9e183d57242844bb3fddb4b/tensorflow_datasets/ranking/libsvm_ranking_parser.py), which can act as a generator. An example of how this parser...
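For readers unfamiliar with the format, here is a minimal sketch of what one line of LibSVM-style ranking data looks like and how it can be split into a relevance label, a query id, and sparse features. This is not the `tensorflow_datasets` parser linked above; the function name `parse_libsvm_ranking_line` and the sample line are made up for illustration, and the sketch assumes the common `label qid:<id> <index>:<value> ... # comment` layout.

```python
from typing import Dict, Tuple


def parse_libsvm_ranking_line(line: str) -> Tuple[float, str, Dict[int, float]]:
    """Split one LibSVM-style ranking line into (label, qid, features).

    Hypothetical helper for illustration only; it does not mirror the
    TFDS parser's interface.
    """
    # Everything after '#' is treated as a trailing comment and dropped.
    content = line.split("#", 1)[0].strip()
    tokens = content.split()

    # The first token is the relevance label.
    label = float(tokens[0])

    qid = ""
    features: Dict[int, float] = {}
    for token in tokens[1:]:
        key, value = token.split(":", 1)
        if key == "qid":
            qid = value  # query identifier that groups documents into lists
        else:
            features[int(key)] = float(value)  # sparse feature index -> value
    return label, qid, features


# Made-up sample line: label 2, query 10, two non-zero features.
example = "2 qid:10 1:0.5 3:1.25 # doc-7"
label, qid, features = parse_libsvm_ranking_line(example)
```

Grouping the parsed rows by `qid` yields one ranked list per query, which is the unit a learning-to-rank model trains on.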

I will close this issue now. Feel free to reach out or re-open if you have further questions about LibSVM-style data.

For LibSVM datasets, we now recommend using the `tensorflow_datasets` (TFDS) package. It already has a few standard LTR datasets that are in LibSVM format, most notably MSLR-WEB30K and Istella, see:...

Thanks for your question! The main reason for this limitation is performance. The vector/matrix data is stored as dense arrays of primitives which the JVM stores as a single object....

There are multiple reasons a pull or push can fail. I think it's a good idea for me to add documentation on this, as this behavior is not intuitive. The two...

So, the code example is pretty unclear, my apologies: the dataset is not an RDD in Spark, but a single partition of the data. A better example would probably be: ```...

Thank you for your question, you raise a good point! When we initially started this project, the goal was to develop more scalable inference of LDA in Spark via collapsed...

Hey, unfortunately I have had no time to work on Glint due to my everyday obligations as a PhD student. I will be working on parameter-server technology and Glint full-time...

You're right that Glint is stand-alone and not necessarily interfaced with Spark. You could use it entirely without Spark. The documentation has a section that shows how Glint can easily...

Hi Yitong, Thanks for your interest in the project! I have unfortunately not tried this with YARN, and I personally have never used YARN before. I do think it would...