Gordon Mohr

Results 20 issues of Gordon Mohr

I know that prior experience suggests that a new release is only 'battle-tested' by the larger community when it becomes official, & thus installed by a default `pip install gensim`....

housekeeping

Motivated by the SO question: https://stackoverflow.com/questions/55768598/interpret-the-doc2vec-vectors-clusters-representation/55779049#55779049 `Doc2Vec` could plausibly have a function that's reverse-inference: take a doc-vector, return a (ranked) list of words most-predicted by that input vector. It'd work...

feature
wishlist
difficulty medium
good first issue
Hacktoberfest

The *2Vec models have an underdocumented implementation limit in their Cython paths: any single text passed to training that's more than 10000 tokens is silently truncated to 10000 tokens, discarding...

feature
difficulty hard
impact MEDIUM
reach LOW

Per discussion on #1873, while I'm not convinced the 'Koan' paper about CBOW has identified an error or clear results-improvement, they do appear to have a faster negative-sampling method. Per...

PR to eventually address loss-tallying issues: #2617, #2735, #2743. Early tinkering stage.

[Reported in forum thread: https://groups.google.com/g/gensim/c/xaGvo0j8yv0/m/VI74_Fp7AAAJ] User identically-trained models in `gensim-3.8.3` and `gensim-4.0.1`. As expected, the files on disk from from 4.0.1 save are much smaller. Unexpectedly, loading the 4.0.1 save,...

bug
performance

The `api.load()` utility will grab fresh Python code, in the form of an `__init__.py` file inside the Github project `gensim-data` download area, and then run it at the user's machine....

feature
difficulty medium

The default `steps` should probably be higher: perhaps 10, or the same as the training `iter` value. If there are no known-tokens in the supplied `doc_words`, the method will return...

wishlist
difficulty medium

Hi! Do you know of any usable work, or have any recommendations, for `IconVG` decoding in the browser? I'm thinking of something that'd be able to replace inline elements, like...

I have tons of open windows/tabs - hence the interest in TabFern. But in Brave, it's glacially slow, especially: (1) On initial browser launch. Even if I had the TabFern...

triage
support
target-brave
performance