Andrew DalPino
Andrew DalPino
Ok this should be fixed in version 3.0.3 on PECL and on the latest master branch https://pecl.php.net/package/Tensor/3.0.3
Nice, @tanmayk thank you! I'm curious, for the OpenBLAS dependency, could we install that from a repo such as the Debian one like we do with the Tensor extension? https://github.com/RubixML/Tensor/blob/master/.github/workflows/ci-ext.yml#L62
Hey @sr13579, TF-IDF works on token frequency vectors. You can transform blobs of text into token frequency vectors using either [Word Count Vectorizer](https://docs.rubixml.com/latest/transformers/word-count-vectorizer.html) or [Token Hashing Vectorizer](https://docs.rubixml.com/latest/transformers/token-hashing-vectorizer.html).
Targetting ML 3.0 release with this since it can be construed as a backwards compatibility break.
Thanks @campino2k
Hey @nviet thanks for the great report! The reason for the `LOCK_EX` i.e. exclusive lock is to prevent another process from writing to the file at the same time a...
Hey @27pchrisl I'm interested to know if you've thought of other approaches ... for example, filtering specific categories from the dataset before OneHotEncoding it. Would a "CategoryDropper" Transformer allow for...
I believe that Hebrew is supported natively by PHP. There are even some first-class functions that deal with the Hebrew language specifically. See https://www.php.net/manual/en/function.hebrev.php for example.
Hi @TheCelavi, we have no plans to offer sparse matrix factorization in the near future. Having that said, I would love to implement sparse matrix factorization - perhaps in Rubix...
Hey @TheCelavi thanks for showing me that PHPY library. It looks like we could possibly use that to call PyTorch or even NumPy under the hood to reimplement some of...