Matt Henderson

Results 13 comments of Matt Henderson

We would like to release these with a publication we are working on. We are planning to optimise the bert model a little more first

Hi @catburr - you can find our reddit model as a tensorflow-hub module here: https://github.com/PolyAI-LDN/polyai-models - it's http://models.poly-ai.com/convert/v1/model.tar.gz

In general there is a lot of questionable language in the reddit dataset, as it is totally unfiltered and we are including all subreddits including 'nsfw' ones. It is still...

Do you have any other way of running apache beam pipelines?

Hi Jason, the training set size should be 3.3M. Maybe check there are indeed 38 input files? `TOTAL: 38 objects, 1935927109 bytes (1.8 GiB)` I just re-ran the pipeline (with...

I was trying to reproduce the numpy example, but it seemed to be working on a new instance. I replaced the original comment above with a tensorflow example that is...

I think the tensorflow library is forcing static TLS for all the following imports, as it uses TLS and nmslib does not. ```bash readelf -l tensorflow/libtensorflow_framework.so | grep TLS TLS...

Actually pip uses those flags, I used `pip install -v` to confirm. I think the issue is that other `.so` libraries imported beforehand are forcing it into static TLS mode....

I confirmed above that there is no TLS line in my built `nmslib.so` file with readelf: ```bash # built normally: readelf -l nmslib.cpython-36m-x86_64-linux-gnu.so | grep TLS TLS 0x00000000003f8168 0x00000000005f8168 0x00000000005f8168...