Bo Wang
Why is, for example, 0800 555 111 356 included in the generated vocab file? This example is at line 23163. Or is it just me who has this problem? ```...
Why is it that I keep getting the "`ImportError: No module named _emd`" error from emd.py? I use Python 2.7. May I ask what '_emd' is? I assume it's not the...
I want to align my audio recordings with their corresponding transcripts. There are a lot of pauses and silences in my audio. I want **multi-level alignment** (mainly word-level and segment/paragraph-level)...
Thanks so much for your code, hunkim! It is very helpful! Can I ask a quick question, please? Am I right to think: within one batch, every time you feed...
I am running Ubuntu 18.04 with CUDA 10. I followed `Setup & Installation (TL;DR) - Train model with Conda Environment`. > python3.6 demo.py > Found existing ./models folder,...
Are you sure this line is correct? `X_train = X_train - np.mean(X_train)` `np.mean(X_train)` gives a single value. Shouldn't it be `np.mean(X_train, 0)`?
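To make the difference concrete, here is a minimal sketch on made-up data. The array values and the variable names other than `X_train` are hypothetical, not taken from the repository. It only shows what each call computes; which one is intended depends on the data, since a single global mean can be a deliberate choice.

```python
import numpy as np

# Hypothetical toy data: 3 samples, 2 features on very different scales.
X_train = np.array([[1.0, 100.0],
                    [2.0, 200.0],
                    [3.0, 300.0]])

# A single scalar: the mean over every entry in the array.
global_mean = np.mean(X_train)

# One mean per column (feature), shape (2,).
feature_means = np.mean(X_train, axis=0)

# Broadcasting subtracts the same scalar from every entry ...
centered_global = X_train - global_mean

# ... or subtracts each column's own mean from that column.
centered_per_feature = X_train - feature_means

print(centered_global.mean(axis=0))       # per-column means after scalar centering
print(centered_per_feature.mean(axis=0))  # per-column means after per-feature centering
```

With the scalar version the columns are generally not zero-centered individually, whereas with `axis=0` each column ends up with mean zero.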
Thanks for your great work Joe! Following [the provided notebook](https://github.com/joewandy/hlda/blob/master/notebooks/bbc_test.ipynb), I have been trying to use hlda to infer topics on a large set (~100,000 docs) of short text docs...
Reading the way labels are being handled in the [training script](https://github.com/AndriyMulyar/bert_document_classification/blob/572883204cb1aca50d346979319905f698ad7049/examples/ml4health_2019_replication/predict_n2c2_2008.py) and [pred script](https://github.com/AndriyMulyar/bert_document_classification/blob/572883204cb1aca50d346979319905f698ad7049/examples/ml4health_2019_replication/predict_n2c2_2008.py), for each label (i.e. "Obesity" and the co-morbidities) the classes are simply converted to binary (`if intuitive[name]...
Hi, I am trying to install from source with Cython enabled. However, there seem to be a lot of errors: `python3 setup.py build` > running build > running build_py >...
Hi, thanks for the library! I am integrating the `sectionizer` into my data pipeline, which uses spaCy v3. `sectionizer.py` says: ``` Section attributes will be registered for each...