Bo Wang issues

Results: 17 issues of Bo Wang

Incorrectly formatted line in vocabulary file

Why is, for example, 0800 555 111 356 included in the generated vocab file? This example is at line 23163. Or is it just me who has this problem? ```...

ImportError: No module named _emd

Why is it that I keep getting an "`ImportError: No module named _emd`" error from emd.py? I use Python 2.7. May I ask what '_emd' is? I assume it's not the...

multi-level alignment with "task_adjust_boundary_nonspeech_min"

I want to align my audio recordings with their corresponding transcripts. There are a lot of pauses and silences in my audio. I want **multi-level alignment** (mainly word-level and segment/paragraph-level)...

rnn_decoder initial_state

Thanks so much for your code hunkim! It is very helpful! Can I ask a quick question please? Am I right to think: within one batch, every time you feed...

'BucketingDataLoader' object has no attribute 'db'

I am running Ubuntu 18.04 with CUDA 10. I have followed `Setup & Installation (TL;DR) - Train model with Conda Environment`. > python3.6 demo.py > Found existing ./models folder,...

Subtracting mean embeddings

Are you sure this line is correct? `X_train = X_train - np.mean(X_train)` Here `np.mean(X_train)` gives a single value. Shouldn't it be `np.mean(X_train, 0)`?
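To illustrate the difference the question points at, here is a minimal sketch using a made-up 3x2 matrix in place of the real `X_train` (the values are assumptions for illustration only, not data from the repository). Without an axis argument, `np.mean` averages over every element and returns one scalar; with axis `0` it returns one mean per column, which is what per-dimension centering of embeddings needs.

```python
import numpy as np

# Toy stand-in for X_train: 3 samples (rows) x 2 embedding dimensions (columns).
# These values are hypothetical and chosen only to make the arithmetic easy to follow.
X_train = np.array([[1.0, 10.0],
                    [2.0, 20.0],
                    [3.0, 30.0]])

# No axis: mean over all six elements, a single scalar.
global_mean = np.mean(X_train)

# Axis 0: mean over the rows, giving one value per column (shape (2,)).
column_mean = np.mean(X_train, 0)

# Subtracting the scalar shifts every entry by the same amount,
# so the individual columns are generally not zero-centered.
centered_global = X_train - global_mean

# Subtracting the per-column mean broadcasts across rows
# and zero-centers each embedding dimension separately.
centered_columns = X_train - column_mean

print(global_mean)                     # 11.0
print(column_mean)                     # [ 2. 20.]
print(centered_global.mean(axis=0))    # [-9.  9.]
print(centered_columns.mean(axis=0))   # [0. 0.]
```

Whether the scalar version was intentional in the original code cannot be determined from the snippet alone; the sketch only shows that the two expressions are not equivalent.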

Choosing parameters for large dataset of short texts

Thanks for your great work Joe! Following [the provided notebook](https://github.com/joewandy/hlda/blob/master/notebooks/bbc_test.ipynb), I have been trying to use hlda to infer topics on a large set (~100,000 docs) of short text docs...

Incorrect evaluation for 2008 Obesity Challenge?

Reading the way labels are being handled in the [training script](https://github.com/AndriyMulyar/bert_document_classification/blob/572883204cb1aca50d346979319905f698ad7049/examples/ml4health_2019_replication/predict_n2c2_2008.py) and [pred script](https://github.com/AndriyMulyar/bert_document_classification/blob/572883204cb1aca50d346979319905f698ad7049/examples/ml4health_2019_replication/predict_n2c2_2008.py), for each label (i.e. "Obesity" and the co-morbidities) the classes are simply converted to binary (`if intuitive[name]...

Installing from source: too many errors emitted

Hi, I am trying to install from source with Cython enabled. However, there seem to be a lot of errors: `python3 setup.py build` > running build > running build_py >...

How are section attributes registered?

Hi, thanks for the library! I am integrating the `sectionizer` into my data pipeline, which uses spaCy v3. In `sectionizer.py` it says: ``` Section attributes will be registered for each...