Gokul NC comments


IIIT-D Multilingual Abusive Comment Identification

Related work: **Multilingual Abusive Comment Detection (MACD) at Scale for Indic Languages**
Dataset: https://github.com/ShareChatAI/MACD
Langs: Hindi, Tamil, Telugu, Malayalam and Kannada
Count: 30k samples per lang on avg.

BARD Bangla Article Classifier

BanFakeNews: https://github.com/Rowan1697/FakeNews
Paper: https://arxiv.org/pdf/2004.08789.pdf

how to find the pretrained models

Yeah, I can't find the trained model that the repo author cites in his README either. @qidiso Can you please upload the trained model for the accuracies that you...

how to find the pretrained models

@quocnhat : Here's the new link: https://github.com/aidlearning/AidLearning-FrameWork/tree/master/src/facencnn/models All I had to do was browse the repo patiently for a minute. :)

Marian does not do Sanity Checking before inserting into SQLite

Thanks. OK, I will try that. It might also be that the monolingual data I used to generate the back-translations is very diverse compared to the SPM vocab learned from the 2M...
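One rough way to test that suspicion is to measure how many characters in the monolingual data never occurred in the corpus the vocab was learned from. The sketch below is only an illustration under that assumption: `unseen_char_rate` is a hypothetical helper, it works on raw characters rather than on the SPM model itself, and the toy strings stand in for real corpora.

```python
def unseen_char_rate(vocab_corpus_lines, new_lines):
    """Fraction of non-space characters in new_lines that never
    appear anywhere in vocab_corpus_lines."""
    seen = set()
    for line in vocab_corpus_lines:
        seen.update(line)

    total = 0
    unseen = 0
    for line in new_lines:
        for ch in line:
            if ch.isspace():
                continue  # whitespace is not informative here
            total += 1
            if ch not in seen:
                unseen += 1
    # Guard against empty input so the ratio is always defined.
    return unseen / total if total else 0.0


# Toy data: "d" is the only character missing from the vocab corpus.
rate = unseen_char_rate(["abc", "cab"], ["abcd", "ab"])
```

A noticeably higher rate on the monolingual data than on held-out parallel data would be consistent with the mismatch described above, though it would not prove it.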

Marian does not do Sanity Checking before inserting into SQLite

Thanks @emjotde, I finally managed to find it. I think some kind of **SQL(ite) injection** happened because of a noisy line in the data. So my corpus was perfectly...
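For illustration, this is the general way to keep a noisy line from being interpreted as SQL: pass it as a bound parameter instead of pasting it into the statement text. The sketch uses Python's standard `sqlite3` module and a made-up `corpus` table; it is not Marian's code (Marian is written in C++), and the noisy example line is invented.

```python
import sqlite3


def insert_lines(conn, lines):
    """Insert raw corpus lines using bound parameters, never string formatting."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS corpus (id INTEGER PRIMARY KEY, line TEXT)"
    )
    # The "?" placeholder makes SQLite treat each line purely as data,
    # so quotes or semicolons inside a line cannot change the statement.
    conn.executemany(
        "INSERT INTO corpus (line) VALUES (?)", ((line,) for line in lines)
    )
    conn.commit()


conn = sqlite3.connect(":memory:")
noisy = ["a clean sentence", "it's got 'quotes'); DROP TABLE corpus; --"]
insert_lines(conn, noisy)

# Both lines come back byte-for-byte, and the table still exists.
stored = [row[0] for row in conn.execute("SELECT line FROM corpus ORDER BY id")]
```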

FP16 support

I guess the gradients seem to be exploding when training with `--fp16`.

```
[2021-03-09 13:30:04] NaN/Inf percentage 1.00 in 10 gradient updates, but cost-scaling factor 7.62939e-06 is already at...
```
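For context on what a "cost-scaling factor" usually does, here is a minimal sketch of generic dynamic loss scaling as used in mixed-precision training: the scale is cut whenever a NaN/Inf gradient shows up and raised again after a run of clean steps. The class name, defaults, and floor value are assumptions for illustration; Marian's actual logic and thresholds may differ.

```python
import math


class DynamicLossScaler:
    """Generic dynamic loss scaling (illustrative, not Marian's implementation)."""

    def __init__(self, scale=2.0 ** 15, factor=2.0, window=1000, min_scale=1.0):
        self.scale = scale          # current multiplier applied to the loss
        self.factor = factor        # how much to shrink or grow the scale
        self.window = window        # clean steps required before growing
        self.min_scale = min_scale  # floor below which we stop shrinking
        self.good_steps = 0

    def update(self, grads):
        """Return True if the step should be applied, False if it is skipped."""
        overflow = any(math.isnan(g) or math.isinf(g) for g in grads)
        if overflow:
            # Shrink the scale and skip this update entirely.
            self.scale = max(self.scale / self.factor, self.min_scale)
            self.good_steps = 0
            return False
        self.good_steps += 1
        if self.good_steps >= self.window:
            # Enough clean steps in a row: try a larger scale again.
            self.scale *= self.factor
            self.good_steps = 0
        return True


scaler = DynamicLossScaler(scale=8.0, window=2)
steps = ([1.0, 2.0], [float("nan")], [0.5], [0.25])
applied = [scaler.update(g) for g in steps]
```

If every update overflows, a scheme like this keeps shrinking the scale until it reaches its floor, at which point scaling alone can no longer help. That would be consistent with a very small factor paired with a NaN/Inf percentage of 1.00.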

Unable to use quantized s2s models on CPU using marian-conv

@emjotde I am calling it using `marian-server` with the `-c decoder.yml` config, after replacing the model name with the converted `model.bin`. I also just tried `marian-decoder` (using **`intgemm8`**...

Unable to use quantized s2s models on CPU using marian-conv

Please find attached the log (which has the command and model config): [server.log](https://github.com/marian-nmt/marian-dev/files/6128067/server.log)

decoder.yml:

```
relative-paths: false
models:
  - model.bin
vocabs:
  - model/vocab.en.spm
  - model/vocab.hi.spm
beam-size: 8
normalize: 0
word-penalty:...
```

Unable to use quantized s2s models on CPU using marian-conv

Yes, the trained `model.npz` model works perfectly.