neuralcoref issues

The results are missing a lot of basic co-references ...

2

Hi there, Running this against many trivial examples and it seems to miss obvious co-references ... here's an example of what I mean: Input: "My dogs love the beach. They...

ohmeow

wontfix

perf / accuracy

How to retrain neuralcoref using ELMo?

1

neuralcoref works well for majority of our use cases, but we're trying to eek out whatever remaining bits of performance we could. I noticed in https://github.com/huggingface/neuralcoref/blob/master/neuralcoref/train/training.md there is a reference...

Hevia

wontfix

Training Dataset Format

Just want to ask a question regarding the dataset format i need to have for training, seeing there is already all the code necessary for training, evaluation, and everything, i...

aqilabdulaziz1123

Fix wrong building of Mention Type one-hot vectors

This is way to fix #340. This is done by avoiding slicing and using coordinate indexing instead for the assignment.

valedica

Wrong Mention Type one-hot vectors during training due to a small bug in dataset.py

I think there is a small bug in dataset.py that affects the building of the Mention Type one-hot vectors of antecedent mentions in the pair features during training. Due to...

valedica

Add doc embedding implementation during inference

This closes #338. The implementation follows the one of the method [get_document_embedding](https://github.com/huggingface/neuralcoref/blob/60338df6f9b0a44a6728b442193b7c66653b0731/neuralcoref/train/document.py#L534-L542) from neuralcoref.train.document, which is the method that calculates document embeddings during training.

valedica

Fix punctuation in average embedding

Fix #336

valedica

Missing implementation of doc embeddings during inference

Document embeddings are not calculated during inference in [neuralcoref.pyx](https://github.com/huggingface/neuralcoref/blob/60338df6f9b0a44a6728b442193b7c66653b0731/neuralcoref/neuralcoref.pyx), but they are left at zeros. https://github.com/huggingface/neuralcoref/blob/60338df6f9b0a44a6728b442193b7c66653b0731/neuralcoref/neuralcoref.pyx#L717 This causes a mismatch between inference and training input features (doc embeddings during training...

valedica

Wrong average embedding during inference due to a small bug in neuracoref.pyx

The average embeddings can be wrongly calculated during inference due to a small bug in neuralcoref.pyx: https://github.com/huggingface/neuralcoref/blob/60338df6f9b0a44a6728b442193b7c66653b0731/neuralcoref/neuralcoref.pyx#L896 `PUNCTS` is a list of strings, while `token.lower` is an integer hash. This...

valedica

Unexpected f-word appeared in the web demo

Try: https://huggingface.co/coref/?text=Wi-Fi `Wi-Fi` is mysteriously changed to `Wi-fuck it`.

cuihaoleo

neuralcoref
neuralcoref copied to clipboard

Metadata

The results are missing a lot of basic co-references ...

How to retrain neuralcoref using ELMo?

Training Dataset Format

Fix wrong building of Mention Type one-hot vectors

Wrong Mention Type one-hot vectors during training due to a small bug in dataset.py

Add doc embedding implementation during inference

Fix punctuation in average embedding

Missing implementation of doc embeddings during inference

Wrong average embedding during inference due to a small bug in neuracoref.pyx

Unexpected f-word appeared in the web demo

← Metadata

Owner

Metadata

neuralcoref neuralcoref copied to clipboard

Metadata

← Metadata

Owner

Metadata

neuralcoref
neuralcoref copied to clipboard