castor issues

Use most recent version of torchtext

Currently, we're using torchtext 0.2.* -- we should update to the next major version, 0.3.

daemon

Fix tokenizer for reuters dataset

1

Need to remove a few characters ( like `?`, `!` ) from sentences. In other words, add a few relevant delimiters.

Ashutosh-Adhikari

Dataset path mismatch

2

so far, 2018-08-18. the data path using in the Castor/sm_cnn/create_dataset.sh such as ''../../Castor-data/TrecQA'' is NOT match with the real path in Castor-data dir. can you please check it?

liudonglei

other dataset trained on vdpwi

4

I saw the dataset loading fixed to four dataset(sick,msrvid, trecqa, wikiqa). I wanted to know how to trained vdpwi with other datasets. what's more, how to reasonably organize the dataset....

xyx-x

Hyper-parameter tuning for VDPWI

5

According to @daemon - the VDPWI works https://github.com/castorini/Castor/tree/master/vdpwi But the effectiveness is still below STOA because the hyper-parameters haven't been tuned yet.

lintool

NCE CNN refactoring to match MP CNN

Will do this after #128

Victor0118

SM CNN refactoring to match MP CNN

SM CNN needs to be refactored to match the API of MP CNN.

lintool

conv_rnn refactoring

2

Ref #99 `conv_rnn` and `kim_cnn` are both sentence classification models - they should share the same API, and in general be structured the same way. @Impavidity @daemon please coordinate on...

lintool

High-level documentation for using MP-CNN and VDPWI (forward inference)

2

@tuzhucheng Can you check in a MP-CNN model in `Caster-models/` and write up instructions on how to actually use it? I should be able to open a Python shell, copy-and-paste...

lintool

Given our code clean-up, it should be fairly straightforward to build a demo iPython notebook for MP-CNN to walkthrough its features? We can also try https://github.com/szagoruyko/pytorchviz to visualize e.g., https://github.com/szagoruyko/pytorchviz/blob/master/examples.ipynb

lintool

castor
castor copied to clipboard

Metadata

Use most recent version of torchtext

Fix tokenizer for reuters dataset

Dataset path mismatch

other dataset trained on vdpwi

Hyper-parameter tuning for VDPWI

NCE CNN refactoring to match MP CNN

SM CNN refactoring to match MP CNN

conv_rnn refactoring

High-level documentation for using MP-CNN and VDPWI (forward inference)

iPython notebook for MP-CNN

← Metadata

Owner

Metadata

castor castor copied to clipboard

Metadata

← Metadata

Owner

Metadata

castor
castor copied to clipboard