Summarizer
hdf5 dataset generation
Allows the creation of an .h5 file from videos and .npy annotations (e.g. for the Twitch-LOL dataset).
May be extended later to generalize to other datasets (that don't have binary labels) or to label formats other than .npy.
Possible improvements (could be added after merge):
- [ ] Adding extra flags in `generate_dataset.py` to control the KTS parameters from `utils/KTS/cpd_auto.py` and `utils/KTS/cpd_nonlin.pyx`.
- [ ] Handling non-binary score labels in addition to summary labels.
- [ ] Handling formats other than .npy for importing labels.
The SumMe and TVSum datasets have a 2-second uniform segmentation version (that's been used for Twitch-LOL) uploaded to a dedicated folder.
This could be useful for an ablation study, for example.
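
For reference, here is a minimal sketch of what a 2-second uniform segmentation could look like. It is not the code from this PR: the function name `uniform_change_points`, its parameters, and the inclusive `(first_frame, last_frame)` output format are assumptions made for illustration.

```python
def uniform_change_points(n_frames, fps, segment_seconds=2.0):
    """Split a video into consecutive segments of roughly segment_seconds.

    Hypothetical helper: returns a list of (first_frame, last_frame)
    pairs, both ends inclusive. The last segment may be shorter.
    """
    if n_frames <= 0:
        return []
    # Frames per segment, rounded to the nearest frame, at least one.
    seg_len = max(1, int(round(fps * segment_seconds)))
    segments = []
    for start in range(0, n_frames, seg_len):
        # Clamp the end of the final segment to the last frame index.
        end = min(start + seg_len, n_frames) - 1
        segments.append((start, end))
    return segments


if __name__ == "__main__":
    # A 5-second clip at 30 fps: two full segments and a shorter tail.
    print(uniform_change_points(150, 30))
```

Unlike KTS, this segmentation ignores the video content, so comparing the two isolates the effect of the change-point detection step.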