Paul Baclace

Results 6 comments of Paul Baclace

It would be nice to have instructions for adding new data. I'm looking to use DocBERT for classification using 1-2 thousand tokens. Looking at the tsv file. In hedwig-data/datasets/Reuters/train.csv it...

@achyudh Thank you for your clarifications and suggestion. In sec. 5.2 of the DocBERT paper, it says "...we conclude that any amount of truncation is detrimental in document classification..." Perhaps...

I am also attempting the multilabel case. I defined an attribute "type" and have 20 variations. None of them overlap in the areas I defined using VGG. The output of...

I have 20 annotation types for my work at Internet Archive to dissect images of journals, like { cover, footer, header, page_num, references, body, begin_article, ad, editorial, contributors...}. When trying...

This is misnamed due to a misunderstanding: should be "Classification limitation should be documented". More detail: You can have up to 7 labels. If you define more than that, an...

I've been wanting this one for a long time since I have experience generating 20 4-node clusters. I avoid HTTP 503 (Request limit exceeded) by ad hoc timing at the...