Andriy Mulyar comments

Results 82 comments of


Andriy Mulyar

Micro/Macro F1 score calculation is over-optimistic

Samuel, Thank you for pointing this out. In the original datasets considered for evaluation, performance was calculated in the multi-class manner. I will be pushing out an update that will...

How long is long?

Yes - but it will of course use more memory. I'd suggest to try finetuning a pretrained longformer if your documents truly are that long.

mean vs identity pooling?

Yes if you fix the number of chunks. On Mon, Nov 2, 2020, 10:27 AM Vipula Rawte wrote: > Hi, > > The papers describe four pooling functions: 1. Mean,...

mean vs identity pooling?

Hi, the public codebase just hasn't been updated. You can change the pooling from max to mean in the implementation to replicate the stated results in the paper. Cheers

The latest version of the code

The code is up to the date with the correction. On Mon, Nov 16, 2020, 6:26 AM chokchou wrote: > hello, I noticed your new paper on arXiv submitted on...

backpropgation on chunks?

1. Yes, separate for every chunk. 2. In our datasets we found it sufficient to fine-tune only the final transformer layer. On Fri, Oct 2, 2020, 11:12 AM Vipula Rawte...

Problems reproducing results on "I2B2 2006: Smoker Identification" dataset

Hi Arne, Thank you for your carefully laid out issue. What you tried seems reasonable. This repository contains code transferred from the original implemention - I will cross check with...

Problems reproducing results on "I2B2 2006: Smoker Identification" dataset

Arne, Thank you again for the continued detailed analysis. What you quoted indeed disagrees with our original implementation. This may very well be be an error in writing/interpretation on our...

How to interpret the result?

Hi, these are label indicators.

[WIP] ENH: Hellinger distance tree split criterion for imbalanced data classification

Hello, I have also conducted some investigations in utilizing the Hellinger Distance as a DT impurity criterion. I have my own cython implementation incorporated into a fork of sklearn along...