Sadra Barikbin
Hi, yes it does:

```python
from tokenizers import Tokenizer, Regex
from tokenizers.normalizers import Replace
from tokenizers.models import BPE

my_tokenizer = Tokenizer(BPE())
my_tokenizer.normalizer = Replace(Regex('[0-9]+'), '[NUM]')
```
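To sanity-check the replacement, the normalizer can be exercised on a raw string (the sample sentence is just illustrative):

```python
# Normalization runs before the model ever sees the text,
# so every digit run is collapsed to the literal [NUM].
print(my_tokenizer.normalizer.normalize_str("I bought 3 apples for 12 dollars"))
# -> "I bought [NUM] apples for [NUM] dollars"
```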
`[NUM]` is not recognized in the first one; its id is that of the `[UNK]` token.
Hi @n1t0 @patrickvonplaten, I just came across the third issue @n1t0 mentioned:

> We can't extract these added tokens during the pre-processing step of the training, which would be desirable...
Yes, you're right. I had written it for a project of mine, so I attempted a PR.
@vfdev-5 This class is also useful when a user wants to apply such scheduling to a parameter other than the LR, or to a specific parameter group.
I can't name one, but what about the second use case, where someone wants to apply this scheduler to a specific parameter (LR) group? Currently our `LRScheduler` does not accept a...
It could work, but only by separating their optimizers, which gets cumbersome. PyTorch's `_LRScheduler` does not accept a `param_group_index`-like parameter.
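A minimal sketch of that workaround (the model and hyperparameters here are placeholders, not from the original discussion): since a `_LRScheduler` always updates every param group of the optimizer it is given, the only way to scope it is to isolate the target parameters in their own optimizer.

```python
import torch
from torch import nn
from torch.optim import SGD
from torch.optim.lr_scheduler import StepLR

model = nn.Sequential(nn.Linear(4, 8), nn.Linear(8, 2))

# Split the parameters across two optimizers so that a scheduler
# can be attached to one "group" without touching the other.
opt_backbone = SGD(model[0].parameters(), lr=0.1)
opt_head = SGD(model[1].parameters(), lr=0.01)

# The scheduler only sees opt_head, so only that LR decays.
scheduler = StepLR(opt_head, step_size=10, gamma=0.5)
```

Having to split the optimizer like this is exactly what makes the approach difficult in practice.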
I searched a little bit and found that for computing AUC there are three approaches:

1. Computing it exactly. It takes O(N log N) in time and O(N) in space. It is...
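As an illustration of the exact approach (a hedged sketch in plain Python; `exact_auc` is a name introduced here, not from any library), one can sort by score once and use the rank-sum (Mann-Whitney U) identity:

```python
def exact_auc(scores, labels):
    """Exact ROC AUC via the rank-sum identity.

    O(N log N) time for the sort, O(N) space for the rank array.
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    # Average ranks over ties so tied pairs count as 0.5.
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # 1-based average rank of the tie block
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    rank_sum = sum(r for r, y in zip(ranks, labels) if y == 1)
    # U statistic normalized by the number of positive-negative pairs.
    return (rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


print(exact_auc([0.1, 0.4, 0.35, 0.8], [0, 0, 1, 1]))  # 0.75
```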
You're right that trying to compute the metric at the end of the run is excruciating both in terms of time and in terms of memory, given large data samples. Wilcoxon-Mann-Whitney...
According to the formulas above, using two float variables initialized to zero at the beginning of the run, we apply the first two formulas at each batch and finally do...
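Since the formulas themselves are truncated above, the following is only a hypothetical reading of the two-accumulator idea: per batch, add the number of correctly ordered positive-negative pairs to one accumulator and the number of all such pairs to the other, then divide once at the end. It counts only within-batch pairs, so it approximates rather than reproduces the exact statistic.

```python
class RunningBatchwiseAUC:
    """Hypothetical two-accumulator scheme (an assumption, not the
    original formulas): accumulate, per batch, the correctly ordered
    positive-negative pairs and the total number of such pairs.
    Uses O(1) extra space across the whole run.
    """

    def __init__(self):
        self.num = 0.0  # correctly ordered pairs (ties count as 0.5)
        self.den = 0.0  # all positive-negative pairs seen so far

    def update(self, scores, labels):
        pos = [s for s, y in zip(scores, labels) if y == 1]
        neg = [s for s, y in zip(scores, labels) if y == 0]
        for p in pos:
            for n in neg:
                self.num += 1.0 if p > n else (0.5 if p == n else 0.0)
        self.den += len(pos) * len(neg)

    def compute(self):
        return self.num / self.den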