Nicolas Patry

978 comments of Nicolas Patry

> `audio-token-classification`? :scream:

You're actually pretty spot on IMO, since `token-classification` is actually `text-segmentation` I think. It's also aligned with `image-segmentation`, which basically should be a list of "objects"...
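For readers wondering what "a list of objects" looks like in practice, here is a minimal sketch of the current `token-classification` output (the model name is only an example, not something from this thread):

```python
# Minimal sketch: `token-classification` already returns a list of span "objects",
# which is why it lines up conceptually with `image-segmentation`.
from transformers import pipeline

ner = pipeline(
    "token-classification",
    model="dslim/bert-base-NER",        # example model, not from the thread
    aggregation_strategy="simple",      # group sub-tokens into whole spans
)
print(ner("My name is Nicolas and I work at Hugging Face in Paris."))
# -> [{'entity_group': 'PER', 'word': 'Nicolas', 'start': 11, 'end': 18, 'score': ...}, ...]
```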

`speech-segmentation` was never deprecated, but it also never had widget support afaik. Its output is not `audio`, so I don't see how `audio-to-audio` could be used: https://github.com/huggingface/huggingface_hub/blob/main/api-inference-community/docker_images/superb/app/pipelines/speech_segmentation.py

I think we can keep the PR as is, merge it when ready, so things are functional (even though less than perfect). And when support for `audio-segmentation` is ready (or...

I think it fits `fill-mask` quite nicely, in the sense that, given a masked input, the model should tell us what should be under the mask. Now potential caveats/pains:
- Currently...
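For context, a minimal sketch of what `fill-mask` does today (the model name and mask token are only examples; other models use different mask tokens):

```python
# Minimal sketch of the fill-mask task: given a masked input, the model ranks
# candidate tokens for the masked position.
from transformers import pipeline

unmasker = pipeline("fill-mask", model="bert-base-uncased")  # example model
print(unmasker("Paris is the [MASK] of France."))
# -> a list of candidates, each with a score, the predicted token, and the filled sequence
```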

You could try to create inverse scripts for the conversion you found, but it's not going to be trivial. You need to create the protobuf that sentencepiece expects. Not sure I...
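To give an idea of what "the protobuf sentencepiece expects" looks like, here is a minimal sketch that reads and edits an existing model proto. It assumes the installed `sentencepiece` package ships its generated `sentencepiece_model_pb2` module (recent versions do); building one from scratch would also mean filling in `trainer_spec` and `normalizer_spec`, which is the non-trivial part:

```python
# Minimal sketch: load, inspect, and modify a sentencepiece ModelProto.
from sentencepiece import sentencepiece_model_pb2 as sp_model

m = sp_model.ModelProto()
with open("spiece.model", "rb") as f:      # hypothetical existing model file
    m.ParseFromString(f.read())

# Each vocabulary entry is a SentencePiece message with a `piece` and a `score`.
new_piece = m.pieces.add()
new_piece.piece = "<new_token>"
new_piece.score = 0.0

with open("spiece_modified.model", "wb") as f:
    f.write(m.SerializeToString())
```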

Awesome. Do you mind explaining a little more or giving links for potential readers that would want to do the same?

Everything you need is here: https://github.com/huggingface/transformers/blob/main/src/transformers/convert_slow_tokenizer.py There is no simple tutorial; there are many configurations in `tokenizers` that could achieve what you want, with various tradeoffs. What I recommend is...
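As an illustration of one such configuration, here is a minimal sketch of a WordPiece/BERT-style setup; `convert_slow_tokenizer.py` covers many other model families with different components and tradeoffs, so treat this only as one possible combination:

```python
# Minimal sketch: assembling a fast tokenizer from components, assuming a
# BERT-style vocab.txt that contains [UNK], [CLS] and [SEP].
from tokenizers import Tokenizer, normalizers, pre_tokenizers, processors
from tokenizers.models import WordPiece

tokenizer = Tokenizer(WordPiece.from_file("vocab.txt", unk_token="[UNK]"))
tokenizer.normalizer = normalizers.BertNormalizer(lowercase=True)
tokenizer.pre_tokenizer = pre_tokenizers.BertPreTokenizer()
tokenizer.post_processor = processors.TemplateProcessing(
    single="[CLS] $A [SEP]",
    pair="[CLS] $A [SEP] $B:1 [SEP]:1",
    special_tokens=[
        ("[CLS]", tokenizer.token_to_id("[CLS]")),
        ("[SEP]", tokenizer.token_to_id("[SEP]")),
    ],
)
tokenizer.save("tokenizer.json")
```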

That would be nice, but it requires pretty much turning `generate` upside down and inside out. This is what we have done here: https://github.com/huggingface/text-generation-inference which was required to get max performance...

The pipeline is stateless, so it cannot keep the `past_key_values`, and having you send them again and again kind of defeats the purpose of a pipeline imo (since you can't...
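For reference, a minimal sketch of what keeping `past_key_values` yourself looks like when calling the model directly instead of going through a pipeline (the model name and the greedy loop are illustrative only, not how `text-generation-inference` does it):

```python
# Minimal sketch: hold the KV cache across steps yourself and only feed the new token.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")   # example model
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

inputs = tokenizer("Hello, my name is", return_tensors="pt")
generated = inputs["input_ids"]
input_ids = generated
past_key_values = None

for _ in range(20):
    with torch.no_grad():
        out = model(input_ids=input_ids, past_key_values=past_key_values, use_cache=True)
    next_token = out.logits[:, -1, :].argmax(dim=-1, keepdim=True)
    past_key_values = out.past_key_values   # cache is reused on the next step
    input_ids = next_token                  # only the new token is fed in
    generated = torch.cat([generated, next_token], dim=-1)

print(tokenizer.decode(generated[0]))
```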

@OlivierDehaene Tagging you just because we were talking about the streaming process in `text-generation-inference` :)