Jeffrey Wardman comments

Results 36 comments of


                                            Jeffrey Wardman

Mixup or CutMix augmentations

@bmabey Randomly sampling within the dataloader for multiple images. Would be nice to augment labels as well. These can then be fed in with ``` AUGMENTATION = MIXUP(p=0.6) # Second...

Only search source for designated platform and python

Thanks for linking the preexisting issue. I missed it; my apologies. Surely there's a simple way to add a filter=* argument that will skip attempting to download files that don't...

Rescale layer in whisper processor

```python import torch from transformers import WhisperProcessor, WhisperForConditionalGeneration from datasets import load_dataset from transformers import AutoProcessor, AutoModelForCTC def inference(input, processor, model): output = processor(input, sampling_rate=16000, return_tensors="pt") if "whisper" in processor.tokenizer_class.lower():...

Rescale layer in whisper processor

You can see in the above that the transcript is gibberish for the unscaled whisper model. This is because it is taking in as input the range [0, 65535] rather...

[Whisper] Word level and character level timestamps

This approach with DTW is more memory efficient and scalable: https://github.com/linto-ai/whisper-timestamped

[Whisper] Word level and character level timestamps

Just going to bump this. There are several solutions out there and this is a pretty key missing feature from the transformer implementation of Whisper. E.g. https://github.com/jianfch/stable-ts/blob/main/stable_whisper/whisper_word_level.py

word-level timestamps in `transcribe()`

@jongwook is there a way to access it via a beta flag for instance? How can we know when something is/isn't added to the API?

Pointdata and celldata lost when decimating mesh

Unfortunately neither solution is suitable for me. The former really struggles with curves and creates a lot of triangles that look like dents. The latter takes far too long (to...

Pointdata and celldata lost when decimating mesh

Do you have a link for the issue in VTK's GItLab repository? Is there a way to allow decimate to return the removed/remaining point indices? That would resolve the issue....

cut_with_box with pointdata creates noise along boundary

No worries. I cannot reproduce it with the mesh in the example below. I shared with you privately a link to an open source dataset. This is the code: ```...