Ming-Hsuan-Tu issues

Results: 77 issues of Ming-Hsuan-Tu

large number of bounding boxes

The maximum number of CUDA threads per block is 1024, so if the number of bounding boxes exceeds 1024 (count > 1024), the current implementation won't work. See https://github.com/jeetkanjani7/Parallel_NMS/blob/master/GPU/nms.cu#L104. Any idea?
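One common way around a per-block thread limit is to launch several blocks and derive a global index from the block and thread indices, with a bounds check for the last, partially filled block. The sketch below emulates that arithmetic in plain Python so it can be run without a GPU. It is only an illustration: `launch_config` and `covered_indices` are hypothetical helper names, and it assumes one thread per bounding box, which may not match how the linked kernel actually indexes its boxes.

```python
MAX_THREADS_PER_BLOCK = 1024  # CUDA limit on threads in a single block


def launch_config(count, threads_per_block=MAX_THREADS_PER_BLOCK):
    """Return (blocks, threads) such that blocks * threads >= count.

    Hypothetical helper: mirrors the usual ceil-division used to size a 1D grid.
    """
    if count <= 0:
        raise ValueError("count must be positive")
    threads = min(threads_per_block, MAX_THREADS_PER_BLOCK)
    blocks = (count + threads - 1) // threads  # ceil(count / threads)
    return blocks, threads


def covered_indices(count, blocks, threads):
    """Emulate idx = blockIdx.x * blockDim.x + threadIdx.x with a bounds check."""
    indices = []
    for block_idx in range(blocks):
        for thread_idx in range(threads):
            idx = block_idx * threads + thread_idx
            if idx < count:  # threads past the end of the data do nothing
                indices.append(idx)
    return indices


if __name__ == "__main__":
    blocks, threads = launch_config(3000)
    print(blocks, threads)
    print(len(covered_indices(3000, blocks, threads)))
```

With 3000 boxes this yields 3 blocks of 1024 threads, and every box index from 0 to 2999 is visited exactly once. A pairwise NMS kernel would typically need the same idea in two dimensions (one grid axis per box in the pair), which is left out here.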

Any plan for dinov2?

DinoV2 has shown remarkable performance on downstream tasks, but its Vision Transformer (ViT) backbone is computationally expensive. Do you have plans to train an EfficientViT version of DinoV2?

Failed to load vit_large_patch14_clip_336.openai

converter

```bash
python convert-pth-to-ggml.py --model_name vit_large_patch14_clip_336.openai --ftype 1
```

runner

```bash
./bin/vit -t 4 -m ../ggml-model-f16.gguf -i ../assets/tench.jpg
```

![image](https://github.com/staghado/vit.cpp/assets/1567200/3dbcb03c-3616-4ee4-866a-9b254bba7c3d)

Any idea?

is it possible to use cuda

Hi, is it possible to use CUDA for inference?

sometimes no response or not stable

![image](https://github.com/collabora/WhisperFusion/assets/1567200/9115261d-a29e-4e09-8109-9403de09305d)

Sometimes there is no response, and sometimes no text-to-speech output is generated. Why?

which browser you used?

I am experiencing issues when using Firefox and Chromium. In Firefox, it throws this exception in main.js:

```
Connecting AudioNodes from AudioContexts with different sample-rate is currently not supported....
```

exec /__e/node20/bin/node: no such file or directory

```yml
on:
  pull_request:
    types: [opened, synchronize]
name: Tag Created
jobs:
  tag-created:
    runs-on: rtx4090-1
    container:
      image: nvcr.io/nvidia/l4t-tensorrt:r8.0.1-runtime
    steps:
      - name: Checkout code
        uses: actions/checkout@v4
        with:
          ref: v0.2.0
      ...
```

I am...