Jacky Lee comments

Results 47 comments of


                                            Jacky Lee

[Bug] Owlv2 Zero-Shot Object Detection

Thanks @NielsRogge! This worked for me: ``` import torch import requests from PIL import Image, ImageDraw from transformers import AutoProcessor, AutoModelForZeroShotObjectDetection checkpoint="google/owlv2-base-patch16-ensemble" model = AutoModelForZeroShotObjectDetection.from_pretrained(checkpoint) processor = AutoProcessor.from_pretrained(checkpoint) url =...

LLVM/GPU CI Tests are slow

The [GitHub-hosted runners don't have a GPU](https://docs.github.com/en/actions/using-github-hosted-runners/about-github-hosted-runners#supported-runners-and-hardware-resources), so CL uses the CPU for GPU tests. I ran a [modified test](https://github.com/jla524/tinygrad/actions/runs/4090705359/jobs/7054302509) to confirm this. ``` >Run python -c "import pyopencl as...

LLVM/GPU CI Tests are slow

Sounds good. Here are the slowest tests on GPU: ``` 97.35s call test/test_train.py::TestTrain::test_efficientnet 44.74s call test/test_ops.py::TestOps::test_conv2d 40.05s call test/test_train.py::TestTrain::test_resnet 32.05s call test/test_mnist.py::TestMNIST::test_conv 30.07s call test/test_onnx.py::TestOnnxModel::test_benchmark_openpilot_model 27.15s call test/test_mnist.py::TestMNIST::test_sgd 22.70s call...

LLVM/GPU CI Tests are slow

Most of the time is spent on `optim.step()`, and it looks like [Adam is much slower than SGD](https://discuss.pytorch.org/t/optimizer-step-the-slowest/90820). With Adam: ``` test/test_train.py::TestTrain::test_efficientnet stepping with 5.3M params bs 2 sampling took...

Added kaiming_uniform initialization for Conv2d and Linear layers

I've added some tests to check if the distributions match torch. We can add something like this in `test/test_randomness.py`. ``` def test_kaiming_uniform(self): self.assertFalse(normal_test(Tensor.glorot_uniform)) self.assertTrue(equal_distribution(Tensor.kaiming_uniform, lambda x: torch.nn.init.kaiming_uniform_(torch.empty(x)), lambda x: (np.random.rand(*x)...

Jacky Lee

[Bug] Owlv2 Zero-Shot Object Detection

LLVM/GPU CI Tests are slow

LLVM/GPU CI Tests are slow

LLVM/GPU CI Tests are slow

Added kaiming_uniform initialization for Conv2d and Linear layers

add std to tensor.py

add std to tensor.py

add std to tensor.py

Add MLPerf UNet3D model

Add MLPerf UNet3D model