MLX: An array framework for Apple silicon
```python
import mlx.core as mx

a = mx.random.uniform(shape=(1024, 1024, 1024, 3))
mx.eval(a)
```
Fails with:
```
RuntimeError: cudaGraphAddKernelNode(&node, graph_, NULL, 0, &params) failed: invalid argument
```
```
python python/tests/test_conv.py TestConv.test_torch_conv_2D
```
It hangs in one of the grouped conv tests.
```
python python/tests/test_blas.py -v
```
A bunch of failures:
```
test_matmul_shapes (__main__.TestBlas.test_matmul_shapes) ...
test_matmul_shapes (__main__.TestBlas.test_matmul_shapes) (dtype='float32', shape_a=(1, 2, 1), shape_b=(1, 1, 1), transpose='nn') ... FAIL
test_matmul_shapes (__main__.TestBlas.test_matmul_shapes) (dtype='float32', shape_a=(1, 2,...
```
**Describe the bug**
When profiling gpt-oss models ([add_profiling_suppport](https://github.com/ml-explore/mlx-lm/pull/601)), profiling the prefill becomes extremely slow and eventually throws a timed...
Hi MLX Team, Thank you for developing such an outstanding package! I’ve been using **MLX** recently and noticed that the function **`mlx.linalg.svd`** currently supports `float32` and `float64`, while **`mlx.linalg.eig`** only...
```python
import mlx.core as mx

def fun():
    for _ in range(1000):
        mx.random.randint(1, 10)

fun()

print(mx.random.randint(0, 10, shape=(32, 32)))
```
Evaluating the last line causes 1k split kernels to run since...
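A minimal sketch of a workaround under MLX's lazy-evaluation model (assuming the intermediate draws are actually wanted): forcing evaluation inside the loop keeps the pending graph, and thus the number of queued key-split kernels, bounded.

```python
import mlx.core as mx

def fun():
    for _ in range(1000):
        # Evaluating each draw as it is made keeps the lazy graph small,
        # so the global PRNG key does not accumulate 1000 pending splits.
        mx.eval(mx.random.randint(1, 10))

fun()
print(mx.random.randint(0, 10, shape=(32, 32)))
```

Another option is to sidestep the global state entirely by pre-splitting an explicit key with `mx.random.split` and passing each subkey via the `key=` argument of `mx.random.randint`.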
Hi, I'm wondering if anyone is working on implementing Metal kernels for sparse matrix multiplication. I'd like to try implementing this myself, but want to make sure the community would...
## Proposed changes

Extended dtype support for `mlx.linalg.svd` and `mlx.linalg.eig` as requested in #2708.

**Changes:**
- Added `float64` support for `mlx.linalg.eig` (CPU)
- Added `complex64` support for `mlx.linalg.svd` (CPU)
- ...
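If the PR lands as described, usage might look like the sketch below. This is a hedged illustration, not code from the PR: the shapes are arbitrary, and it relies on MLX's existing convention that linear-algebra ops run on the CPU stream.

```python
import mlx.core as mx

# complex64 SVD on CPU -- assumes this PR's extended dtype support.
a = mx.random.normal((4, 4)).astype(mx.complex64)
u, s, vt = mx.linalg.svd(a, stream=mx.cpu)

# float64 eigendecomposition on CPU -- likewise assumes this PR.
b = mx.random.normal((4, 4)).astype(mx.float64)
vals, vecs = mx.linalg.eig(b, stream=mx.cpu)
```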
## Proposed changes

- `array` class prefers `int64_t` instead of `size_t`
- `SmallVector` is inherently small -- sizes are now `int`
- propagate the signedness through the codebase and fix...
Hi @awni, I've been working on a quantizable Conv2D layer that is a drop-in replacement for Conv2D (for a large conv UNet, ~5GB, with self attention, cross attention and...
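Not the author's code, but for readers wondering what such a layer could look like: below is a minimal sketch of a drop-in wrapper that stores the kernel with `mx.quantize` and dequantizes it on the fly in `__call__`. The class name, the kernel reshape, and the divisibility assumption on `group_size` are all hypothetical.

```python
import mlx.core as mx
import mlx.nn as nn

class QuantizedConv2d(nn.Module):
    """Hypothetical drop-in for nn.Conv2d: the kernel is stored quantized
    and dequantized on each forward pass. Assumes kH * kW * C_in is
    divisible by group_size, as mx.quantize requires on the last axis."""

    def __init__(self, conv: nn.Conv2d, group_size: int = 64, bits: int = 4):
        super().__init__()
        self.stride, self.padding = conv.stride, conv.padding
        self.group_size, self.bits = group_size, bits
        w = conv.weight  # MLX layout: (C_out, kH, kW, C_in)
        self.w_shape = w.shape
        # Flatten to 2D so quantization groups run along the last axis.
        self.weight, self.scales, self.biases = mx.quantize(
            w.reshape(w.shape[0], -1), group_size=group_size, bits=bits
        )
        if "bias" in conv:
            self.bias = conv.bias

    def __call__(self, x):
        w = mx.dequantize(
            self.weight, self.scales, self.biases,
            group_size=self.group_size, bits=self.bits,
        ).reshape(self.w_shape)
        y = mx.conv2d(x, w, stride=self.stride, padding=self.padding)
        if "bias" in self:
            y = y + self.bias
        return y
```

A version aimed at speed rather than just memory would presumably avoid dequantizing the full kernel each call, e.g. via an im2col unfolding followed by `mx.quantized_matmul`.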