Jake Vanderplas
When I try benchmarking your original function using `jax.jit`, I find that JAX is 4x faster than autograd on both CPU and GPU for inputs of size 1000:
```python
import ...
```
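For context, a minimal sketch of the kind of benchmark I mean; the function `f`, the input, and the timing loop below are stand-ins for illustration, not the code from the original report:
```python
import time

import numpy as np
import jax
import jax.numpy as jnp

def f(x):
  # Hypothetical stand-in for the function being benchmarked.
  return jnp.sum(jnp.tanh(x) ** 2)

f_jit = jax.jit(f)
x = jnp.array(np.random.randn(1000))

f_jit(x).block_until_ready()  # warm-up: the first call includes compilation

start = time.perf_counter()
for _ in range(100):
  f_jit(x).block_until_ready()  # block so we measure compute, not async dispatch
print(f"mean time per call: {(time.perf_counter() - start) / 100 * 1e6:.1f} µs")
```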
It looks like these functions can't be imported directly, because `jax.nn.softmax` defaults to `axis=-1`, while `scipy.special.softmax` defaults to `axis=None`. We'll have to create wrappers for these in `jax/_src/scipy/special.py`.
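A rough sketch of what such wrappers might look like; the exact signatures, docstrings, and placement would follow the rest of `jax.scipy.special`, so this is illustrative rather than the final implementation:
```python
import jax.numpy as jnp

def softmax(x, axis=None):
  # scipy.special.softmax defaults to axis=None (normalize over all elements),
  # so we can't simply re-export jax.nn.softmax, which defaults to axis=-1.
  x_max = jnp.max(x, axis=axis, keepdims=True)  # subtract max for numerical stability
  unnormalized = jnp.exp(x - x_max)
  return unnormalized / jnp.sum(unnormalized, axis=axis, keepdims=True)

def log_softmax(x, axis=None):
  # Same idea for scipy.special.log_softmax.
  shifted = x - jnp.max(x, axis=axis, keepdims=True)
  return shifted - jnp.log(jnp.sum(jnp.exp(shifted), axis=axis, keepdims=True))
```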
> We'll have to create wrappers for these in `jax/_src/scipy/special.py`. Also, we might want to make the `jax.scipy.special` wrappers use the non-deprecated softmax version by default. The deprecated version has...
Thanks for the report. It looks like the compiler is able to recognize the constant expression in one case, but not in the other. I don't think I'd consider this...
We can see what the compiler is doing with these functions using [ahead of time lowering](https://jax.readthedocs.io/en/latest/aot.html). For example:
```python
h1_lowered = jax.jit(h1).lower(x, a, b, c).compile()
h2_lowered = jax.jit(h2).lower(x, a, b, c).compile()
```
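As a concrete illustration of that workflow (the function and inputs below are made up; `h1` is just a stand-in for the functions in the report):
```python
import numpy as np
import jax
import jax.numpy as jnp

def h1(x, a, b, c):
  # Stand-in for one of the functions being compared in the report.
  return jnp.einsum('ij,jk->ik', x * a, b) + c

x = jnp.asarray(np.random.randn(8, 8), dtype=jnp.float32)
a, b, c = 2.0, jnp.eye(8), 1.0

lowered = jax.jit(h1).lower(x, a, b, c)
print(lowered.as_text())    # IR as emitted by JAX, before XLA optimization
compiled = lowered.compile()
print(compiled.as_text())   # optimized HLO that will actually run
```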
I think I understand the difference: the expensive allocation is the output of the `einsum`. In `h1`, the input to the einsum is an internal buffer (the output of `x...
Hi - thanks for the question! Could you take a look at https://jax.readthedocs.io/en/latest/faq.html#benchmarking-jax-code and update your benchmarks? In particular, accounting for asynchronous dispatch via `block_until_ready()` and separating out compile time and...
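Concretely, the pattern the FAQ suggests looks something like this (the function here is a made-up example, not your benchmark):
```python
import time

import jax
import jax.numpy as jnp

def f(x):
  # Made-up example function.
  return (x @ x.T).sum()

f_jit = jax.jit(f)
x = jnp.ones((1000, 1000))

# 1. Time compilation separately; otherwise the first call includes it.
t0 = time.perf_counter()
f_compiled = f_jit.lower(x).compile()
print("compile time:", time.perf_counter() - t0)

# 2. Time execution, blocking on the result so asynchronous dispatch
#    doesn't make the computation look instantaneous.
t0 = time.perf_counter()
f_compiled(x).block_until_ready()
print("run time:", time.perf_counter() - t0)
```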
Thanks! A couple things: 1. Since you're running on CPU, you might also try on GPU. The XLA GPU compiler is a bit more mature than the CPU compiler, so...
If the problem is slow convergence, I would check the learning rate. `0.0000005` seems very small.
Without digging further, my guess would be that you've found a local optimum.