Florian issues

Results 4 issues of


                                            Florian

fix: ScaleModule and SumModule for DiagHessian.

Partially fixes #316 . ScaleModule is also used for `torch.nn.Identity`. Not sure if `hessian_is_zero()` should always return `True` for those two modules. Same with `accumulate_backpropagated_quantities()` which concats `dicts` instead of...

Missing implementation of supported layers for DiagHessian and BatchDiagHessian

There are multiple layers which are specified as [being supported](https://docs.backpack.pt/en/master/supported-layers.html) for second order derivatives that actually do not work when trying to calculate the Hessian diagonal using `backpack-for-pytorch

fix: Remove depreciated state behavior.

Last remains of depreciated eqx.experimental statefulness behavior.

`DynamicScale` behaves unexpected when computing per-sample gradients with `vmap`.

When running `jax.vmap`, e.g. to compute per-sample gradients, the `fin_steps` and `scale` attributes of `DynamicScale` might become arrays, leading to an error in the next step during training if not...