matthiasgeihs comments

Results 84 comments of


                                            matthiasgeihs

Support for bn254 G2

I know about the issue. But are there any better alternatives on Ethereum as long as bls12-381 is not supported natively? (see [EIP-2537 discussion thread](https://ethereum-magicians.org/t/eip-2537-bls12-precompile-discussion-thread/4187/39))

Support for bn254 G2

@fedealconada I've been resorting to existing libraries such as `ffjavascript`.

Determine secure amount of private key modulo bias

Regarding 1. A 2^-64 bias means that the statistical distance between the uniform distribution and the actual distribution (assuming a perfect underlying rng) is 2^-64 (see the appendix of https://eprint.iacr.org/2023/1254.pdf,...

Finetuning fails with `RuntimeError: Please install flash-attn==1.0.3.post0 and triton==2.0.0.dev20221202`

Any chance to update the model such that it runs with PyTorch 2?

TX.wait() never resolves even though the transaction has already been completed.

can somebody fix this please?

Why does batch size affect convergence?

I'm not necessarily an expert but I have some intuition why this might happen: Let's say you have a very small batch size. This means there is only very few...

Why does batch size affect convergence?

You might want to watch https://youtu.be/kCc8FmEb1nY?t=867. This might clarify a few things.

Why does batch size affect convergence?

*The different batches don't talk to each other* means that the model parameters are optimized **per batch**. ``` block_size ^= context_length ^= length of a training chunk batch_size ^= number...

Why does batch size affect convergence?

I think you might wanna look at how which point the backprop optimization actually changes the parameters. This is done after each batch. (Also note that batch != block.)

Why does batch size affect convergence?

wow, in this case i am also pretty much out of explanations. of course you can try to run with different seed. or maybe batch size has to be a...