banyan-god

Results 4 issues of banyan-god

![W B Chart 3_27_2024, 9 55 33 AM](https://github.com/karpathy/nanoGPT/assets/153394752/400c926a-0443-4faa-b114-6a567420a988) I am running on 2x 4090 , updated gpu to 2 instead of 8 in gradient_accumulation_steps ``` more train_gpt2.py # config for...

This Dockerfile sets up environment to start the training process. You still need to download and run `python dev/data/fineweb.py --version 10B` You can build the dockerfile using `docker build -t...

The current implementation converts trigonometric values (cos_cached and sin_cached) to bfloat16, which introduces significant precision issues. This degrades the relative positional encoding properties of RoPE, particularly in **long-context** scenarios, as...

Problem Currently, the Pyctuator constructor requires a registration_url parameter and always attempts to register with Spring Boot Admin via this URL. However, when using [Kubernetes discovery](https://codecentric.github.io/spring-boot-admin/current/#kubernetes) in Spring Boot Admin,...