Umar Jamil
pytorch-llama
LLaMA 2 implemented from scratch in PyTorch
transformer-from-scratch-notes
Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
pytorch-transformer
Implementation of "Attention is all you need"
pytorch-stable-diffusion
Stable Diffusion implemented from scratch in PyTorch
pytorch-lora
LoRA: Low-Rank Adaptation of Large Language Models implemented using PyTorch
pytorch-llama-notes
Notes about LLaMA 2 model
quantization-notes
Notes on quantization in neural networks
mamba-notes
Notes on the Mamba and S4 models (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)
pytorch-ddpm
Implementation of the paper "Denoising Diffusion Probabilistic Models" in PyTorch
pytorch-transformer-distributed
Distributed training (multi-node) of a Transformer model