Liger-Kernel
Liger-Kernel copied to clipboard

Published 2 weeks ago •

→

Metadata

Efficient Triton Kernels for LLM Training

Reame
Issues

Results 163 Liger-Kernel issues

Sort by recently updated

[RFC] Logits numerical issues in convergence test

### 🐛 Describe the bug This issue is to discuss how we should modify our convergence test to handle numerical issues of logits. ### Context In #704, we make `FusedLinearCrossEntropy`(flce)...

Tcc0403

Need to investigate Gemma3 implementation with Liger

### 🐛 Describe the bug The tolerance when comparing loss in gemma3 multimodal model need to be set high (atol,rtol - 1e-3) compare to others (atol=1e-8,rtol=1e-5) in order to pass...

Manan17

Automating benchmark testing with every pr merged.

Hello, So we discussed an approach to solve this: Running all the benchmarks take less than an hour. I tried it on a single H100 GPU and it took me...

Manan17

‹
1
2
...
8
9
10
11
12
13
14
15
16
17

About

Efficient Triton Kernels for LLM Training

hacktoberfest

finetuning

triton

mistral

llama

llms

llm-training

llama3

phi3

gemma2

triton-kernels

5.9k

Stars

437

Forks

5.9k

Watchers

Owner

← Metadata

5.9k

Stars

437

Forks

5.9k

Watchers

Owner

Metadata

Efficient Triton Kernels for LLM Training

Back

Liger-Kernel Liger-Kernel copied to clipboard

Metadata

[RFC] Logits numerical issues in convergence test

Need to investigate Gemma3 implementation with Liger

Automating benchmark testing with every pr merged.

← Metadata

Owner

Metadata

Liger-Kernel
Liger-Kernel copied to clipboard