Dmitry
Results: 2 issues of Dmitry
I'm trying to launch MipNerf training, but as in https://github.com/kakaobrain/nerf-factory/issues/12, I see huge CPU memory consumption while the GPU idles. I guess it's because the data loader and sampler are working on the CPU....
# What does this PR do?

This PR fixes HPU Graphs usage and Flash Attention for the Gemma model. Changes are based on the Starcoder 2 and Qwen 2 implementations.

## Before...
run-test