Nanoflow
CUDA Error: misaligned address
On an RTX 4090 running llama3-8B with allocate_kv_data_batchsize=10, I encountered the following error:
CUDA Error: misaligned address (716) /code/gemv/include/attention/prefill.cuh: line 1918 at function cudaLaunchKernel((void*)kernel, nblks, nthrs, args, smem_size, stream)
terminate called after throwing an instance of 'std::invalid_argument'
what(): BatchPrefillWithPagedKVCache Failed with error: misaligned address
It happens in the 2nd cycle.
Is this fixed now? You could try the latest version of our codebase; it should work now.