mengllm issues

Results 4 issues of


                                            mengllm

Prefill speed is approximately 4~6 tokens/s for Qwen1.5-1.8B

Hi, mllm-qnn can work on my device oppo findx7 ultra(snapdragon 8gen 3+16G RAM). However, the prefill speed for Qwen1.5-1.8B is approximately 4-6 tokens per second, which significantly diverges from the...

Only the first device result aligns with the host’s computation

@and-ivanov @benrothen Hi, Regardless of whether I use `generated.bin` or `extracted.bin,` and whether I use `checksum_kernel` or `checksum_kernel_from_data,` the device’s checksum result changes with each execution of `cuLaunchKernel.` Only the...

Verification failed for the ‘run_generated with SMC’ test case

@and-ivanov @benrothen Hi, The verification succeeds in the ‘test_generated with SMC’ test case, but it always fails in the ‘run_generated with SMC’ test case. My test hardware environment includes an...

sample code for sake

@and-ivanov @benrothen Hi, I have verified the SAKE protocol using the Tamarin prover tool. Currently, only the code for SHA256 hashing and random number generation is available;there is no sample...