Mustafa Ali
Mustafa Ali
Hi, I am trying to benchmark Mixtral-7B,, however I get this error: ``` BS: 64, ISL/OSL: 128,128 ^A[TensorRT-LLM][ERROR] 3: [executionContext.cpp::setInputShape::2309] Error Code 3: API Usage Error (Parameter check failed at:...
Hello, I need to enable DHCP when the module is in AP mode so that any client connected to it can get automatic ip, in stock firmware, AT+CWDHCP command could...
I ran `ipython3 train.py ` I got this: data:image/s3,"s3://crabby-images/a2aba/a2abad7b23750ed680b1aadc442b63cdb7ec0db8" alt="screenshot from 2018-11-04 21-33-29" I installed sonnet from this command `$ pip install dm-sonnet`
Great repo! I learned a lot from it and the blog, thank you! I was wondering if there is an easy way to rewrite the kernels to use tensor cores...
Are you planning on supporting Flash decoding with H100 specific instructions? I mean FA3 for decoding?
**Is your feature request related to a problem? Please describe.** I was wondering if CUTLASS is planning to support FP8 inputs of Conv kernels (output can be FP16 or FP32)...
Hi I tried running a sdpa_fp8 graph where seqlen_q and seqlen_k are different, however it seems that it only uses the seqlen_q as in performance is the same when I...