FdyCN

Results: 16 comments by FdyCN

> In the jitify2 API (under development) you can do this: https://github.com/NVIDIA/jitify/blob/ca7f794/jitify2.hpp#L2153

@benbarsdell Thanks a lot, glad to hear that. I reviewed this branch. Is this tool necessary? https://github.com/NVIDIA/jitify/blob/jitify2/jitify2_preprocess.cpp...

I got the same results (same GPU: NVIDIA GeForce RTX 3070 Laptop GPU). Could you please check this? @mdoijade @Ru7w1k @AndyDick Thanks so much.

> Not so much for M2 Max, which always shows CPU and PCPU at 100%

I got the same issue on M2 Pro. ![image](https://user-images.githubusercontent.com/80800417/233528404-b47cd97c-32b5-44df-afec-c1f655dffdcf.png)

I know it can be added in this way:

```
jitify::Program program = kernel_cache.program(
    program1,                           // Code string specified above
    {example_headers_my_header1_cuh},   // Code string generated by stringify
    {"--use_fast_math", "-I" ${where...
```

> I think this should work, as long as the `-I` option has the correct path (e.g., "/usr/local/cuda/include"). If it's still not working for you, could you provide a full...
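For reference, here is a minimal sketch of what that call could look like with the `-I` path filled in, assuming the jitify v1 `JitCache` API; the program string and the stringified header below are made-up placeholders, not the actual sources from the issue:

```
#include "jitify.hpp"

// Hypothetical stand-ins for the real sources. With jitify, the first line of a
// code string is its name (the program name, or the header's #include path).
static const char* const program1 =
    "my_program\n"
    "#include \"example_headers/my_header1.cuh\"\n"
    "__global__ void my_kernel(float* data) { data[0] *= MY_SCALE; }\n";

static const char* const example_headers_my_header1_cuh =
    "example_headers/my_header1.cuh\n"
    "#define MY_SCALE 2.0f\n";

void build_program() {
  static jitify::JitCache kernel_cache;
  jitify::Program program = kernel_cache.program(
      program1,                           // Code string specified above
      {example_headers_my_header1_cuh},   // Code string generated by stringify
      {"--use_fast_math",
       "-I/usr/local/cuda/include"});     // Include path passed via -I
  // Kernels can then be instantiated and launched from `program` as usual.
}
```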

@Hzfengsy I'm a little bit confused, because TVM does have [Hexagon backend codegen](https://github.com/apache/tvm/blob/main/tests/python/codegen/test_target_codegen_hexagon.py), and mlc-llm is based on TVM Unity. So why can't mlc-llm lower to Hexagon target code? Is...

> I have tried to implement a 1.1B LLaMA on the Hexagon backend before and it was very slow, because I did not use CPU scheduling and only added HVX compilation instructions...

> @FdyCN Yes, there are currently some ways to support mlc running on the Hexagon backend, but in my tests it was very slow. Each token of 1.1B LLaMA takes more than 60 s...

> Shared BW/cycle is aggregate bandwidth from threadgroup memory. It's the number of bytes that can be shuffled around per core-cycle. On-core is a vendor-agnostic word for "L1 cache", on-GPU...
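To make the units concrete, here is one plausible way to turn a bytes-per-cycle figure into GB/s. It assumes the reported number is per core and scales with core count; the byte/cycle rate, core count, and clock below are illustrative assumptions, not measured values for any particular GPU:

```
#include <cstdio>

int main() {
  // Illustrative assumptions only, not measurements of any specific chip.
  const double bytes_per_core_cycle = 32.0;  // threadgroup/shared bytes moved per core per cycle
  const double num_cores = 16.0;             // number of GPU cores
  const double core_clock_hz = 1.3e9;        // core clock in Hz

  // Aggregate bandwidth = per-core bytes/cycle * cores * clock.
  const double aggregate_bw_gbs =
      bytes_per_core_cycle * num_cores * core_clock_hz / 1e9;
  std::printf("Aggregate shared-memory bandwidth: %.1f GB/s\n", aggregate_bw_gbs);
  return 0;
}
```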