rupakroyintel

12 comments by rupakroyintel

@oviano Thanks for your suggestion. We are working on the oneVPL documentation and will try to integrate the changes you mentioned in upcoming releases. Meanwhile, please let us...

@jondea, @cfRod This is an aarch64 platform-specific issue. Could you please look into this? Thanks.

@oviano This type of issue can be caused by FFmpeg allocating additional hardware buffers that weren't freed in time. Could you please provide the command line that...

@oviano Yes, a memory dump might be useful. Even if you do not know exactly which command line is the problem, some example command lines could give us some clues....

@oviano Thanks for sharing the details with us. We have made the relevant team aware of this issue. We will get back to you soon.

@vineelabhinav Can you please look into this issue? Thanks.

@matiaslin Thanks for reaching out. We did some experiments with your sample code. These are our findings:
- Increasing the number of threads helps alleviate this performance gap with subthread
- With...

> The file name of the example int4_weight_decompression_cmnts.cpp doesn't seem good. What is cmnts?

Removed int4_weight_decompression_cmnts.cpp and added int4_weight_decompression.cpp

@vpirogov @dzarukin We tried translating 8 packed int4 values into a single int value. However, it looks like the zero-points attribute wei:per_ocic:s4:32x8 is not supported. Here is the output from...
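
For readers unfamiliar with the packing mentioned in the comment above, here is a minimal sketch of how eight signed int4 values could be stored in one 32-bit word and recovered again. It is not taken from the oneDNN example or its API: the function names `pack_int4x8` and `unpack_int4x8` are hypothetical, and the nibble order (element 0 in the lowest 4 bits) is an assumption that may not match the layout the library expects.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical helper: pack eight signed 4-bit values (each in [-8, 7])
// into one 32-bit word.
// Assumption: element 0 occupies the lowest nibble; real layouts may differ.
inline std::uint32_t pack_int4x8(const std::array<std::int8_t, 8>& vals) {
    std::uint32_t packed = 0;
    for (int i = 0; i < 8; ++i) {
        assert(vals[i] >= -8 && vals[i] <= 7);
        // Keep only the low 4 bits (the two's complement nibble).
        std::uint32_t nibble = static_cast<std::uint32_t>(vals[i]) & 0xFu;
        packed |= nibble << (4 * i);
    }
    return packed;
}

// Hypothetical helper: unpack the word and sign-extend each nibble.
inline std::array<std::int8_t, 8> unpack_int4x8(std::uint32_t packed) {
    std::array<std::int8_t, 8> vals{};
    for (int i = 0; i < 8; ++i) {
        int nibble = static_cast<int>((packed >> (4 * i)) & 0xFu);
        // Nibbles 8..15 represent the negative values -8..-1.
        vals[i] = static_cast<std::int8_t>(nibble >= 8 ? nibble - 16 : nibble);
    }
    return vals;
}
```

Packing and then unpacking returns the original eight values, which makes a quick round-trip check possible before handing the packed data to any library.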

@dzarukin @vpirogov @shu1chen I have pushed the latest changes. The example passed on GPU. However, it failed on CPU. I have added the verbose log for the failing case: `ONEDNN_VERBOSE=all...