rupakroyintel comments

Results 12 comments of


                                            rupakroyintel

Feature Request: better documentation as to which tools work for which codecs

@oviano Thanks for your suggestion. We are working on the oneVPL Documentation and we would try to integrate the changes you mentioned in the upcoming releases. Meanwhile, please let us...

Assertion failure in brgemm in debug build on G3 aarch64 machine

@jondea, @cfRod This is aarch64 platform specific issue. Can you please have a look into this? Thanks.

Sometimes the driver seems to die with device error -17 and only recovers via a reboot

@oviano This type of issue can be caused by the additional allocation of hardware buffers by FFmpeg that weren't freed in time. Could you please provide the command line that...

Sometimes the driver seems to die with device error -17 and only recovers via a reboot

@oviano Yes, a memory dump might be useful. Even if you do not know exactly which command line is the problem, some example command lines could give us some clues....

Sometimes the driver seems to die with device error -17 and only recovers via a reboot

@oviano Thanks for sharing the details with us. We have made the relevant team aware of this issue. We will get back to you soon.

Assertion failure in brgemm in debug build on G3 aarch64 machine

@vineelabhinav Can you please look into this issue? Thanks.

Speed drop when running oneDNN in a subthread

@matiaslin Thanks for reaching out. We did some experiments with your sample code. These are our findings: -Increasing the number of threads helps alleviate this performance gap with subthread -With...

example: int4 weight decompression

> The file name of the example int4_weight_decompression_cmnts.cpp doesn't seem good. What is cmnts? Removed the int4_weight_decompression_cmnts.cpp and added int4_weight_decompression,cpp

example: int4 weight decompression

@vpirogov @dzarukin We tried translating packed 8 int4 values into a single int value. However, it looks like the zero-points attribute wei:per_ocic:s4:32x8 is not supported. Here is the output from...

example: int4 weight decompression

@dzarukin @vpirogov @shu1chen I have pushed the latest changes. The example passed on GPU. However, it failed on CPU. I have added the verbose log for the failing case: `ONEDNN_VERBOSE=all...