Sergey Kamenev

Results 30 comments of Sergey Kamenev

All output of clinfo ``` Number of platforms 3 Platform Name Intel(R) CPU Runtime for OpenCL(TM) Applications Platform Vendor Intel(R) Corporation Platform Version OpenCL 2.1 LINUX Platform Profile FULL_PROFILE Platform...

It's my custom model. 512 neuron (input layer) + 256 * 15 (15 hidden layers by 256 neuron) +2 neuron (output layer). Dictionary has a size ~5MB. I'm using pytorch_dlprim...

For a CPU with this batch size, training occurs as quickly as possible on my data. And I’m also interested in the question: can pytorch_dlprim be used in parallel to...

Hello! I'm waiting for code review

Two of of tests don't working with simular errors not related to my fixes. python tests/test_op.py --device privateuseone:1 python tests/validate_network.py --device privateuseone:1 Mean 1d Traceback (most recent call last): File...

My own application training network with my fixes without probles. I'm using PyTorch 1.13.1

When tested, my code produces the same errors as yours (see Issues). I think the point is that my version of OpenCL (Rusticl on Linux: radeonsi, fiji, LLVM 17.0.6, DRM...

Please try running my code test on your computer. I think everything will be great.

> Is it 32 or 64 bit atchitecture? need to track down which kernel fails. My CPU have 64 bit architecture. GCN 3 (Fiji) - I don't know how many...

On AMD OpenCL from amdgpu-pro also error ``` python tests/validate_network.py --device privateuseone:3 Testing resnet18 Accessing device #3:Fiji on AMD Accelerated Parallel Processing Traceback (most recent call last): File "/home/inetstar/Kamenev/programming/ZenDnn/pytorch_dlprim/tests/validate_network.py", line...