Sergey Kamenev
Sergey Kamenev
All output of clinfo ``` Number of platforms 3 Platform Name Intel(R) CPU Runtime for OpenCL(TM) Applications Platform Vendor Intel(R) Corporation Platform Version OpenCL 2.1 LINUX Platform Profile FULL_PROFILE Platform...
It's my custom model. 512 neuron (input layer) + 256 * 15 (15 hidden layers by 256 neuron) +2 neuron (output layer). Dictionary has a size ~5MB. I'm using pytorch_dlprim...
For a CPU with this batch size, training occurs as quickly as possible on my data. And I’m also interested in the question: can pytorch_dlprim be used in parallel to...
Hello! I'm waiting for code review
Two of of tests don't working with simular errors not related to my fixes. python tests/test_op.py --device privateuseone:1 python tests/validate_network.py --device privateuseone:1 Mean 1d Traceback (most recent call last): File...
My own application training network with my fixes without probles. I'm using PyTorch 1.13.1
When tested, my code produces the same errors as yours (see Issues). I think the point is that my version of OpenCL (Rusticl on Linux: radeonsi, fiji, LLVM 17.0.6, DRM...
Please try running my code test on your computer. I think everything will be great.
> Is it 32 or 64 bit atchitecture? need to track down which kernel fails. My CPU have 64 bit architecture. GCN 3 (Fiji) - I don't know how many...
On AMD OpenCL from amdgpu-pro also error ``` python tests/validate_network.py --device privateuseone:3 Testing resnet18 Accessing device #3:Fiji on AMD Accelerated Parallel Processing Traceback (most recent call last): File "/home/inetstar/Kamenev/programming/ZenDnn/pytorch_dlprim/tests/validate_network.py", line...