Isuru Fernando comments

Results 928 comments of


                                            Isuru Fernando

macOS: `Undefined symbols for architecture arm64: [...] "___to_global"`

I cannot reproduce this bug with pocl 4.0

macOS: `Undefined symbols for architecture arm64: [...] "___to_global"`

I tried only https://github.com/pocl/pocl/issues/1288#issuecomment-1696570750

Numerical precision issue

I cannot reproduce with, ``` PYTHONPATH=. PYOPENCL_CTX=port:nvidia ./run_tests.py pyFAI.opencl.test.test_openCL.TestDoubleWord ```

> PYOPENCL_CTX=port:nvidia selects Portable Computing Language:cpu-broadwell-Intel(R) Xeon(R) CPU E5-1650 v4 @ 3.60GHz Ah, you are overriding pyopencl's parsing of `PYOPENCL_CTX` env var and has different behaviour. I can reproduce now.

Numerical precision issue

https://github.com/pocl/pocl/pull/1227 is the culprit.

Numerical precision issue

For now you can add `#pragma clang fp contract(on)` to the top of the `doubleword.cl` file. It's strange that NVIDIA OpenCL don't run into this issue.

CMake code to find and link against OpenCL / ICD loader is problematic

Did you try the main branch or the release?

CMake code to find and link against OpenCL / ICD loader is problematic

It does have the unused code, but I believe https://github.com/pocl/pocl/pull/1252 fixes the issue that you are running into.

segmentation fault in clGetDeviceIDs on arm64(Jetson AGX)

If you are okay with using binaries, you can try the instructions at https://github.com/pocl/pocl#pocl-with-cuda-driver

segmentation fault in clGetDeviceIDs on arm64(Jetson AGX)

I've tried it on an aarch64 server, but not a Jetson in particular. If it does not work there, I'll be happy to fix it.