aomp
aomp copied to clipboard
AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
https://github.com/ye-luo/openmp-target/blob/master/hands-on/tests/complex/complex.cpp ``` $ clang++ -fopenmp -fopenmp-targets=amdgcn-amd-amdhsa -march=native -Xopenmp-target=amdgcn-amd-amdhsa -march=gfx906 complex.cpp lld: error: undefined symbol: __mulsc3 >>> referenced by /tmp/complex-gfx906-72c03e-gfx906-c2b83e.o:(__omp_offloading_10304_2920ae4__Z8test_mulIfSt7complexIfES1_EvT0_T1__l59) >>> referenced by /tmp/complex-gfx906-72c03e-gfx906-c2b83e.o:(__omp_offloading_10304_2920ae4__Z8test_mulIfSt7complexIfES1_EvT0_T1__l59) lld: error: undefined symbol: __divsc3 >>> referenced...
The source code I'm using has multiple offload regions in different member functions of a class. If I enable individual target region and comment the other target pragma Kernel 1...
Using 0.7-7. The AOMP linker works on more complicated miniQMC but failed in linking the following test case. https://github.com/ye-luo/openmp-target/tree/master/hands-on/tests/link_static_fat_bin ``` /usr/lib/aomp/bin/clang++ -Xopenmp-target=amdgcn-amd-amdhsa -march=gfx906 -fopenmp -fopenmp-targets=amdgcn-amd-amdhsa -c classA.cpp rm -f mylib.a...
I notice in the docs and in the repo there are references to spack, and in fact a partial spack package. Could this be upstreamed so we can start actually...
…threads executing target regions multiple times.
run devito benchmark with AOMP16.0-0, singe gpu is ok, but when run with multi-gpu with mpirun (openmpi4.1.4, ucx 1.13.1) , runtime Assertion error. set OMPI_CC=aompcc mpicc -O3 -g -fPIC -Wall...
D133705 removed the restriction on the SDL to be prefixed as "lib" and suffixed with ".a".
Hi, I built the BT and CG benchmarks from the NAS parallel benchmark suite (https://www.nas.nasa.gov/software/npb.html) with AOMP 15.0 and the BT benchmark is showing 0.00 Mop/s (mega operations per second)....
The STL function `memcmp()` does not appear to be available on device. While much of the STL is not expected to be present for device regions, typically simple functions like...