nways_accelerated_programming
nways_accelerated_programming copied to clipboard
N-Ways to GPU Programming Bootcamp
- Regarding RDF calculation, you have to scan all the pairs of atoms there, but as it's implemented it: - 1) overcounts **( r(ij)=r(ji) )** - 2) counts self-distance, which...
Cannot build [nways_Docker_python](https://github.com/openhackathons-org/nways_accelerated_programming/blob/9381b0d54548318116303340949e40f82dd17f15/nways_Dockerfile_python) as nvidia/cuda:11.4.2-devel-ubuntu20.04` is no longer available from docker.io: ``` docker pull nvidia/cuda:11.4.2-devel-ubuntu20.04 Error response from daemon: manifest for nvidia/cuda:11.4.2-devel-ubuntu20.04 not found: manifest unknown: manifest unknown ```
Cupy: Example 1 has a typo. Should be cuda.Device(0). Since we only have one MiG instance, using Device(1) will throw an error.
Cupy: Exercise 4: Is the expected output correct? This does not match the output of my solutions to Exercises 3 and 4.
CuPy Fundamentals: Typo in Data transfer: d_X = cp.asarray(x) should be cp.asarray(h_X).
Cupy: Example 5 - typo. Capitalize "step 5" -> Step 5.
Cupy: Example 6, Step 3 - typo. Should be 'set reduction expression a + b' or 'set reduction expression for a and b'. The ampersand (&) is an operator in...
Following are a list of suggested changes to the Python Nways materials as suggested by Robert Searles and Jonathan Dursi JIT kernels • Can we move this before CUDA kernels?...
Create the replica of the current Nways to GPU programming content with CFD example using miniweather example. Nways content is available on Github([https://github.com/openhackathons-org/nways_accelerated_programming](https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fopenhackathons-org%2Fnways_accelerated_programming&data=05%7C01%7Cmozhgank%40nvidia.com%7Cddf9fdbf0c9547561d2c08db46613e5f%7C43083d15727340c1b7db39efd9ccc17a%7C0%7C0%7C638181155688226277%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=LJ%2F2jggfAvaKgbRB0uwlITiZNZ5lV8F9wiJehmy5lCM%3D&reserved=0)) . Miniweather example is available on Github([https://github.com/openhackathons-org/gpubootcamp/tree/master/hpc/miniprofiler](https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fopenhackathons-org%2Fgpubootcamp%2Ftree%2Fmaster%2Fhpc%2Fminiprofiler&data=05%7C01%7Cmozhgank%40nvidia.com%7Cddf9fdbf0c9547561d2c08db46613e5f%7C43083d15727340c1b7db39efd9ccc17a%7C0%7C0%7C638181155688226277%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=toQr2yPa7Oi68POKARBSrFt30Zt8hih4eFCftVy5ajI%3D&reserved=0))...
To extend the current Nways content, create a version using Python cuNumeric, Legate ([https://github.com/nv-legate/legate.core](https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnv-legate%2Flegate.core&data=05%7C01%7Cmozhgank%40nvidia.com%7Cddf9fdbf0c9547561d2c08db46613e5f%7C43083d15727340c1b7db39efd9ccc17a%7C0%7C0%7C638181155688226277%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=XNUDO25bGQwtSAw0zOfKySUdRO1%2FSX568oEOxsovYQ0%3D&reserved=0)), and OpenAI Triton ([https://openai.com/research/triton](https://nam11.safelinks.protection.outlook.com/?url=https%3A%2F%2Fopenai.com%2Fresearch%2Ftriton&data=05%7C01%7Cmozhgank%40nvidia.com%7Cddf9fdbf0c9547561d2c08db46613e5f%7C43083d15727340c1b7db39efd9ccc17a%7C0%7C0%7C638181155688226277%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=4CuRdc9oRspJaVwykOOJ7y%2FD1ymop2QZ8KAz1UXuSTE%3D&reserved=0)). This will extend to the current Nways Bootcamp with Python, which uses...