Muhammad Awad

17 issues by Muhammad Awad

The current hash function failed an initial test with signed types.

Use GoogleTest for testing, with the current tests as examples.

Atomic operations should use a `memcpy` for comparison (see https://github.com/NVIDIA/cccl/issues/989). The current [pair implementation](https://github.com/owensgroup/BGHT/blob/main/include/detail/pair.cuh#L35) uses a custom `==` operator.
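A minimal host-side sketch of what a `memcpy`-based comparison could look like, assuming an 8-byte, trivially copyable pair. The names `pair_f32` and `bitwise_equal` are hypothetical and are not taken from BGHT or CCCL; the point is only to show how a bytewise comparison can disagree with a value-based `==`.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>
#include <type_traits>

// Hypothetical 8-byte pair standing in for a key-value pair type.
struct pair_f32 {
  float first;
  float second;
};

// Value-based comparison, similar in spirit to a custom operator==.
inline bool operator==(const pair_f32& lhs, const pair_f32& rhs) {
  return lhs.first == rhs.first && lhs.second == rhs.second;
}

// Bytewise comparison: copy both objects into same-sized integers with
// memcpy and compare the integers. This sketch only handles 8-byte types.
template <typename T>
bool bitwise_equal(const T& lhs, const T& rhs) {
  static_assert(std::is_trivially_copyable<T>::value,
                "memcpy requires a trivially copyable type");
  static_assert(sizeof(T) == sizeof(std::uint64_t),
                "this sketch assumes an 8-byte type");
  std::uint64_t lhs_bits;
  std::uint64_t rhs_bits;
  std::memcpy(&lhs_bits, &lhs, sizeof(T));
  std::memcpy(&rhs_bits, &rhs, sizeof(T));
  return lhs_bits == rhs_bits;
}

// Small self-check: +0.0f and -0.0f compare equal by value but differ in bits.
inline void self_check() {
  const pair_f32 a{0.0f, 1.0f};
  const pair_f32 b{-0.0f, 1.0f};
  assert(a == b);
  assert(!bitwise_equal(a, b));
  assert(bitwise_equal(a, a));
}
```

Hardware compare-and-swap works on the bytes of an object, so a value-based `==` can report a match where the atomic operation sees a difference, as with the signed zeros in `self_check()`.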

- [ ] Benchmarking scripts to reproduce paper results.
- [ ] Look into why the search query is slower.
- [ ] Successor query
- [x] Range query
- ...

> Running on GeForce RTX 3080 GPU, CUDA 12.1, C++17, CMake version 3.26.3. Although the g++ compiler is version 9.4.0 on Ubuntu 20.04, CMake is using CXX STANDARD 17 so...

The benchmarking code now requires 20 GiB of memory for a complete set of benchmarks. It would be nice to limit the memory requirements for benchmarking on workstations. The code...

This PR proposes adding an experimental Heterogeneous System Architecture (HSA) backend for IREE. HSA provides standard APIs to manage and manipulate low-level device primitives such as queues, signals, and memory...