Georgii Evtushenko

Results 54 issues of Georgii Evtushenko

Specifying `__launch_bounds__` in the presence of RDC has proven to be troublesome and unreliable. We have to abstract it out so that launch bounds are not specified when RDC is...

nvbug
P0: must have

Currently, `BlockRadixRankMatchEarlyCounts` doesn't work in some specific cases `(1

Currently, we have a set of block radix rank facilities: - `BlockRadixRank` - `BlockRadixRankMatch` - `BlockRadixRankMatchEarlyCounts` There's also a `enum BlockScanAlgorithm` that describes the differences between these algorithms. Unlike the...

P3: backlog

Thank you for this project! It's much easier to get matrices now. The only issue I found so far is that it takes days to download big matrices. I've changed...

This PR applies a technique similar to one in segmented sort algorithm. Segments are partitioned and various thread groups are applied to various segment categories. While optimizing segmented reduction I...

P2: nice to have

The custom stopping criterion introduced in https://github.com/NVIDIA/nvbench/pull/151 currently triggers linear regression and R2 computation every second `is_finished()` call. Per @julie-is-late [comment](https://github.com/NVIDIA/nvbench/pull/151#discussion_r1445601451): > depending on the expected amount of time a...

Currently, NVBench relies on a fixed [criteria](https://github.com/NVIDIA/nvbench/blob/57c4d42ba505d525a1a3815d2b2560450a4f83dd/nvbench/detail/measure_cold.cu#L102-L154) to understand when enough samples are collected. We can consider an API that would allow users to modify this criterion, accepting the set...

The recent switch to lazy loading by default in CTK 12.2 seems to have broken the async benchmarks. This can be reproduced by `nvbench.example.axes`. The deadlock can be fixed by...

type: bug: functional

### Is this a duplicate? - [X] I confirmed there appear to be no [duplicate issues](https://github.com/NVIDIA/cccl/issues) for this request and that I agree to the [Code of Conduct](CODE_OF_CONDUCT.md) ### Area...

feature request

### Is this a duplicate? - [X] I confirmed there appear to be no [duplicate issues](https://github.com/NVIDIA/cccl/issues) for this bug and that I agree to the [Code of Conduct](CODE_OF_CONDUCT.md) ### Is...

doc