parsec
parsec copied to clipboard
PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core heterogeneous architectures. PaRSEC assigns computation threads...
show-caps: * don't report flops for unknown cuda devs (report 0.0, like cpus in #663) * report peer access mask * report cpu avx/simd instruction if detected available The diff...
Add support for sending and receiving the data directly from and to devices. There are few caveats (noted on the commit log). 1. The first question is how is such...
These checks have no relevance and cause problems in TTG because the input data pointers may change.
The idea is the following: - tasks incarnations (aka. BODY) can be marked with the "batch" property allowing the runtime to provide the task with the entire list of ready...