Jed Brown
Jed Brown
A potential users has requested that we provide Docker images. I'm not sure if this should be at the libCEED level or higher (Ratel or CEED bundle).
We should make a visual explainer on the website, not specific to the fluids example, because this is a common thing other libCEED users will be thinking about, perhaps for...
DMPlex can read/create non-tensor meshes (see the `simplex` argument to `DMPlexCreateBoxMesh()`). `PetscFEGetCellTabulation()` will return the (non-tensor) tabulated basis matrices. We should add to `examples/petsc/bps.c`: 1. Add a run-time flag to...
Some of you may have noticed that there is a `release` branch in the repository now. I'm curious if we should start distinguishing main development from ABI-compatible bug fixes, perhaps...
For those with LLNL CZ access, I created a mirror repository that will be able to run CI jobs. You can log in and request access if you don't already...
We should support batched application of CeedOperator to vectors. One approach is to make a new constructor ```c int CeedOperatorCreateKroneckerProduct(CeedOperator J, CeedInt m, const CeedScalar *T, CeedOperator *JxT); ``` where...
Arm SVE intrinsics are vector-length agnostic, so represent a nontrivial difference in strategy from Intel intrinsics. An example shows how one might program with these intrinsics: https://developer.arm.com/documentation/100891/0612/coding-considerations/using-sve-intrinsics-directly-in-your-c-code Scatters and gathers...
Some backends, such as those that use atomics to resolve write conflicts, are non-deterministic due to lack of floating point associativity. We should - [x] Add an interface to query...
libCEED should be able to track performance over time. The benchmarking tools should make it easy (one or two commands) to compare performance for a particular backend between any two...
CUDA 10 adds a Graphs API that is like streams, but can optimize a schedule for lower latency in iterative algorithms. This may be a valuable path to optimization for...