Cabana
Cabana copied to clipboard
Sparse Grids Roadmap
Initial rectilinear partitioning path:
- [x] Initial sparse grid functionality #287
- [x] Rename
SparseIndexSpace
->SparseMap
(Also theTileIndexSpace
and similarly named classes). Note that a user will be inputting globalijk
cell ids into this structure. Perhaps we should add acreateSparseMap()
function which takes aGlobalMesh<SparseMesh>
to populate the number of global tiles including padding and returns aSparseMap
. @YuxingQiu #326 - [x] Partitioner needs to be generalized such that communicators with the given topology and block id information is returned by the partitioner. For example, the constructor implementation of
GlobalGrid
will largely be moved into the partitioners. @YuxingQiu #383; #464 - [x] #534
- [ ]
SparseArrayLayout
andSparseArray
@YuxingQiu #546 - [ ]
Halo
specialization for local grid which defines the relevant gather/scatter operations. May simply just need to return 0 at first when outside of sparse domain. @sslattery - [ ]
Hypre
structured solvers mapped toSparseArray
@sslattery - [ ]
Spline
infrastructure forp2g
andg2p
mapped to sparse space (this will probably just work). @YuxingQiu - [ ] Unite load balancing interface (initially developed in #444) #541
Additional steps for non-rectilinear path:
- Develop new
GlobalGrid<SparseMesh>
specialization that will take and hold aSparseMap
object. TheblockRank
will no longer beijk
but rather just the MPI rank. This information will be provided by the partitioner. Communicators will no longer have Cartesian topology - they will be general. We could consider trying out the general MPI topology feature. All indices are linearized so functions likedimNumBlock()
are no longer needed. @sslattery - Develop new
SparseIndexSpace
usingSparseMap
and information from theEntityType
,Decomposition
, andIndexType
which would be used in aLocalGrid
. @YuxingQiu -
LocalGrid
specialization which defines partitioning strategy and returnsSparseIndexSpace<3>
based onEntityType
,Decomposition
, andIndexType
. OurGlobalGrid
specialization for sparse grids will have a partitioner which will help inform what things are owned and what are ghosted. This specialization should take aSparseMap
in the constructor. @YuxingQiu -
LocalMesh
specialization for sparse grids which describes the local geometry as a function of the dense local indices. @YuxingQiu - Develop
Kokkos::View
allocation functions using a newCajitaLayoutSparse
for 3d and 4d sparse index spaces. The new layout will map fromijk
indices to thetile
,cell
, and, if 4D,field_dim
indices. @sslattery - Develop execution policy for sparse index spaces and the associated overloads of
Cajita::grid_parallel_for
in which user functors have the signatureKOKKOS_LAMBDA(const int i, const int j, const int k)
for 3D orKOKKOS_LAMBDA( const int i, const int j, const int k, const int d)
for 4D. @YuxingQiu - Refactor neighbors to not be
ijk
based but rather based on the 1d block id (e.g. MPI rank) as we will now communicate to arbitrary neighbors. Then addsharedIndexSpace()
to local grid which will returnSparseIndexSpace
objects that describe overlap with neighbors for communication. The neighbor index in these functions is no longer a localijk
offset but rather the local 1D neighbor id. AnumNeighbor()
function should be added which returns the number of neighbors. Note that partitioning strategy means arbitrary number of neighbors now defined similar to unstructured grids and therefore more partitioning information will be needed in theLocalGrid
specialization. @sslattery
Notes:
- The
GlobalGrid<SparseMesh>
will not support periodicity initially. We can try to add this later as needed. It will be difficult to deal with the halo overlap issues when we don't have a fixed grid domain to work with (e.g. in sparse grids usually the user is just putting a box around their particles). - We will make a new partitioner for particle-balanced simulations (probably using RCB). For now just use the current partitioners which will partition the
GlobalGrid
based only on the mesh. -
p2g
andg2p
will probably just work for now (in a thread, not the global APIs) and we can look at different performance options later.
Other: performance testing
- [x] #411