h5spark

Supporting Hierarchical Data Format and Rich Parallel I/O Interface in Spark

Results: 5 h5spark issues

I followed the instructions:

> Scala version:
>
> `export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:path_to_h5spark/lib`
> `module load sbt`
> `sbt assembly`
> `sbatch spark-scala.sh`

The step `sbt assembly` incurred the following errors:

> ...

A user has asked to use H5Spark in Jupyter, a widely used notebook environment. The current issue is that the class path is not recognized in the notebook, i.e.,...

Currently, H5Spark parallelizes the I/O along the slowest dimension, i.e., the dimension that changes slowest on disk. For example, for a 2D C array x[10][200], H5Spark will choose the...
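
To make the idea concrete, here is a minimal sketch of splitting a C-ordered array into contiguous blocks along its first (slowest-changing) index. The function name `split_slowest_dim` and the (offset, count) block layout are assumptions for illustration only; this is not H5Spark's actual code, and the real partitioning logic may differ.

```python
def split_slowest_dim(shape, num_partitions):
    """Split a C-ordered array shape into contiguous blocks along axis 0.

    Hypothetical helper, not part of H5Spark. Returns a list of
    (offset, count) tuples, one per non-empty partition.
    """
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")
    rows = shape[0]
    # Spread the rows as evenly as possible; the first `extra`
    # partitions receive one additional row.
    base, extra = divmod(rows, num_partitions)
    slabs = []
    start = 0
    for i in range(num_partitions):
        n = base + (1 if i < extra else 0)
        if n == 0:
            break  # more partitions requested than rows available
        offset = (start,) + (0,) * (len(shape) - 1)
        count = (n,) + tuple(shape[1:])
        slabs.append((offset, count))
        start += n
    return slabs


# The x[10][200] array from the example, split four ways.
for offset, count in split_slowest_dim((10, 200), 4):
    print(offset, count)
```

Under this scheme, an array with only 10 rows yields at most 10 blocks, however many partitions are requested.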

When loading multiple files, the file sizes can follow either a long-tailed distribution (see the figure) or an even distribution. In the case of an even distribution, we don't need to balance the load....
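
For the long-tailed case, one possible balancing strategy is a greedy "largest file first" assignment, sketched below. This is an illustrative assumption, not the approach H5Spark uses or plans to use; the name `balance_files` and the example file names and sizes are hypothetical.

```python
import heapq


def balance_files(file_sizes, num_partitions):
    """Greedily assign files to partitions, largest file first.

    Hypothetical helper: `file_sizes` maps file name -> size in bytes.
    Each file goes to the partition with the smallest load so far.
    """
    # Min-heap of (current_load, partition_index).
    heap = [(0, i) for i in range(num_partitions)]
    heapq.heapify(heap)
    groups = [[] for _ in range(num_partitions)]
    # Sort by descending size; ties are broken by name for determinism.
    for name, size in sorted(file_sizes.items(), key=lambda kv: (-kv[1], kv[0])):
        load, idx = heapq.heappop(heap)
        groups[idx].append(name)
        heapq.heappush(heap, (load + size, idx))
    return groups


# Made-up sizes with one large outlier.
sizes = {"a.h5": 100, "b.h5": 10, "c.h5": 10, "d.h5": 10, "e.h5": 10}
print(balance_files(sizes, 2))
```

With these sizes, the large file ends up alone in one partition while the small files share the other, so neither task is stuck with the outlier plus extra work.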

H5Spark needs to maintain the coordinates during the read, then figure out the global offset during the parallel write.
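
As a rough sketch of the offset calculation, assuming the data is stored in C (row-major) order and each block remembers the global coordinates of its first element, the linear offset could be derived as follows. The function name `global_offset` is hypothetical, and this is not H5Spark's implementation.

```python
def global_offset(coords, shape):
    """Return the row-major linear offset of `coords` in an array of `shape`.

    Hypothetical helper: `coords` are the global coordinates kept from
    the read step, and `shape` is the shape of the full dataset.
    """
    if len(coords) != len(shape):
        raise ValueError("coords and shape must have the same rank")
    offset = 0
    for c, n in zip(coords, shape):
        if not 0 <= c < n:
            raise IndexError("coordinate out of bounds")
        # Horner-style accumulation: the last index varies fastest.
        offset = offset * n + c
    return offset


# A block of x[10][200] that starts at row 3, column 0.
print(global_offset((3, 0), (10, 200)))
```

A writer that knows this offset and the size of its block can place its data without coordinating with the other writers.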