Huihuo Zheng
Hello, I was using unitrace to trace an AI application. Below is part of the output. ``` {"ph": "X", "tid": 4294950910, "pid": 4294950911, "name": "gen9_eltwise_bwd[SIMD32 {1568; 1; 1} {512;...
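The snippet above looks like the Chrome trace event format, where `"ph": "X"` marks a complete event carrying a `dur` field in microseconds. A minimal sketch (the field layout is an assumption based on that format, not on unitrace documentation) of summing kernel time from such a trace:

```python
import json

# Hypothetical sample in Chrome-trace style; fields mirror the output above.
sample = '''
{"traceEvents": [
  {"ph": "X", "tid": 1, "pid": 2, "name": "gen9_eltwise_bwd", "ts": 100, "dur": 250},
  {"ph": "X", "tid": 1, "pid": 2, "name": "gen9_eltwise_fwd", "ts": 400, "dur": 150}
]}
'''

def total_kernel_time_us(trace_json: str) -> int:
    """Sum durations of all complete ("X") events in a trace."""
    events = json.loads(trace_json).get("traceEvents", [])
    return sum(e.get("dur", 0) for e in events if e.get("ph") == "X")

print(total_kernel_time_us(sample))  # -> 400
```

Tools such as Perfetto or chrome://tracing can load traces in this format directly, so the same JSON can also be inspected visually.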
This allows h5bench to pause the async operations in the Cache VOL and restart them after all the dataset write calls have been issued.
Check whether we can adopt the PyTorch S3 support: https://pytorch.org/data/main/generated/torchdata.datapipes.iter.S3FileLoader.html
DLRM workload support was added in https://github.com/argonne-lcf/dlio_benchmark/pull/114, but it still needs to be validated. I am adding this issue to keep track of that work.
Megatron-DeepSpeed support was added in https://github.com/argonne-lcf/dlio_benchmark/pull/114. This issue is to keep track of the validation work.
Maybe we should consider adding Apache Spark support.
This PR includes changes on:
* Computation time calculation: instead of taking the value from the configuration file, the computation time is now the actual time spent in the framework.compute function.
* ...
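A minimal sketch of the idea, assuming illustrative names rather than the actual DLIO API: time the real call to the framework's compute step instead of reading a fixed value from the configuration.

```python
import time

def compute(work_items: int) -> int:
    """Stand-in for framework.compute: performs some actual work."""
    return sum(i * i for i in range(work_items))

# Measure the time actually spent in compute, rather than
# reporting a preconfigured computation_time value.
start = time.perf_counter()
compute(100_000)
elapsed = time.perf_counter() - start
print(f"compute took {elapsed:.6f} s")
```

Using `time.perf_counter()` gives a monotonic, high-resolution clock, which is the usual choice for measuring short intervals like a single compute step.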
In this PR, we changed the per-step output from info to debug to reduce the logging overhead. We also added support for changing the logging level.
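The effect of this change can be sketched with the standard `logging` module (the logger name and messages are illustrative, not the actual DLIO ones): per-step messages emitted at DEBUG are suppressed at the default INFO level, and the level can be raised or lowered at runtime.

```python
import logging

# Illustrative logger; DLIO's actual logger configuration may differ.
logger = logging.getLogger("dlio_sketch")
logger.addHandler(logging.StreamHandler())
logger.setLevel(logging.INFO)

logger.debug("step 42: read sample ...")  # suppressed at INFO level
logger.info("epoch 1 finished")           # still shown

# Support for changing the logging level, e.g. from a config value:
logger.setLevel(logging.DEBUG)
logger.debug("step 43: read sample ...")  # now visible
```

Because the per-step call site only pays the formatting cost when the level is enabled, demoting per-step output to DEBUG removes most of the logging overhead in the common case.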
In reconfigure, global shuffling is performed. The data loader only has the indices of the local samples. It still has the shuffling in the PyTorch / DALI data loader, which...
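One common way to realize this pattern, sketched here with hypothetical names rather than the actual DLIO implementation: every rank shuffles the full index list with a shared seed, then keeps only its own slice, so the ranks agree on the global permutation without exchanging any data.

```python
import random

def local_indices(num_samples: int, rank: int, world_size: int, seed: int):
    """Globally shuffle sample indices, then keep this rank's local slice."""
    indices = list(range(num_samples))
    random.Random(seed).shuffle(indices)  # same permutation on every rank
    return indices[rank::world_size]      # this rank's local samples

# Two ranks together cover all samples exactly once, with no overlap:
a = local_indices(10, rank=0, world_size=2, seed=123)
b = local_indices(10, rank=1, world_size=2, seed=123)
print(sorted(a + b))  # every index 0..9 appears exactly once
```

Each rank can then hand its local index list to the PyTorch or DALI data loader, which may apply its own local shuffling on top.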