hi20240217

Results 2 issues of hi20240217

``` cat linear_model_example.py import math import torch import torch.profiler import torch.distributed as dist import os dist.init_process_group(backend='nccl') local_rank=int(os.environ['LOCAL_RANK']) rank=torch.distributed.get_rank() torch.cuda.set_device(local_rank) if not dist.is_available() or not dist.is_initialized(): print("dist init error") dtype =...

How to use rocprofv2 to obtain the metrics supported by the current device.

Under Investigation