Wang, Yiting

Results 10 comments of Wang, Yiting

@nevakrien Thanks for your comment about missing pip packages for dataset preparation, I will update README.md soon. By the way, what's your conflicting absl-py version? I have tried to install...

@nevakrien Current hyperparameters in examples/pretrain_bert/README.md are verified on Intel GPU Max 1550. I'm afraid that it will be Out-Of-Memory on MAX 1100. You can reduce batch size to avoid OOM....

Hi @nevakrien, can you see `Intel Extension for Tensorflow* GPU backend is loaded.` and `Selected platform: Intel(R) Level-Zero` when `import tensorflow`? I see the log you just posted which shows...

Hi @nevakrien. I'm sorry for your inconvenience as the README.md doesn't mention multi-GPU training. This part will be added to README.md soon. We use `intel-optimization-for-horovod` to implement efficient multi-GPU training...

@nevakrien Here is README.md for multi-GPU: https://github.com/intel/intel-extension-for-tensorflow/tree/main/examples/pretrain_bert#convergence, you can follow it. And the patch is updated, too. Please use the latest patch.

@nevakrien You can use `export ZE_AFFINITY_MASK=gpu_ids` to limit device visibility. For example, > export ZE_AFFINITY_MASK=0 # using XPU:0 > export ZE_AFFINITY_MASK=2,3 # using XPU:2 and XPU:3 More details can be...

@nevakrien I'm sorry it confused you that `the python code ran multiple times`, it is because we use lamb optimizer for bert pretraining which followed [NV examples](https://github.com/NVIDIA/DeepLearningExamples/tree/master/TensorFlow2/LanguageModeling/BERT#features). It contains 2...

``` print("STATING UNGODLY HACK") # Assuming you want TensorFlow to see only the first GPU gpus = tf.config.experimental.list_physical_devices('XPU') #gpus=[] if gpus: try: # Only the first GPU will be visible...

As for the multi gpu version, the log looks OK. Could you please wait for hours to see if there are any logs printed out? Or you can use `xpu-smi...

It seems that `intel_gpu_top` is more suitable for games or videos performance checks. Could you please use `xpu-smi dump` to see Intel data center GPUs device statistics. For example, `xpu-smi...