
Example of distributed training with TensorFlow.

Open keisuke-umezawa opened this issue 4 years ago • 5 comments

Motivation

Currently, we do not have an example of distributed training with TensorFlow. Such an example would be useful for TensorFlow users.

https://www.tensorflow.org/guide/distributed_training

Description

Create an example of distributed training with TensorFlow.

keisuke-umezawa, Apr 19 '20 08:04

Still seems to be an open issue.

Just a note that, for the time being, we cannot test GPU or TPU code in our CI environment. @keisuke-umezawa could you elaborate a bit on what type of distributed examples you had in mind? I think that would make this issue easier to tackle.

hvy, Aug 04 '20 05:08

This issue has not seen any recent activity.

github-actions[bot], May 20 '21 23:05

I'm not familiar with the TensorFlow ecosystem, but this Keras distributed training tutorial provides a more concrete code example than the TensorFlow documentation. In the tutorial, a simple convolutional neural network is trained on the MNIST dataset in parallel.

So, how about creating a Keras distributed example based on the tutorial, and then translating it to use the TensorFlow APIs? @keisuke-umezawa @hvy

CC @sfujiwara Please feel free to add any comments on this topic.

toshihikoyanase, Jun 01 '22 12:06

> So, how about creating a Keras distributed example based on the tutorial, and then translating it to use the TensorFlow APIs?

That's a better idea!

keisuke-umezawa, Jun 06 '22 03:06

This issue has not seen any recent activity.

github-actions[bot], Jun 20 '22 23:06

This issue was closed automatically because it had not seen any recent activity. If you want to discuss it, you can reopen it freely.

github-actions[bot], Sep 29 '22 23:09