benchmarks issues

Benchmark performance drops significantly when using map_and_batch

8

After taking the latest benchmarks, we noticed a performance drop on the inception3 and resnet152 models. Tested with TensorFlow r1.5 on 32x P100 GPUs (8 servers), ImageNet data, batch size 64...

eladweiss

Remove unnecessary call to glob

3

On slow filesystems, calling glob twice can have a significant performance penalty. Only one call is necessary.

bherta

cla: yes
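
A minimal sketch of the idea behind the glob issue above: scan the filesystem once and reuse the result instead of calling `glob` a second time. The function name `list_data_files` and the `<subset>-*` file pattern are assumptions made for illustration; they are not taken from the benchmark code.

```python
import glob
import os


def list_data_files(data_dir, subset):
    """Return the sorted files matching '<subset>-*' using a single glob call.

    Hypothetical helper: the name and the file pattern are illustrative only.
    """
    pattern = os.path.join(data_dir, "%s-*" % subset)
    # The only filesystem scan. The result is kept in a local variable and
    # reused for both the emptiness check and the return value.
    files = glob.glob(pattern)
    if not files:
        raise ValueError("No files found matching %s" % pattern)
    return sorted(files)
```

Checking `files` rather than calling `glob.glob(pattern)` again for the emptiness test is what avoids the second directory scan on a slow filesystem.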

ValueError: Boundaries (<dtype: 'int32'>) must have the same dtype as x (<dtype: 'int64'>).

2

The environment: CentOS 7.3, CUDA 8.0, cuDNN 6.0, Tesla P40, tensorflow_gpu 1.3 ``` [root@Tensorflow tf_cnn_benchmarks]# python /home/benchmarks-tf_benchmark_stage/scripts/tf_cnn_benchmarks.py --num_gpus=8 --batch_size=128 --model=resnet50 --variable_update=parameter_server TensorFlow: 1.3 Model: resnet50 Mode: `training` Batch size: 1024 global 128 per device...

xshow-xs

Resnet56 with CIFAR-10 produces huge log file

10

Hi, I'm trying to train resnet56 on CIFAR-10 with the following parameters. However, each time I start the run, it creates a log file of 1.2 GB or 2.4 GB. Somehow...

tjingrant

stat:contributions welcome

VariableMgrDistributedReplicated decreases the speed of convergence

9

@reedwm Hi, I am having trouble with the following code. """ for i, (g, v) in enumerate(grads): apply_gradient_op = opt.apply_gradients([(g, v)]) barrier = self.benchmark_cnn.add_sync_queues_and_barrier( 'replicate_variable_%s' % i, [apply_gradient_op]) """...

Sampson1107

stat:awaiting response

Error running in replicated mode

8

System information: OS Platform: Ubuntu 16.04; TensorFlow: installed from source; Python version: Python 2.7.5. 1. Run with the command: `python tf_cnn_benchmarks.py --num_batches 100 --display_every 1 --num_gus 8 --model resnet50...

Agoniii

stat:awaiting response

ImportError: cannot import name interleave_ops

13

After I pulled and merged the latest commit, I got the `ImportError`. The error log is attached below: ``` Traceback (most recent call last): File "tf_cnn_benchmarks.py", line 26, in...

DjangoPeng

Different kinds of preprocessing

1

Inside `preprocessing.py` you are using inception preprocessing to train images and vgg preprocessing for evaluation, according to slim. It's a little bit confusing. If you want to train a different...

chrisrn

DenseNet in this benchmark has "naive" implementation, will underperform

1

At the time of writing, DenseNet is implemented in this benchmark with what is described as the "naive" implementation in [Memory-Efficient Implementation of DenseNets](http://arxiv.org/abs/1707.06990). This will underperform compared to...

ahundt

stat:community support
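
To make the "naive" DenseNet concern above more concrete, here is a rough sketch that counts how many feature-map channels a dense block holds under two storage strategies. It is a simplified accounting under stated assumptions: it ignores spatial size, batch-norm outputs, and gradients. It is neither the linked paper's exact method nor code from this benchmark, and all function and parameter names are hypothetical.

```python
def naive_concat_channels(initial_channels, growth_rate, num_layers):
    """Channels held if every layer keeps its own copy of its concatenated input."""
    total = 0
    for layer in range(num_layers):
        # Layer `layer` (0-indexed) sees the block input plus all earlier outputs.
        total += initial_channels + layer * growth_rate
    return total


def shared_buffer_channels(initial_channels, growth_rate, num_layers):
    """Channels held if concatenated inputs are built in one reusable buffer."""
    # The block input and each layer's output are stored exactly once.
    features = initial_channels + num_layers * growth_rate
    # One shared buffer sized for the largest concatenated input (the last layer's).
    largest_concat = initial_channels + (num_layers - 1) * growth_rate
    return features + largest_concat


if __name__ == "__main__":
    for depth in (6, 12, 24):
        print(depth,
              naive_concat_channels(64, 32, depth),
              shared_buffer_channels(64, 32, depth))
```

Under these assumptions the naive count grows quadratically with the number of layers, while the shared-buffer count grows linearly. The values 64 and 32 are arbitrary illustrative choices for the initial channel count and growth rate.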

Need for a better focus on details.

5

This issue can be taken as a feature request or a documentation request. The high-performance benchmarking example is a good effort. However, the code is very fused (combining distributed...

ghost
