#508 Effectively this means changing `Loss.get_default_target()` (because each loss class can already define its own default). The base function currently has this implementation:

```
@classmethod
def get_default_target(cls, extern_data):
    """
    :param TFNetwork.ExternData extern_data:
    :rtype: str|None
    """
    return extern_data.default_target
```
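For illustration, roughly how a loss subclass can define its own default by overriding this classmethod; a minimal sketch (the class name `MyLoss`, the key `"classes"`, and the import path are placeholders, not from the issue):

```
from returnn.tf.layers.basic import Loss


class MyLoss(Loss):
    """Example loss which prefers a specific target key."""
    class_name = "my_loss"

    @classmethod
    def get_default_target(cls, extern_data):
        """
        :param returnn.tf.network.ExternData extern_data:
        :rtype: str|None
        """
        # Prefer a specific key if the config defines it, otherwise
        # fall back to the base default (extern_data.default_target).
        if "classes" in extern_data.data:
            return "classes"
        return super(MyLoss, cls).get_default_target(extern_data)
```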
We should maybe define more exactly the behavior when the user specifies `out_type` (which is currently a bit inconsistent across layers; related: #541). In many cases, it would...
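For context, this is what a user-specified `out_type` looks like in a network dict; a minimal sketch (layer names and dims made up):

```
network = {
    "hidden": {
        "class": "linear", "activation": "relu", "from": "data", "n_out": 512,
        # Explicit output template: shape excludes the batch dim,
        # the time dim is None (dynamic). The open question is how this
        # should interact with what the layer infers itself.
        "out_type": {"dim": 512, "shape": (None, 512)},
    },
    "output": {"class": "softmax", "loss": "ce", "from": "hidden"},
}
```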
#508 Thus effectively removing the default data key "data". I don't have a strong opinion on this, so I just put this here and leave it open for discussion. This...
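For reference, a minimal sketch of where the default key `"data"` shows up in a typical config (dims made up):

```
extern_data = {
    # "data" is the implicit default input key: layers without an explicit
    # source (or with from="data") read from it.
    "data": {"dim": 40},
    # "classes" is the default target for losses.
    "classes": {"dim": 5000, "sparse": True},
}
# Removing the default would mean every layer/loss has to name its key explicitly.
```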
I think this depends on #530. In the case of `concat_sources=False`, the source names `data:0` etc. are not nice, and I think we can have better ways (this is #530). (I'm...
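To make the naming concrete, a sketch of a subnetwork with two separate sources (layer names and dims made up):

```
network = {
    "enc_a": {"class": "linear", "activation": "tanh", "n_out": 256, "from": "data"},
    "enc_b": {"class": "linear", "activation": "tanh", "n_out": 256, "from": "data"},
    "combine": {
        "class": "subnetwork", "from": ["enc_a", "enc_b"],
        "concat_sources": False,  # keep the two sources separate
        "subnetwork": {
            # With concat_sources=False, the sources show up inside the
            # subnetwork as "data:0", "data:1", ... -- the names in question.
            "add": {"class": "combine", "kind": "add", "from": ["data:0", "data:1"]},
            "output": {"class": "copy", "from": "add"},
        },
    },
    "output": {"class": "softmax", "loss": "ce", "from": "combine"},
}
```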
It would crash at an early step in the first epoch with a message like:

```
...
pretrain epoch 1, step 59, cost:ctc 6.582720208647288, cost:output/output_prob 6.08799995325171, error:ctc 0.9999999632127583, error:decision 0.0, ...
```
Existing configs should work as before, without any change in behavior. The datasets themselves (everything that derives from the class `Dataset`) will stay as is, as well as their API. It...
When the training crashes (e.g. GPU out-of-memory, inf/nan, or similar), it often happens that the process (SGE job, Slurm job) just hangs and does not exit.
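One generic way to work around such hangs (just a sketch, not RETURNN's actual mechanism) is a hard-exit watchdog that fires some time after an unhandled exception:

```
import os
import sys
import threading
import time


def install_crash_watchdog(timeout=30.0):
    """After an unhandled exception, allow `timeout` seconds for normal
    cleanup, then force the process to exit."""
    orig_excepthook = sys.excepthook

    def excepthook(exc_type, exc_value, exc_tb):
        orig_excepthook(exc_type, exc_value, exc_tb)

        def hard_exit():
            time.sleep(timeout)
            # os._exit skips atexit handlers and hanging non-daemon threads
            # (e.g. data-loader threads, or CUDA teardown which never returns).
            os._exit(1)

        threading.Thread(target=hard_exit, daemon=True).start()

    sys.excepthook = excepthook
```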
See the [overview of distributed TensorFlow in general (independent of RETURNN)](https://github.com/rwth-i6/returnn/wiki/Distributed-TensorFlow) for some background. This issue is about the specific implementation in RETURNN. This is somewhat orthogonal to the...
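For background only (plain TensorFlow, not the RETURNN-specific implementation this issue is about), the standard multi-worker setup looks like this:

```
import tensorflow as tf

# Each worker gets its cluster role via the TF_CONFIG env var, e.g.:
#   TF_CONFIG='{"cluster": {"worker": ["host1:2222", "host2:2222"]},
#               "task": {"type": "worker", "index": 0}}'
strategy = tf.distribute.MultiWorkerMirroredStrategy()
with strategy.scope():
    model = tf.keras.Sequential([tf.keras.layers.Dense(10)])
    model.compile(optimizer="sgd", loss="mse")
```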