Albert Zeyer

Results: 300 issues by Albert Zeyer

Many of the jobs' `hash` functions modify `kwargs` in place. E.g.:

```python
class CountNgramsJob(Job):
    ...

    @classmethod
    def hash(cls, kwargs):
        """delete the queue requirements from the hashing"""
        del kwargs["mem_rqmt"]
        del kwargs["cpu_rqmt"]
        del ...
```
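For comparison, a minimal sketch of the same hash without the in-place modification (the import and the `super().hash(...)` call are assumptions about the Sisyphus `Job` API; the point is just the copy):

```python
from sisyphus import Job


class CountNgramsJob(Job):
    @classmethod
    def hash(cls, kwargs):
        """Exclude the queue requirements from the hashing, without mutating the caller's dict."""
        kwargs = dict(kwargs)  # work on a copy, do not delete from the original kwargs
        kwargs.pop("mem_rqmt", None)
        kwargs.pop("cpu_rqmt", None)
        return super().hash(kwargs)
```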

Currently I run `setup_job_symlinks()` (code below) at the end of some of my Sis scripts (Sis config), which will go through the Sis job graph, look for matching jobs, and...
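The full `setup_job_symlinks` code is cut off in this preview; as a minimal self-contained sketch of just the symlink part (collecting the readable-name → work-dir mapping from the Sis job graph is exactly the part omitted here, and the helper below is only an illustration):

```python
import os


def make_symlinks(jobs: dict, link_dir: str):
    """Create human-readable symlinks link_dir/<name> -> <job work dir>.

    `jobs` maps a readable name to the job's work directory; how that mapping
    is collected from the Sis job graph is not shown here.
    """
    os.makedirs(link_dir, exist_ok=True)
    for name, job_dir in jobs.items():
        link = os.path.join(link_dir, name)
        if os.path.islink(link):
            os.remove(link)  # refresh stale links
        os.symlink(job_dir, link)
```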

I just want to raise this here. This also includes the README. A lot of parts are still TF-specific, and many of them don't even mention that. PyTorch-relevant documentation...

Again a crash. It ultimately failed with this (after retrying a few times):

```
OSError: [Errno 28] No space left on device
```

Log:

```
FileCache: Copy file /rwthfs/rz/cluster/home/az668407/setups/2025-08-aed-large/work/i6_core/datasets/huggingface/TransformAndMapHuggingFaceDatasetJob.FxPUVJtw1EeN/output/dataset/train/data-00405-of-00848.arrow...
```

CI run log [tf-tests (3.8, 2.10.0, TEST=TFNetworkLayer)](https://github.com/rwth-i6/returnn/actions/runs/18276719792/job/52030410528?pr=1774#logs).

```
Python env: python is /opt/hostedtoolcache/Python/3.8.18/x64/bin/python
Python 3.8.18
NumPy: 1.24.4
TensorFlow: v2.10.0-rc3-6-g359c3cdfc5f 2.10.0 /home/runner/.local/lib/python3.8/site-packages/tensorflow/__init__.py
```

Relevant log:

```
___________________________ test_ConvLayer_empty_out ___________________________
Traceback (most...
```

_Originally posted in [#1257](https://github.com/rwth-i6/returnn/issues/1257#issuecomment-3185864489), about the `HuggingFaceDataset`, but the discussion applies just the same to most of our datasets, e.g. `HDFDataset`, `OggZipDataset`, etc._

> [We] do random access here...

It is being called (after `handle_cached_files_in_config`, after we created a dataset with cached files). But then, the `_file_cache` attribute is never used anywhere, as far as I can see. This...

See my recently added `test_DistributeFilesDataset_sharding`. I was expecting that `global_seq_idx == len(hdf_files) * num_seqs // distrib_size` in the end. But this is not the case. When looking at `DistributeFilesDataset.init_seq_order`, I...
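A small worked example of why this expectation can fail, assuming the sharding happens at the granularity of whole files (an assumption about `DistributeFilesDataset` for illustration, with made-up numbers):

```python
# Hypothetical numbers: 10 HDF files, 7 seqs each, 4 shards.
hdf_files, num_seqs, distrib_size = 10, 7, 4

expected = hdf_files * num_seqs // distrib_size  # 70 // 4 = 17

# If whole files are distributed over shards, the shards get 3, 3, 2, 2 files,
# i.e. 21, 21, 14, 14 seqs -- none of which equals 17.
files_per_shard = [
    hdf_files // distrib_size + (1 if i < hdf_files % distrib_size else 0)
    for i in range(distrib_size)
]
seqs_per_shard = [n * num_seqs for n in files_per_shard]
print(expected, seqs_per_shard)  # 17 [21, 21, 14, 14]
```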

Currently, in many of our setups, we make use of a so-called "devtrain" dataset, which is just a subset of the train dataset that is then used for evaluation (to...
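As a rough sketch of the pattern (the dataset class and option names below are placeholders for whatever a given setup actually uses, not a concrete recommendation):

```python
# Hedged sketch: "devtrain" reuses the training data, but only a small fixed
# subset of it, and is then evaluated like "dev".
train = {"class": "OggZipDataset", "path": "train.ogg.zip", "partition_epoch": 20}
dev = {"class": "OggZipDataset", "path": "dev.ogg.zip"}
eval_datasets = {
    "devtrain": {"class": "OggZipDataset", "path": "train.ogg.zip", "fixed_random_subset": 3000},
}
```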

With `MultiProcDataset` and `sharding_method = "seq_order"` (default), the seq order shuffling will be done by the first worker (sub dataset), and then we evenly split it up over the workers...
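A minimal sketch of the idea described above (whether the real split is round-robin or contiguous chunks is cut off in this preview; round-robin is an assumption here):

```python
import numpy as np

num_workers = 4
rng = np.random.default_rng(42)

# One worker (the first sub dataset) computes the full shuffled seq order ...
seq_order = rng.permutation(1000)

# ... and it is then split evenly over the workers, e.g. round-robin:
per_worker = [seq_order[i::num_workers] for i in range(num_workers)]
assert sum(len(p) for p in per_worker) == len(seq_order)
```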