bschifferer comments

Results 108 comments of


                                            bschifferer

[RMP] Benchmarking Session-Based Models

A detailed view is available here: https://docs.google.com/document/d/1g5FUrdhZQzef1OWwiQLfNNGdHr4a71Cr-jqndl-SoQg/edit#

[RMP] Benchmarking Session-Based Models

Collecting results in a google spreadsheet (details) + some slides as a summary

[BUG] UserWarning: You have more processes(4) than dataset [1,1]<stderr>: partitions(1), reduce the number of processes.

Hello @ssubbayya , thanks for reporting the bug. You are correct. I found a workaround that it will train: You need to: - add parameters global_size=1, global_rank=0 when initialising the...

[BUG] UserWarning: You have more processes(4) than dataset [1,1]<stderr>: partitions(1), reduce the number of processes.

train = Dataset(os.path.join(args.path, "train", "part_" + str(MPI_RANK) + ".parquet")) valid = Dataset(os.path.join(args.path, "valid", "part_" + str(MPI_RANK) + ".parquet")) Can you try to add part_size parameter to the Dataset above? Dataset(os.path.join(args.path,...

[BUG] UserWarning: You have more processes(4) than dataset [1,1]<stderr>: partitions(1), reduce the number of processes.

@ssubbayya `ValueError: None values not supported.` sounds that the dataset contains NaN values / None values, is that correct? You should be able to test it like this Dataset().to_ddf().isna().sum().compute() Can...

Introduce distributed embeddings

The distributed embedding examples uses a custom train step functions: https://github.com/NVIDIA-Merlin/distributed-embeddings/blob/main/examples/dlrm/main.py#L201-L215 In my understanding, distributed embedding does NOT work with keras model.fit function: https://github.com/NVIDIA-Merlin/models/pull/974/files#diff-1e42e5c4771f01c26b3c78c545eb341590a4406b2c5af8da0491ab4b7ea51464R80 I think we need the distributed...

bschifferer

[RMP] Benchmarking Session-Based Models

[RMP] Benchmarking Session-Based Models

[BUG] UserWarning: You have more processes(4) than dataset [1,1]<stderr>: partitions(1), reduce the number of processes.

[BUG] UserWarning: You have more processes(4) than dataset [1,1]<stderr>: partitions(1), reduce the number of processes.

[BUG] UserWarning: You have more processes(4) than dataset [1,1]<stderr>: partitions(1), reduce the number of processes.

Introduce distributed embeddings

[Task] Add all notebooks for unittests

[Task] Add booking.com solution to Merlin Models example

[DOC]Models - Docstring bash comments

[Task] Update notebooks to note we require TF 2.8+ in the notebooks.