annotated-transformer
nbatches vs batch_size
In the forward method you currently write `nbatches = query.size(0)`, but this value appears to be the batch size (the number of examples in the current batch), not the total number of batches the network is trained on.
I don't know whether some people use this nomenclature differently, but I think renaming the variable to `batch_size` would be clearer.
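To make the distinction concrete, here is a minimal sketch in plain Python. It assumes `query` has shape (batch, seq_len, d_model), which is my reading of the forward method; the numbers and the `num_batches` helper are hypothetical and only there for illustration.

```python
import math


def num_batches(num_examples: int, batch_size: int) -> int:
    """Batches needed to cover the dataset once (the last one may be smaller)."""
    return math.ceil(num_examples / batch_size)


# Hypothetical numbers, chosen only for illustration.
num_examples = 1000
batch_size = 64
seq_len, d_model = 10, 512

# Assumed shape of a single `query` tensor: (batch, seq_len, d_model).
query_shape = (batch_size, seq_len, d_model)

# This is the value query.size(0) would return for a tensor of that shape.
first_dim = query_shape[0]

print(first_dim)                              # 64, the batch size
print(num_batches(num_examples, batch_size))  # 16, the number of batches
```

The two quantities are unrelated in general, which is why I think `batch_size` is the less surprising name for `query.size(0)`.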