marian icon indicating copy to clipboard operation
marian copied to clipboard

negative cost with negative weights

Open tomsbergmanis opened this issue 2 years ago • 0 comments

The documentation says that word weights can be real-valued. Real numbers can be negative. This is convenient because outputs of models one could use for scoring often output log-likelihoods, which are negative numbers. image However, using negative weights results in a negative cost value, which, if minimized, maximizes the error: image

  • If this is a mistake in the documentation, could we update the documentation to say "non-negative numbers"?
  • If not, could we update the documentation with the steps to be taken to avoid divergence with negative weights?

Cheers, Toms

tomsbergmanis avatar Feb 24 '23 09:02 tomsbergmanis