Zack Ziegler

3 comments by Zack Ziegler

Everything’s in nats by default, so you’ll have to convert from base e to base 2 to get bits.
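For reference, converting from nats to bits just means dividing by ln 2. A minimal sketch, where `nats_to_bits` is a hypothetical helper name and not something from the codebase:

```python
import math


def nats_to_bits(nats: float) -> float:
    """Convert a quantity in nats (log base e) to bits (log base 2)."""
    # log2(x) = ln(x) / ln(2), so a value in nats is divided by ln(2).
    return nats / math.log(2)


# Sanity check: a fair coin has entropy ln(2) nats, which should be 1 bit.
fair_coin_nats = math.log(2)
print(nats_to_bits(fair_coin_nats))
```

The same factor applies to any loss or entropy reported in nats, e.g. a per-token cross-entropy.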

Hi, this is a preprocessing choice we inherited from previous summarization work with OpenNMT, which found that the first 400 tokens are often plenty to compose a good summary. That...
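As a rough illustration of that kind of truncation, here is a sketch that assumes plain whitespace tokenization. The actual preprocessing pipeline and tokenizer may differ, and `truncate_tokens` is a hypothetical name:

```python
def truncate_tokens(text: str, max_tokens: int = 400) -> str:
    """Keep only the first `max_tokens` whitespace-separated tokens."""
    # Assumption: tokens are whitespace-delimited; a real pipeline may
    # use a different tokenizer.
    tokens = text.split()
    return " ".join(tokens[:max_tokens])


# Usage: a 1000-token document is cut down to its first 400 tokens.
document = " ".join(f"tok{i}" for i in range(1000))
shortened = truncate_tokens(document)
print(len(shortened.split()))
```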

Weird model! If you’re doing the usual VQ-VAE thing with the straight through estimator, could there be some kind of accumulation of error? I can’t picture the model exactly, but...