robberlang

Results 8 comments of robberlang

With PR #753 you can get all the inputs of your batch sent to your model function at once.

It's not possible that your single query of 100K inputs (using `input_batch`) get sent to your model as a single request. I came across the same problem. The PR I...

DId you build the query frontend docker image with my branch, and specify that image when starting Clipper?

This helped me, thanks. The `git clone` line (line 20) needs a backslash at the end of it though.

The website can be built from this [repo](/ucbrise/clipper-website)

You should have received an email with info for getting the model and vocab files. Thanks.

The machine I'm using supports AVX-512. Trying with packed8avx2 gives this: `Error: FBGEMM doesn't allow to use AVX2 packing order on AVX512 CPUs` and `Error: Aborted from void marian::cpu::variant::fbgemmPacked8Gemm(marian::Tensor, marian::Tensor,...

I've figured it out. The problem is that `marian-conv` is quantizing `decoder_ff_logit_out_Wt`, and the reason it does that, the reason that that parameter exists, is that I had trained the...