swift-models icon indicating copy to clipboard operation
swift-models copied to clipboard

GPT Inference Model has a shape mismatch

Open compnerd opened this issue 4 years ago • 3 comments

When running the GPTInferenceUI on Windows, it seems that the model shape mismatch is back:

Fatal error: Incompatible shapes: [1,43,768] vs. [1,43]: file S:\4\tensorflow-swift-apis\Sources\TensorFlow\bindings\EagerExecution.swift, line 301

As soon as the model is executed, it fails.

compnerd avatar Apr 24 '20 21:04 compnerd

We walked through this yesterday on a GCP VM and couldn't reproduce, so @compnerd will post additional details about what he's seeing.

texasmichelle avatar Apr 28 '20 14:04 texasmichelle

Interestingly enough, I was able to reproduce it again. The interesting thing is that I noticed that the application terminated improperly during the download, I dear that this could be more complicated (some subtle bug and lack of validation).

compnerd avatar Apr 29 '20 14:04 compnerd

OK, it's relevant that this is an intermittent issue. Adding @xihui-wu, in case she encounters this while building out benchmarks.

texasmichelle avatar Apr 29 '20 15:04 texasmichelle