JackCaoG


Thanks, I will try to take a look or find someone from my team to help. nvm, this is tf2, I only know pt/xla lol

https://github.com/pytorch/xla/blob/master/scripts/gen_lazy_tensor.py#L47 controls what gets passed to the shape function. I don't think we want to pass all int64 and bool args to the shape fn, because in many cases they don't...

Actually, in https://github.com/pytorch/xla/pull/3771/files I already made `bool` get passed to all shape fns.

https://github.com/pytorch/xla/pull/3771/files is merged; I think you just need to handle `dim` now.
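
To illustrate why these scalar args matter, here is a minimal plain-Python sketch (not the actual generated code; `reduce_shape` is a made-up name) of why an int64 like `dim` and a `bool` like `keepdim` need to reach the shape fn: the output shape of a reduction depends on both.

```python
from typing import List

# Hypothetical shape fn for a reduction such as sum(dim, keepdim).
# The real shape fns are generated C++, but the dependency is the same:
# the output shape cannot be computed without `dim` and `keepdim`.
def reduce_shape(self_sizes: List[int], dim: int, keepdim: bool) -> List[int]:
    out = list(self_sizes)
    if keepdim:
        out[dim] = 1      # reduced dim is kept as size 1
    else:
        del out[dim]      # reduced dim is dropped
    return out

assert reduce_shape([3, 10], dim=1, keepdim=True) == [3, 1]
assert reduce_shape([3, 10], dim=1, keepdim=False) == [3]
```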

Hmm, this might be a bit tricky. The results still seem somewhat close. Not sure if the accuracy gap is coming from XLA:GPU or from the way we lower `LayerNorm`.
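
One rough way to measure that gap (a hedged sketch; the shapes here are made up, and `xm.mark_step()` just forces the pending lazy graph to execute) is to compare an f32 eager reference against the f16 XLA result:

```python
import torch
import torch_xla.core.xla_model as xm

x = torch.randn(3, 10)
ln = torch.nn.LayerNorm(10)
ref = ln(x)  # f32 eager reference on CPU

device = xm.xla_device()
out = ln.half().to(device)(x.half().to(device))  # f16 LayerNorm on XLA
xm.mark_step()  # force execution of the pending lazy graph

print((out.cpu().float() - ref).abs().max())  # size of the accuracy gap
```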

I guess the todo item here is to dump the HLO of `b` and file an issue with the XLA:GPU team.
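
A minimal sketch of dumping that HLO, assuming the private debugging helper `torch_xla._XLAC._get_xla_tensors_hlo` (commonly used in PyTorch/XLA issues and tests, but not a stable public API):

```python
import torch
import torch_xla
import torch_xla.core.xla_model as xm

device = xm.xla_device()
x = torch.randn(3, 10, dtype=torch.float16, device=device)
b = torch.nn.LayerNorm(10).half().to(device)(x)

# Print the HLO text for the lazy graph pending behind `b`.
print(torch_xla._XLAC._get_xla_tensors_hlo([b]))
```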

The HLO dump I get is:

```
HloModule IrToHlo.44

ENTRY %IrToHlo.44 (p0.2: f32[10], p1.24: f16[3,10], p2.39: f32[10]) -> (f16[3,10]) {
  %constant.4 = f16[] constant(0), metadata={op_type="prim__Constant" op_name="prim__Constant" source_file="[email protected]" source_line=2501}
  %reshape.5 = f16[1]{0}...
```

Can you provide a bit more detail, like the HLO snapshot you are referring to?

@ronghanghu I will give you write access 😄

Thanks for raising this issue, I will try to find someone from the XLA:GPU team to answer this question.