Patrick Liu

Results 12 comments of Patrick Liu

hey @cehongwang @narendasan was this ever resolved? what was the issue? im seeing nan values as well for my network. thanks!

@narendasan can you point me to what resolved the issue? im still seeing this for my network after taking changes from HEAD of main

hey @peri044 , thanks for the response. i tried modelopt -> export on a simple model below. am i using this wrong or missing something obvious? im using non-strict export...

hey @lanluo-nvidia thanks for checking! here are my pytorch and modelopt versions: ``` nvidia-modelopt 0.29.0 nvidia-modelopt-core 0.29.0 torch 2.5.1 ```

thanks @lanluo-nvidia - upgrading to torch 2.6 resolves the issue. compiling the exported program gives me something unexpected though. for reference, the model (after `mtq.quantize()`) is: ``` JustAConv( (conv): QuantConv2d(...

@narendasan @apbose this is a stripped down portion of Meta's SAM2 (original at https://github.com/facebookresearch/segment-anything-2/blob/main/sam2/modeling/sam/prompt_encoder.py), with minor modifications of `_embed_points` fill in `CHECKPOINT_PATH` and run the below `python ...` ```python #...

you should see this fx Graph dump as part of the stack trace ``` While executing %cat_5 : [num_users=1] = call_function[target=torch.ops.aten.cat.default](args = ([%_frozen_param0, %where_3], 1), kwargs = {_itensor_to_tensor_meta: {: ((1,...