Sudhakar Singh

7 issues by Sudhakar Singh

Added multi-host pjit tests to the existing test setup of 2 nodes and 1 process per GPU

If the local conv (or "causal conv1d") is intended to shift the tokens by 1, then this should be `padding=d_conv` instead of `padding=d_conv - 1`, shouldn't it? (Or can...
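The padding question can be checked on a toy input. The following is a minimal pure-Python sketch, assuming a 1-D cross-correlation with symmetric zero padding whose output is truncated to the first `seq_len` positions (a common causal-conv idiom); `conv1d_truncated` is a hypothetical helper, not code from the repository under discussion.

```python
def conv1d_truncated(x, weights, padding):
    """Cross-correlate x with weights after zero-padding both sides,
    then keep only the first len(x) outputs (hypothetical helper)."""
    k = len(weights)
    padded = [0.0] * padding + list(x) + [0.0] * padding
    out = [
        sum(weights[j] * padded[t + j] for j in range(k))
        for t in range(len(padded) - k + 1)
    ]
    return out[: len(x)]


d_conv = 3
x = [1.0, 2.0, 3.0, 4.0]
# A kernel that reads only its last tap, so each output shows the newest
# input position that the window can see.
last_tap = [0.0, 0.0, 1.0]

same_step = conv1d_truncated(x, last_tap, padding=d_conv - 1)
shifted = conv1d_truncated(x, last_tap, padding=d_conv)
```

Under these assumptions, `padding=d_conv - 1` lets output position `t` see input `t` (causal, no shift), while `padding=d_conv` makes output `t` see at most input `t - 1` (a shift by one). Whether the shift is actually intended in the original code is the open question of the issue.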

# Description Expose `rotary_base` as an arg instead of hardcoding to 10000 Fixes # (issue) https://github.com/NVIDIA/TransformerEngine/issues/849 ## Type of change - [ ] Documentation change (change only to the documentation,...
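For context, a minimal sketch of where the base enters the standard rotary-embedding formulation, where the inverse frequencies are `base ** (-2i / dim)`. The function name `rope_inv_freq` is hypothetical and is not TransformerEngine's API; only the `rotary_base` parameter name is taken from the PR description.

```python
def rope_inv_freq(dim, rotary_base=10000.0):
    """Inverse frequencies for rotary embeddings: base^(-2i/dim), i in [0, dim/2)."""
    if dim % 2 != 0:
        raise ValueError("dim must be even")
    return [rotary_base ** (-(2 * i) / dim) for i in range(dim // 2)]


# Default base, matching the previously hardcoded value.
default = rope_inv_freq(8)
# A larger base lowers every non-zero frequency (illustrative value only).
long_ctx = rope_inv_freq(8, rotary_base=500000.0)
```

Exposing the base as an argument lets callers change the frequency spectrum without editing the layer code.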

# Description Trying to bake `fp8_model_init` into layer initialization. The `fp8_model_init` context manager then needs to be added/managed by the user. Baking it into TE layer initialization would allow...

It wasn't possible to run this tutorial https://help.getzep.com/graphiti/graphiti/lang-graph-agent consistently without adding this rate limiting API. ---- > [!IMPORTANT] > Adds rate limiting handling with exponential backoff to `generate_response()` in `openai_client.py`....
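A minimal sketch of rate-limit handling with exponential backoff, the technique the summary names. `RateLimitError` and `call_with_backoff` are hypothetical stand-ins, not the actual code in `openai_client.py`; a real client would catch the provider's own rate-limit exception.

```python
import random
import time


class RateLimitError(Exception):
    """Stand-in for a provider's rate-limit exception (hypothetical)."""


def call_with_backoff(fn, max_retries=5, base_delay=1.0, max_delay=30.0,
                      sleep=time.sleep):
    """Call fn(), retrying on RateLimitError with exponential backoff and jitter."""
    for attempt in range(max_retries + 1):
        try:
            return fn()
        except RateLimitError:
            if attempt == max_retries:
                raise  # Out of retries: surface the error to the caller.
            # Double the wait on each attempt, capped at max_delay.
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Add up to 10% jitter so concurrent callers do not retry in lockstep.
            sleep(delay + random.uniform(0, delay * 0.1))
```

The `sleep` parameter is injectable so the retry schedule can be tested without actually waiting.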

# Description Sliding Window Attention with CP for THD format is enabled with A2A communication. Fixes # (issue) ## Type of change - [ ] Documentation change (change only to...

2.10.0

# What does this PR do? Allows applying RoPE embeddings with "offsets" to sequences in packed (thd) or batch (bshd/sbhd) formats. The offset per sequence can be supplied as...
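To illustrate what a per-sequence offset means for the packed (thd) case, here is a small sketch that computes the position index of each token. It assumes sequences are described by cumulative lengths and that offsets arrive as a plain list; both `packed_positions` and its `offsets` argument are hypothetical and do not reflect the PR's actual interface.

```python
def packed_positions(cu_seqlens, offsets):
    """Position index for every token in a packed (thd) batch.

    cu_seqlens: cumulative sequence lengths, e.g. [0, 3, 5] for lengths 3 and 2.
    offsets: one starting position per sequence (hypothetical argument).
    """
    if len(offsets) != len(cu_seqlens) - 1:
        raise ValueError("need one offset per sequence")
    positions = []
    for i, offset in enumerate(offsets):
        length = cu_seqlens[i + 1] - cu_seqlens[i]
        # Tokens of sequence i are rotated as if they started at `offset`.
        positions.extend(range(offset, offset + length))
    return positions


positions = packed_positions([0, 3, 5], offsets=[0, 10])
```

Each position would then be multiplied by the rotary inverse frequencies to obtain the rotation angles for that token.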