Aleksandr Dremov

Results 11 issues of Aleksandr Dremov

### Issue Link :link: https://github.com/Alamofire/AlamofireImage/issues/430 ### Goals :soccer: - Implement a basic SwiftUI view for lazy image download - Documentation ### Implementation Details :construction: - Created LazyImage view that loads image...

At the moment, no collection addresses random insertions. I have a data structure implemented as an implicit Treap that behaves like a general array but can do random insertions/deletions/access with...

enhancement
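
For readers unfamiliar with the structure named in the issue above, here is a minimal sketch of an implicit treap in Python. It is not the author's implementation: the class name `ImplicitTreap` and the helpers `_split` and `_merge` are hypothetical, and the sketch only illustrates the general technique of keying nodes by subtree size so that positional insert, delete, and access each run in expected O(log n) time.

```python
import random


class _Node:
    __slots__ = ("value", "priority", "size", "left", "right")

    def __init__(self, value):
        self.value = value
        self.priority = random.random()  # random heap priority keeps the tree balanced in expectation
        self.size = 1                    # number of nodes in this subtree
        self.left = None
        self.right = None


def _size(node):
    return node.size if node else 0


def _update(node):
    node.size = 1 + _size(node.left) + _size(node.right)


def _split(node, k):
    """Split a treap into (first k elements, remaining elements)."""
    if node is None:
        return None, None
    if _size(node.left) >= k:
        left, node.left = _split(node.left, k)
        _update(node)
        return left, node
    node.right, right = _split(node.right, k - _size(node.left) - 1)
    _update(node)
    return node, right


def _merge(a, b):
    """Concatenate two treaps; every element of a precedes every element of b."""
    if a is None:
        return b
    if b is None:
        return a
    if a.priority > b.priority:
        a.right = _merge(a.right, b)
        _update(a)
        return a
    b.left = _merge(a, b.left)
    _update(b)
    return b


class ImplicitTreap:
    """Hypothetical array-like container backed by an implicit treap."""

    def __init__(self, items=()):
        self._root = None
        for item in items:
            self.insert(len(self), item)

    def __len__(self):
        return _size(self._root)

    def insert(self, index, value):
        # Out-of-range indices are clamped to the front or the back.
        left, right = _split(self._root, index)
        self._root = _merge(_merge(left, _Node(value)), right)

    def delete(self, index):
        if not 0 <= index < len(self):
            raise IndexError(index)
        left, rest = _split(self._root, index)
        target, right = _split(rest, 1)
        self._root = _merge(left, right)
        return target.value

    def __getitem__(self, index):
        node = self._root
        while node:
            left_size = _size(node.left)
            if index < left_size:
                node = node.left
            elif index == left_size:
                return node.value
            else:
                index -= left_size + 1
                node = node.right
        raise IndexError(index)

    def to_list(self):
        # Iterative in-order traversal yields the elements in positional order.
        out, stack, node = [], [], self._root
        while stack or node:
            while node:
                stack.append(node)
                node = node.left
            node = stack.pop()
            out.append(node.value)
            node = node.right
        return out
```

In this sketch the position of an element is never stored; it is derived from subtree sizes during descent, which is why inserting in the middle does not require shifting later elements.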

### Problem description I wanted to test the quality metrics of several different algorithms from `crowdkit.aggregation.classification` and found myself writing this kind of function: ```python def get_scores(model, data, fit=True): if fit:...

enhancement

If protocol#1 inherits from protocol#2, which is imported from a different module, then protocol#2's requirements are not met: ```swift /// @mockable protocol Foo: Equatable { func foo() } ``` ```swift class FooMock: Foo {...

Hello! I am trying to implement multi-stage training with fp8 autocast. However, when I load the checkpoint from the first training stage using torch's `load_state_dict(...)`, the loss quickly explodes. Are there any global...

While running RMS norm, I got the following exception: ``` /workspace/TransformerEngine/transformer_engine/common/transformer_engine.cpp:39 in function CheckInputTensor: Assertion failed: t.data.dptr != nullptr. Input x is not allocated! File "/usr/local/lib/python3.11/dist-packages/transformer_engine/pytorch/module/rmsnorm.py", line 50, in forward...

bug

It seems that packed tensors in `thd` format are currently not supported by `transformer_engine.pytorch.attention.DotProductAttention`. That's weird, as this mode is clearly supported by `fused_attn_fwd` from fused_attn cpp_extensions I...

### Feature description Currently, all messages are sent in plain text mode. Allowing text formatting would be very useful. To do that, a `parse_mode` option should be passed in...

enhancement

### Describe the issue Executions per day show NaN ### Environment - Kestra Version: v19.0.1 - Operating System (OS/Docker/Kubernetes): Docker - Java Version (if you don't run kestra in Docker):...

bug

### 🐛 Describe the bug When testing flex attention with the mask from issue #141126, I found that, apart from being incorrect, it is also significantly slower than torch's generic...