Kristian Klemon issues

Results 12 issues of


                                            Kristian Klemon

Python API

Many thanks for your awesome project! While it works perfectly using the provided command line script, it's not straightforward to use it directly from code, e.g. when the FID computation...

Incorrect sequence length calculation in usage example

**Bug description** The usage example in the project's README uses the length of the original protein sequence to determine the slicing indices for the model output with respect to padding....

Can ZLUDA compile and run FlashAttention?

[FlashAttention](https://arxiv.org/abs/2205.14135) is an IO-aware, highly optimized implementation of the Attention mechanism for Nvidia GPUs. Since it's implemented in CUDA, it should in principle also compile and run with ZLUDA, right?...

FlashAttention implementation

Inspired by this work, I implemented the Perceiver architecture with out-of-the-box FlashAttention support. It offers a great speedup over a naive implementation and up to 16x increased input sequence lengths...

FlashAttention support

[FlashAttention](https://github.com/Dao-AILab/flash-attention) is an approximation-free fast and memory efficient implementation of multi-head attention. In particular, it reduces the memory requirement from $\mathcal{O}(n^2)$ to $\mathcal{O}(n)$ and would therefore be highly beneficial for...

feat: optional restoring of ComfyUI snapshots to bake custom nodes into the Docker Image

## Motivation Many users and workflows rely on custom nodes and extensions which correspondingly should also be available in a RunPod worker. [ComfyUI Manager](https://github.com/ltdrdata/ComfyUI-Manager), perhaps the most popular ComfyUI management...

Kristian Klemon

Python API

Incorrect sequence length calculation in usage example

Can ZLUDA compile and run FlashAttention?

FlashAttention implementation

FlashAttention support

feat: optional restoring of ComfyUI snapshots to bake custom nodes into the Docker Image

Feature: Support image subfolders

Publish to PyPI

PyTorch Implementation

Allow providing a custom API prefix for the serverless test server