Federico Belotti
Hi, I've read in the [official repo](https://github.com/facebookresearch/xcit#getting-started) that the minimum supported PyTorch version is 1.7.0, but your port uses `torch.div` with the keyword argument `rounding_mode='floor'`, which is available from...
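For context, `rounding_mode='floor'` rounds toward negative infinity, while code relying on truncating division rounds toward zero; the two only differ on negative operands, which is why the rounding mode matters for index arithmetic. A minimal pure-Python sketch of the distinction (helper names hypothetical):

```python
import math

def div_floor(a, b):
    """Floor division: rounds toward negative infinity,
    matching torch.div(..., rounding_mode='floor')."""
    return math.floor(a / b)

def div_trunc(a, b):
    """Truncating division: rounds toward zero,
    matching torch.div(..., rounding_mode='trunc')."""
    return math.trunc(a / b)

# The two agree on positive operands...
print(div_floor(7, 2), div_trunc(7, 2))    # 3 3
# ...but diverge when the signs differ:
print(div_floor(-7, 2), div_trunc(-7, 2))  # -4 -3
```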
We should add some tests function to test the trained model. Given some results, we should also update the readme @DavideTr8
[LongLora](https://arxiv.org/abs/2309.12307) is "an efficient fine-tuning approach that extends the context sizes of pre-trained large language models". They propose to fine-tune a model with sparse local attention while maintaining dense...
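As a rough sketch of the shifted sparse attention idea (not the paper's implementation): tokens are partitioned into local groups that attend only within themselves, and in half the attention heads the positions are cyclically shifted by half a group so information flows between neighboring groups. The helper below is hypothetical and only illustrates the index bookkeeping:

```python
def group_indices(seq_len, group_size, shift=False):
    """Partition token positions into local attention groups.

    With shift=True, positions are cyclically rolled by half a group
    before grouping, so each shifted group straddles two unshifted ones
    (the core trick of LongLoRA's shifted sparse attention).
    """
    pos = list(range(seq_len))
    if shift:
        s = group_size // 2
        pos = pos[s:] + pos[:s]  # cyclic shift by half a group
    return [pos[i:i + group_size] for i in range(0, seq_len, group_size)]

# Unshifted heads attend within [0..3] and [4..7]; shifted heads
# attend within groups that bridge the boundary between them.
print(group_indices(8, 4))             # [[0, 1, 2, 3], [4, 5, 6, 7]]
print(group_indices(8, 4, shift=True)) # [[2, 3, 4, 5], [6, 7, 0, 1]]
```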
The XSLT that translates MathML to LaTeX is very outdated, so it would be nice to refactor it against a current LaTeX math standard.
It would also be useful to translate to MathML or to the Mathematica language.
Follow-up of #1346. This PR introduces LongLora as in https://github.com/Lightning-AI/litgpt/issues/1237 for both LoRA and full fine-tuning, while also enabling it during generation. cc @rasbt
Hi everyone, [it has recently been proposed](https://arxiv.org/abs/2404.09610v1) to apply dropout directly to the LoRA weight matrices A and B: this favors sparsity, which improves generalization and reduces overfitting. The dropout...
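A minimal pure-Python sketch of the idea (all names hypothetical, plain nested lists standing in for tensors): dropout is applied element-wise to A and B themselves, rather than to the layer input as in standard LoRA dropout, so entire entries of the low-rank update are zeroed out:

```python
import random

def dropout_matrix(m, p, training=True, seed=None):
    """Element-wise dropout on a weight matrix (list of lists):
    zero each entry with probability p, scale survivors by 1/(1-p)."""
    if not training or p == 0.0:
        return [row[:] for row in m]
    rng = random.Random(seed)
    scale = 1.0 / (1.0 - p)
    return [[x * scale if rng.random() >= p else 0.0 for x in row]
            for row in m]

def lora_delta(A, B, p, seed=0):
    """Effective low-rank update B @ A, with dropout applied to the
    A and B matrices instead of the layer input."""
    Ad = dropout_matrix(A, p, seed=seed)       # (r, in)
    Bd = dropout_matrix(B, p, seed=seed + 1)   # (out, r)
    r, n = len(Ad), len(Ad[0])
    return [[sum(Bd[i][k] * Ad[k][j] for k in range(r)) for j in range(n)]
            for i in range(len(Bd))]
```

With `p=0.0` this reduces to the plain product `B @ A`; with `p>0` individual entries of A and B are dropped independently, which sparsifies the update itself.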
[Nucleus sampling](https://arxiv.org/abs/1904.09751) (top-p sampling in HF) is a dynamic sampling strategy that "truncat[es] the unreliable tail of the probability distribution, sampling from the dynamic nucleus of tokens containing the vast...
I was also about to open an issue regarding Dreamer on feature-vector-based (partially observable) environments where no CNN is needed (and, as a matter of fact, to also handle...
Hi everyone, in [this branch](https://github.com/Eclectic-Sheep/sheeprl/tree/feature/compile) one can use `torch.compile` to compile the Dreamer-V3 agent. In particular: * in `sheeprl/configs/algo/dreamer_v3.yaml` one can decide what to compile and which arguments to...