salman issues

Results 41 issues of


                                            salman

[Docs] Write PPO and DPO recipe docpages

Write docpages for #1005 and the various losses supported for our DPO recipe

documentation

[RFC][Docs][WIP] Custom Recipe Tutorial

#### Context What is the purpose of this PR? Is it to - [ ] add a new feature - [ ] fix a bug - [X] update tests and/or...

CLA Signed

Toggling KV-caches

#### Context What is the purpose of this PR? Is it to - [x] add a new feature - [x] fix a bug - [ ] update tests and/or documentation...

CLA Signed

Implement quantized model inference for `generate_v2`

We'll probably also need #1782.

enhancement

high-priority

inference

Guard core version when compiling quantized models

https://github.com/pytorch/torchtune/issues/1795

bug

high-priority

[RFC] Classification task fine-tuning

There has been some community appetite for classification tasks #1249 #1124. Incidentally, due to the use of classification models for RLHF, we already have some of the necessary components to...

discussion

community help wanted

Remove `GemmaTransformerDecoder` and `TiedEmbeddingTransformerDecoder`

Now that we've deprecated these classes we can remove them in an upcoming release.

better engineering

[RFC] Supporting KV-cache toggling

### Problem Currently, when we use `model.setup_caches()`, KV-caches are always updated for every subsequent forward pass on the model. We have valid use cases for using `model.setup_caches()`, but then not...

rfc

inference

[WIP] Config Continous Integration (CCI)

#### Context What is the purpose of this PR? Is it to - [ ] add a new feature - [ ] fix a bug - [x] update tests and/or...

CLA Signed

Failing tests on MPS

Several of our tests fail due to precision differences between MPS and CUDA/CPU falling outside of numerical tolerances on our tests: ```bash $ pytest tests FAILED tests/torchtune/generation/test_generation.py::TestGenerate::test_stop_tokens_batched[prompt_tokens_batched-generation_model_no_kv_cache] - assert False...

bug