andrewor14 issues

Results 29 issues of


                                            andrewor14

Add generic fake quantized linear for QAT

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #1020 **Summary:** This commit adds a generic fake quantized linear module to replace the uses of the existing more specific QAT linears....

CLA Signed

Make module swap the main QAT flow again

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #1020 * __->__ #1019 **Summary:** Following https://github.com/pytorch/ao/issues/987, this commit makes module swap the main QAT flow today. We remove all tensor subclass fake...

CLA Signed

Move and rename GranularityType -> Granularity

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #1020 * __->__ #1038 * #1037 Summary: Move GranularityType to quant_primitives.py to be consistent with other similar fields like MappingType and ZeroPointDomain. Test...

CLA Signed

Make module swap the main QAT flow again

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * #1020 * #1038 * __->__ #1037 Summary: Following https://github.com/pytorch/ao/issues/987, this commit makes module swap the main QAT flow today. We remove all tensor...

CLA Signed

Fix QAT module swap BC

Stack from [ghstack](https://github.com/ezyang/ghstack) (oldest at bottom): * __->__ #1058 Add back _module_swap_api.Int8DynActInt4WeightQATLinear. Differential Revision: [D64252460](https://our.internmc.facebook.com/intern/diff/D64252460)

CLA Signed

Loss not going down for fine-tuning Llama3-8B on C4

I'm fine-tuning Llama3-8B on the C4 dataset (en subset) for 2000 steps using the `full_finetune_distributed` recipe. I find that the loss did not go down at all and the quantized...

bug

high-priority

Can we support outputting checkpoints directly in .pt format?

Today we need to do an extra conversion step according to this README: https://github.com/pytorch/torchtitan/blob/main/docs/checkpoint.md ``` python -m torch.distributed.checkpoint.format_utils dcp_to_torch outputs/checkpoint/step-100 /tmp/checkpoint.pt ``` I think we should **provide an option for...

enhancement

module: checkpoint

oncall: export

andrewor14

Add generic fake quantized linear for QAT

Make module swap the main QAT flow again

Move and rename GranularityType -> Granularity

Make module swap the main QAT flow again

Fix QAT module swap BC

Loss not going down for fine-tuning Llama3-8B on C4

Can we support outputting checkpoints directly in .pt format?

Feature request: support different input/output formats in the same recipe

QAT range learning tracker

torch.export failed with tensor subclass, worked in 2.9.0