[sparse] Migrate Float8SemiSparseTensor off of AQT
This PR migrates Float8DynamicActivationFloat8SemiSparseWeighConfig off of using the AQT CutlassSemiSparseLayout subclass.
The old AQT flow can still be used by passing version=1 into the config
Testing:
pytest test/quantization/quantize_/workflows/float8/test_float8_semi_sparse_tensor.py
:link: Helpful Links
:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3361
- :page_facing_up: Preview Python docs built from this PR
Note: Links to docs will display an error until the docs builds have been completed.
:x: 1 New Failure, 1 Unrelated Failure
As of commit 2298c3c0bcdb533006bf995162cfcd65aec48bad with merge base 1f9bfd7ebf054c616543052c5767a8e6972cbdf8 ():
NEW FAILURE - The following job has failed:
- Run 1xH100 Tests / test (H100, linux.aws.h100, --pre torch torchvision torchaudio fbgemm-gpu-genai --index-url https... / linux-job (gh)
test/integration/test_load_and_run_checkpoint.py::TestLoadAndRunCheckpoint::test_deprecated_hf_models_model_info3
BROKEN TRUNK - The following job failed but were present on the merge base:
👉 Rebase onto the `viable/strict` branch to avoid these failures
- Run Regression Tests / test-nightly (CUDA Nightly, linux.g5.12xlarge.nvidia.gpu, --pre torch --index-url https://downloa... / linux-job (gh) (trunk failure)
test/test_low_bit_optim.py::TestFSDP2::test_fsdp2
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@jcaip has imported this pull request. If you are a Meta employee, you can view this in D87560869.
cc @RandySheriff if you want to review the changes, couldn't add you as a reviewer for some reason.
I think we should split the tensor into 2, one for each packing format