Results 49 comments of QinLuo

with `MoeHybridParallelPlugin` I met another problem: https://github.com/hpcaitech/ColossalAI/issues/5426 ``` grad = grad.to(master_moe_param.dtype).to(master_moe_param.device) AttributeError: 'NoneType' object has no attribute 'to' ```

without `EPMixtralSparseMoeBlock` in mixtral_policy, I met another problem: hangs at the following point: ![image](https://github.com/hpcaitech/ColossalAI/assets/1772912/af59c6fb-31a5-4de0-b348-72adbb32a31b)

> > with `MoeHybridParallelPlugin` I met another problem: #5426 > > ``` > > grad = grad.to(master_moe_param.dtype).to(master_moe_param.device) > > AttributeError: 'NoneType' object has no attribute 'to' > > ``` >...

Could someone in colossalai review it? Thx. cc @flybird11111

> Could you add unit tests to check this feature? I've tested in my local experiments ![image](https://github.com/hpcaitech/ColossalAI/assets/1772912/600a49e1-4011-4ca5-aaa5-48a1e293b34e) But how to test it in unittest: colossalai/tests

> > > Could you add unit tests to check this feature? > > > > > > I've tested in my local experiments ![image](https://private-user-images.githubusercontent.com/1772912/301859323-600a49e1-4011-4ca5-aaa5-48a1e293b34e.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MDcyMDYzMjQsIm5iZiI6MTcwNzIwNjAyNCwicGF0aCI6Ii8xNzcyOTEyLzMwMTg1OTMyMy02MDBhNDllMS00MDExLTRjYTUtYWFhNS00OGExZTI5M2IzNGUucG5nP1gtQW16LUFsZ29yaXRobT1BV1M0LUhNQUMtU0hBMjU2JlgtQW16LUNyZWRlbnRpYWw9QUtJQVZDT0RZTFNBNTNQUUs0WkElMkYyMDI0MDIwNiUyRnVzLWVhc3QtMSUyRnMzJTJGYXdzNF9yZXF1ZXN0JlgtQW16LURhdGU9MjAyNDAyMDZUMDc1MzQ0WiZYLUFtei1FeHBpcmVzPTMwMCZYLUFtei1TaWduYXR1cmU9NDliNWFkNTdjMTcxODUwMTVkYzY1NjJiYzdiMDBiMDRjZmFiZjgzMDA4YzkwZTU1NDE4Mjc1ZWNmYzkxMzE4MyZYLUFtei1TaWduZWRIZWFkZXJzPWhvc3QmYWN0b3JfaWQ9MCZrZXlfaWQ9MCZyZXBvX2lkPTAifQ.ox05Y2u3iBzzCtel-loFlm1-WDRz-ui6tzv4T0th7VA) > > But how to...

@ver217 ``` pytest tests/test_checkpoint_io/test_torch_fsdp_checkpoint_io.py ================================================================= test session starts ================================================================== platform linux -- Python 3.10.11, pytest-8.0.0, pluggy-1.4.0 rootdir: ~/ColossalAI configfile: pytest.ini plugins: hypothesis-6.75.2, anyio-4.0.0 collected 1 item tests/test_checkpoint_io/test_torch_fsdp_checkpoint_io.py . [100%] ==================================================================...

I see the MR, that's awesome, when can we use it?