ColossalAI icon indicating copy to clipboard operation
ColossalAI copied to clipboard

[checkpointio]support distributed checkpoint io for model saving.

Open flybird11111 opened this issue 11 months ago • 3 comments

📌 Checklist before creating the PR

  • [ ] I have created an issue for this PR for traceability
  • [ ] The title follows the standard format: [doc/gemini/tensor/...]: A concise description
  • [ ] I have added relevant tags if possible for us to better distinguish different PRs
  • [ ] I have installed pre-commit: pip install pre-commit && pre-commit install

🚨 Issue number

Link this PR to your issue with words like fixed to automatically close the linked issue upon merge

e.g. fixed #1234, closed #1234, resolved #1234

📝 What does this PR do?

Summarize your work here. if you have any plots/diagrams/screenshots/tables, please attach them here.

💥 Checklist before requesting a review

  • [ ] I have linked my PR to an issue (instruction)
  • [ ] My issue clearly describes the problem/feature/proposal, with diagrams/charts/table/code if possible
  • [ ] I have performed a self-review of my code
  • [ ] I have added thorough tests.
  • [ ] I have added docstrings for all the functions/methods I implemented

⭐️ Do you enjoy contributing to Colossal-AI?

  • [ ] 🌝 Yes, I do.
  • [ ] 🌚 No, I don't.

Tell us more if you don't enjoy contributing to Colossal-AI.

flybird11111 avatar Jan 16 '25 10:01 flybird11111

hi all, take a look at this please. This bug is quite annoying for me.

https://github.com/hpcaitech/ColossalAI/pull/6168

Lemon-412 avatar Jan 18 '25 06:01 Lemon-412

hi all, take a look at this please. This bug is quite annoying for me.

#6168

ok

flybird11111 avatar Jan 20 '25 03:01 flybird11111

DON'T merge to main. Create a new feature branch on the org repo and merge to it.

ver217 avatar Jan 20 '25 08:01 ver217