InternEvo icon indicating copy to clipboard operation
InternEvo copied to clipboard

[Feature] CPU synchronization Problem

Open yingtongxiong opened this issue 1 year ago • 0 comments

Describe the feature

Some CPU synchronizations block the GPU kernel, leading to bubbles between GPU kernels. It should be optimized in the future.

  1. item() in rotary embedding.
  2. moe_loss construction.

Will you implement it?

  • [ ] I would like to implement this feature and create a PR!

yingtongxiong avatar Mar 26 '24 02:03 yingtongxiong