Viral Bhadeshiya
Viral Bhadeshiya
@miscco I like to work on this if possible.
> > > This seems very similar to [#183](https://github.com/deepseek-ai/DeepEP/issues/183). We haven’t encountered this problem, but my guess is that it might be caused by misaligned kernel launch times. You can...
> [@viralbhadeshiya](https://github.com/viralbhadeshiya) def bench_kineto(fn, kernel_names: Union[str, tuple], num_tests: int = 30, suppress_kineto_output: bool = False, trace_path: Optional[str] = None, barrier_comm_profiling: bool = False, num_kernels_per_period: int = 1): # Profile suppress...