lingjiew93

Results 3 issues of lingjiew93

I'm running LLM training with MI250. The instruction and code I used are https://www.mosaicml.com/blog/amd-mi250 and https://github.com/mosaicml/llm-foundry It runs well without profiling, but when I tried to profile below errors are...

Hi, I found that TCC_READ_sum is only half about the real size which from MI100 result. The same as TCC_HIT_sum. They both have 64 bytes cacheline size so the number...

Under Investigation