Beilei
Beilei
Thanks for your reply. Sorry, I am not using git, but I think I can if necessary. While, I have to make sure that the optimization point I have proposed...
Thanks. According to the DramSim2, before any READ/WRITE commands, an ACTIVE command will be handled. So, when the ACTIVE is handled, the lastOpenRowAddress will be updated: ``` void Rank::receiveFromBus(BusPacket *packet)...
Thanks. The "old transactions" ? Sorry, but I can not understand. ``` //memory controller update void MemoryController::update(){ //now that we know there is room in the command queue, we can...
Thanks ``` if(packet->row == bankStates[packet->bank].lastOpenRowAddress) ``` This line will not always be true. After the ACTIVE command was processed, the lastOpenRowAddress had been set to the row address latest opened:...
Did you notice this test [MPS or MLX for Domestic AI? ](https://medium.com/@koypish/mps-or-mlx-for-domestic-ai-the-answer-will-surprise-you-df4b111de8a0) ? Did MLX use any features that torch.mps don't use to boost the performance? @awni
> @SunnyBeike In my experiments with small (image) CNNs, see above, I am not seeing different GPU frequencies according to `asitop`. For both torch and mlx, I am getting near...