stanford_alpaca
stanford_alpaca copied to clipboard
train using 2 nodes is slower than 1 node
when I use two A100 nodes, each node is (80GX8). I found two nodes train is slower than one node. I use torchrun xxx. can any one meet this?
I am getting this error https://github.com/tatsu-lab/stanford_alpaca/issues/189#issue-1658173995 using single node. any idea whats the problem?
I am sorry. I use the default params and don't meet it.