DevKiD comments

Results: 21 comments by DevKiD

adb connect fail

Did you restart the services?

adb connect fail

Can I have the full log?

[HARD] Support arbitrary tensor parallel splits

I would also add a small benchmark test to decide how to split the model among the devices, instead of splitting it equally, which is how I understand it works now. I would like to...
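One way to picture a benchmark-driven split is below. This is only a sketch under assumptions: the function name `split_by_throughput` and the throughput numbers are hypothetical and do not come from the project. It divides a number of units in proportion to each device's measured throughput and gives leftover units to the largest fractional remainders.

```python
def split_by_throughput(total_units: int, throughputs: list[float]) -> list[int]:
    """Split total_units across devices in proportion to measured throughput.

    Hypothetical helper for illustration; throughputs would come from a
    small benchmark run on each device.
    """
    total = sum(throughputs)
    # Ideal (fractional) share for each device.
    exact = [total_units * t / total for t in throughputs]
    # Round down first, then hand out the units lost to rounding.
    shares = [int(x) for x in exact]
    leftover = total_units - sum(shares)
    # Devices with the largest fractional remainder get the extra units.
    order = sorted(range(len(exact)), key=lambda i: exact[i] - shares[i], reverse=True)
    for i in order[:leftover]:
        shares[i] += 1
    return shares


# Assumed benchmark result: the first device is twice as fast as the others.
print(split_by_throughput(82, [2.0, 1.0, 1.0]))  # [41, 21, 20]
```

The shares always sum to the original total, so no unit is dropped or duplicated.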

[HARD] Support arbitrary tensor parallel splits

Then I don't see a problem. I probably misunderstood something. With TP, a layer or multiple layers are split across devices. But how are they split across those devices? I...

[HARD] Support arbitrary tensor parallel splits

Is there also a hybrid approach, where powerful devices take whole layers and less powerful devices share layers via TP?

[HARD] Support arbitrary tensor parallel splits

With pipeline parallelism and a 4 GB model, I see RAM usage rise by 4 GB, which is strange. Why is it not about 1.3 GB per device across 3 devices?

[HARD] Support arbitrary tensor parallel splits

I will create a recording tomorrow. If it happens again I will upload.

[HARD] Support arbitrary tensor parallel splits

Back to the original. Suppose a model block has 82 neurons per layer; then with 3 devices, each device should get about 27, with one getting 28. To make this easy we...
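The arithmetic above can be sketched as follows. The function name `split_evenly` is hypothetical and only illustrates the even split with remainder; it is not taken from the project's code.

```python
def split_evenly(total_units: int, num_devices: int) -> list[int]:
    """Split total_units as evenly as possible across num_devices."""
    base, extra = divmod(total_units, num_devices)
    # The first `extra` devices take one additional unit each.
    return [base + 1 if i < extra else base for i in range(num_devices)]


shards = split_evenly(82, 3)
print(shards)       # [28, 27, 27]
print(sum(shards))  # 82
```

Which device receives the extra unit is arbitrary here; this sketch simply gives it to the first one.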

[HARD] Support arbitrary tensor parallel splits

I still don't understand the problem. @rltakashige, can you create a recording so that others and I can understand?

[HARD] Support arbitrary tensor parallel splits

I made an error in my reasoning. An unbalanced load can lead to performance loss and a bottleneck. The smart padding might be a good idea. But instead of adding padding we can compute...