fourcake
fourcake
你好,我想请问DKT数据集里每个学生可能做多个同一知识点的题目,那这时输入该怎么组织呢,是每道题目组织一个ONE-HOT向量还是以学生为单位,每个学生组织一个ONE-hot向量,若是后者的话,怎么处理重复出现的知识点这一问题呢
I encountered the same problem. On the dolly dataset, each time a client trained, the video memory usage increased by about 2.5GB, resulting in my 24GB 4090 machine only being...
Algorithm 1 in the appendix of your article indicates that the client calculates the latest model by accumulating (seed-gradient) values at the beginning of each round of training. However, in...
However, when I set the number of clients to 1 and the sampling rate to 1, the memory usage does not fluctuate much regardless of how many rounds of training...
> > I encountered the same problem. On the dolly dataset, each time a client trained, the video memory usage increased by about 2.5GB, resulting in my 24GB 4090 machine...
By chance, I ran the code on another 4-GPU 4090 machine, and I found that the video memory problem disappeared, the garbage video memory was recycled in time, and the...
Thank you for your answer. So in client-level DP, although the noise scale is calculated in a similar way, it is valid to directly clip and add noise to the...