L2D icon indicating copy to clipboard operation
L2D copied to clipboard

Official implementation of paper "Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning"

Results 6 L2D issues
Sort by recently updated
recently updated
newest added

Hello! I noticed that the code did not include PDRS related codes such as SPT , MOPNR mentioned in the experiment part of the paper. Is that so?

https://github.com/zcaicaros/L2D/blob/7b2efbb1ffc960260b16952f2bed68e500765bf0/permissibleLS.py#L13

Why your models trained on small instances can be tested on bigger instances, how to solve the difference of size. Why each operation of your instance need different machines, no...

Dear authors, I would like to know how to load instances that are not present in your BenchDataNmpy folder. Which format do you accept? How do you generate the .npy...

尊敬的作者: 您好! 我阅读了您的代码,对于permissinleLS.py中的函数,我有一些疑问,恳请您的解答: 1.mch_a = np.take(mchMat, a) - 1 这一语句的含义。在该模块中,机器数减一的语句大量出现,并且相关变量常出现在矩阵索引的位置,例如 opIDsOnMchs[mch_a][np.where(opIDsOnMchs[mch_a] >= 0)][-1]。我不太能理解其含义,在我的理解下,np.take(mchMat, a)提取了某一个action对应的机器编号,似乎和行索引没有什么关联。 2.变量opIDsOnMchs的含义。 3.对于action和Job工序中顺序约束的理解。通篇阅读后,我认为您对action的编码方式是 J * M,例如15 * 15的数据中共有编号为0-224的225个action。然而permissinleLS.py测试代码的输出结果print('opIDsOnMchs\n', opIDsOnMchs) 动摇了我的理解。例如在某一次完整的迭代中最终opIDsOnMchs输出为[[3 1 8] [0 4 6] [2 5...

I'm a bit confused about the PPO update process. In the line 110: ![Screenshot from 2024-06-06 11-21-26](https://github.com/zcaicaros/L2D/assets/71386827/8a9ed211-bd73-4ef9-8178-c50ea4fed5b0) The rewards in a single episode ​​are normalized by subtracting the mean while...