nash_q_learning icon indicating copy to clipboard operation
nash_q_learning copied to clipboard

Error when using nashpy calculation in replacement environment

Open yikeqingli opened this issue 5 years ago • 2 comments

When I replace the matrix environment with the small environment I wrote, I use nashpy to calculate the length of pi, and the length of pi often changes. Sometimes there is a case where the length is zero and an error is reported. What does this mean? Does it mean that there are multiple Nash equilibrium points, how should I modify it? Thank you.

yikeqingli avatar Apr 24 '20 02:04 yikeqingli

[(array([0., 0., 1., 0., 0.]), array([0., 0., 0., 0., 1.])), (array([0.67208459, 0. , 0.32791541, 0. , 0. ]), array([0. , 0. , 0.51307566, 0.48692434, 0. ])), (array([0.51880328, 0. , 0. , 0. , 0.48119672]), array([0. , 0. , 0.11574906, 0.88425094, 0. ]))] 3 This is the calculated pi and its length at a certain occurrence [[0.6559585633182168, 0.497509580230747, 0.4356962170942844, 0.7459217987761908, 0.4440960550343467], [0.594101906637517, 0.14389364237289126, 0.5644707360506582, 0.38696744948674355, 0.5979444623727973], [0.6066314076499245, 0.24148246706700294, 0.5468899220410627, 0.628756196088522, 0.8159105990196025], [0.24635874151293857, 0.6678868022812967, 0.22636373821599765, 0.1195441767231642, 0.5628103686521754], [0.7781361689219078, 0.7911891715865836, 0.025662569218334985, 0.7995954864741698, 0.04006035719713352]] 5 This is the q_1 used by pi and its length

yikeqingli avatar Apr 24 '20 02:04 yikeqingli

Sorry for not replying sooner @yikeqingli . And thanks for letting me know the issue. Currently I added multi state environment to this repository and confirmed this issue.Please wait a little longer

tocom242242 avatar Nov 23 '20 08:11 tocom242242