burlap
burlap copied to clipboard
Repository for the ongoing development of the Brown-UMBC Reinforcement Learning And Planning (BURLAP) java library
https://github.com/jmacglashan/burlap/blob/cc3b09de81f33e6bed8154144a3183dee8595e32/src/main/java/burlap/domain/singleagent/blockdude/BlockDudeModel.java#L138 Because `dir` can be +1 and `ax` can be the `map.length - 1`, the agent can try to move outside the bounds of the map array. To reproduce, run...
When BlockDude domain with level 1 or level 2 is run for value iteration, it's throwing array index out of bounds exception. After debugging, i figured out that in moveUp...
In the [burlap_examples](https://github.com/jmacglashan/burlap_examples) Repository, the [GridGameExample](https://github.com/jmacglashan/burlap_examples/blob/master/src/main/java/edu/brown/cs/burlap/examples/GridGameExample.java) does not work with a 64-bit Java Runtime Environment under Windows x64. You get the following error: > Exception in thread "main" java.lang.NullPointerException >...
I'm experimenting with policyIteration params to see the differences between policyIteration and valueIteration. While tuning the params in my model, I'm noticing that sometimes policyIteration will end before a policy...
I see that the [PolicyIteration](http://burlap.cs.brown.edu/doc/burlap/behavior/singleagent/planning/stochastic/policyiteration/PolicyIteration.html) class has a [getTotalPolicyIterations()](http://burlap.cs.brown.edu/doc/burlap/behavior/singleagent/planning/stochastic/policyiteration/PolicyIteration.html#getTotalPolicyIterations--) method, allowing you to see how many iterations it took to converge. The [ValueIteration](http://burlap.cs.brown.edu/doc/burlap/behavior/singleagent/planning/stochastic/valueiteration/ValueIteration.html) class, however, does not seem to have...
I discovered that the `Graphics2D` context passed to a `StatePainter` is broken: updates don't appear correctly on-screen. The following is a comparison of the `Visualizer` view and the same context...
Is this suppose to be like that? I thought that I need terminal state to end each episode, is it for ending the whole thing? I'm working with ros btw...
The number of states reachable from the given states differs when using PolicyIteration and ValueIteration. When PolicyIteration and ValueIteration were run on a graph defined MDP of 100 states, ValueIteration...
Fixes an ArrayIndexOutOfBoundsException when block dude is at the edge of a map facing the edge.