Some small problem
Hello, first of all, congratulations on having your work accepted by ICLR.
I'm also very interested in your work, but I have a few questions I hope the authors can address:
-
Where does the experience used in the simulation come from? In Figure 6 of the paper, a comparison is made with Memory libraries containing 5, 20, and 40 experiences, but in the project's Memory library, there are only 21 experiences. Why 21? Were 5 and 20 randomly selected from it during simulation? How was the scenario with 40 experiences completed? Do readers need to simulate it themselves and then use the reflection module to generate it?
-
Does the Memory library need to be updated in real time? From the project source code, it seems like you don't update the Memory library used in the simulation in real time, but instead write the experiences generated by the reflection module to another Memory library. In theory, doesn't this need real-time updating?
-
Diversity of experiences in the Memory library: If the Memory module doesn't need to be updated in real time, and a fixed-size Memory library is selected for simulation use, how do you ensure the diversity of experiences in the Memory library?