octogen icon indicating copy to clipboard operation
octogen copied to clipboard

Desgin: The memory system for octogen

Open imotai opened this issue 8 months ago • 0 comments

Background and Why

Even GPT-4, a well-performing model, can generate outdated code due to outdated training data. However, LLMs have a strong language understanding ability, which can be used to correct these errors through prompts.

  1. To enable LLM to use command-line tools and libraries that are not covered by the training data. This includes two cases:
    • Tools and code repositories don't being included in the training data at all.
    • The model is trained on data from outdated tools or libraries, which means that the model cannot use them right
  2. To prevent LLM from repeating the same mistakes. When a LLM uses an incorrect code usage or tool, it will always repeat the incorrect code and then recall the correct result through long-term memory before executing the code. However, this can be improved by using short-term memory to store the correct tool or code usage in the instructions.

Desgin

image

Other Memory Desgin

  1. https://arxiv.org/pdf/2310.08560.pdf this paper can provide some data proof for the desgin
  2. https://arxiv.org/abs/1909.09436

imotai avatar Oct 19 '23 05:10 imotai