TransformerPrograms
TransformerPrograms copied to clipboard
[NeurIPS 2023] Learning Transformer Programs
Hello, Thanks for your work. I attempted the in-context learning training command from the experiment details, but encountered a 'loss is NaN' error. Could you share the command you used?...
LlaMa
Hi, Thanks for your work. I would like to know if TransformerPrograms can be used for LlaMa and other LLMs.
Hi, I came across this project while working with Differentiable Logic Gate Networks (difflogic) by Felix Petersen, which was one of the works cited in your paper. I am interested...