Edward Beeching comments

Results: 40 comments by Edward Beeching

Added CSharpAll variant of SimpleTestEnv

Thanks for the PR! Can you add some details about what feature/example this implements?

feat(macos builds): added macos build support

Hey, not yet. Hopefully tomorrow. Thanks for this.

feat(macos builds): added macos build support

Hi, I got around to trying this. Unfortunately, the macOS build templates don't work on Linux. I think we would have to include a build server as part of the...

Missing evaluation scripts?

Hi, thanks for raising this issue. Regarding logging, there should be a log file created in tmp/results/doom_rl/*. Thanks for pointing out about the plotting code, I will add this file...

Thanks for your question. I have added trained models for all scenarios in [saved_models](https://github.com/edbeeching/3d_control_deep_rl/tree/master/saved_models). Note these models were trained with ViZDoom version 1.1.4 (some textures changed in the more recent...

T-SNE Visualization (Fig-11)

I just realized I provided code for analyzing the attention distribution, not the t-SNE. I added the t-SNE code [here](https://github.com/edbeeching/3d_control_deep_rl/blob/master/visualization/hidden_state_tsne_analysis.py). Again, you will need to modify it a bit to get...

Using the ONNX output values

Why is zephyr-7b-dpo-lora fine-tuned from mistralai/Mistral-7B-v0.1 instead of the zephyr-7b-sft model?

How to perform full parameter finetuning without A100 GPUs