openrl
Unified Reinforcement Learning Framework
### 🚀 Feature [Feature Request] selfplay support more than two players ### Motivation _No response_ ### Additional context _No response_ ### Checklist - [X] I have checked that there is...
### 📚 Documentation add introduction to OpenRL Wrappers ### Checklist - [X] I have checked that there is no similar [issues](https://github.com/OpenRL-Lab/openrl/issues) in the repo - [X] I have read the...
### 🚀 Feature [Feature Request] Add AWR algorithm ### Motivation _No response_ ### Additional context _No response_ ### Checklist - [X] I have checked that there is no similar [issues](https://github.com/OpenRL-Lab/openrl/issues)...
### 🚀 Feature add QMIX ### Motivation _No response_ ### Additional context _No response_ ### Checklist - [X] I have checked that there is no similar [issues](https://github.com/OpenRL-Lab/openrl/issues) in the repo...
### 🚀 Feature Add VDN algorithm, including vdn_net, vdn_module, etc. ### Motivation _No response_ ### Additional context _No response_ ### Checklist - [X] I have checked that there is no...
### 🚀 Feature - Add a CPU-count check to the make function. - If the user tries to allocate more environments than they have CPUs in asynchronous mode, raise the...
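The check requested above could look like the following sketch. The function name `check_env_num` and its signature are assumptions for illustration, not OpenRL's actual API; the idea is simply that asynchronous mode spawns one worker process per environment, so requesting more environments than CPU cores should fail loudly rather than silently oversubscribe.

```python
import multiprocessing


def check_env_num(env_num: int, asynchronous: bool = True) -> None:
    """Hypothetical guard for an env-creation `make` function.

    In asynchronous mode each environment runs in its own worker
    process, so more environments than CPU cores would oversubscribe
    the machine; raise instead of allowing that silently.
    """
    cpu_num = multiprocessing.cpu_count()
    if asynchronous and env_num > cpu_num:
        raise ValueError(
            f"Asynchronous mode requested {env_num} environments, "
            f"but only {cpu_num} CPUs are available. Reduce env_num "
            f"or use synchronous mode."
        )
```

In synchronous mode the check is skipped, since environments are stepped sequentially in a single process and no per-environment worker is spawned.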
### 🐛 Bug agent.save() is not well implemented. The saved file for the NLP task is too large. ### To Reproduce ```python from openrl import ... ``` ### Relevant log output...
### 📚 Documentation I'm rather confused about the use of wrappers and would like a tutorial explaining each individual wrapper. ### Checklist - [X] I have...
### ❓ Question How can I use multiple GPUs? ### Checklist - [x] I have checked that there is no similar [issues](https://github.com/OpenRL-Lab/openrl/issues) in the repo - [x] I have read the [documentation](https://openrl-docs.readthedocs.io/)
I'm thinking it should be possible to use the VLM as both the policy and the evaluator, just with different prompts. I'm trying to use Qwen2.5-VL-3B-Instruct as the basis to create an agent...
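The "same model, different prompts" idea above can be sketched without loading any model: the two roles differ only in the system instruction passed to the chat template. The template strings and the `build_messages` helper below are illustrative assumptions, not part of OpenRL or the Qwen API; the resulting message list is what one would feed to a chat-style VLM such as Qwen2.5-VL-3B-Instruct.

```python
# Hypothetical prompt builder: one VLM plays two roles depending on
# the system instruction it is given.
ROLE_PROMPTS = {
    # Policy role: map the current observation to the next action.
    "policy": (
        "You are an agent acting in an environment. "
        "Given the observation, reply with the single next action."
    ),
    # Evaluator role: score how well the agent is doing.
    "evaluator": (
        "You are a judge. Given the observation and the agent's "
        "recent actions, rate its progress from 0 to 10."
    ),
}


def build_messages(role: str, observation: str) -> list[dict]:
    """Build a chat-format message list for the requested role."""
    return [
        {"role": "system", "content": ROLE_PROMPTS[role]},
        {"role": "user", "content": observation},
    ]
```

The same underlying weights then act as policy or evaluator purely by swapping `role`, which keeps memory usage at a single model instance.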