Sebastian Bodza comments

Results 8 comments of


                                            Sebastian Bodza

AssertionError while running train.py with multiagent_traffic_light_grid.py by replacing PPO to SAC.

I am getting the same error when using my custom environment, where i predict some values with a trained tensorflow model. The Catpole environment does not lead to the error....

feat: get video_id from thumbnail if it's not available

Bugfix working as of 19.12.2021! Thanks 👍

[BUG] [0.8.1] INT8 model loading/inference issue

> Also wanted to point out that when using `torch.int8` in `deepspeed.init_inference(model, dtype=torch.int8, replace_with_kernel_inject=True)`, [this code line](https://github.com/microsoft/DeepSpeed/blob/4de4d2acc6077707c806a8abbc26fba1e2779745/deepspeed/inference/engine.py#L489) is called which skips running `WeightQuantization(...).model_quantize(...)` and I am not sure if this...

[air] Deprecate MlflowTrainableMixin, move to setup_mlflow() function

Working with MLFow Projects in combination with ray is quite buggy. When running a Mlflow project, a mlflow session will be started. Then ray will put all mlflow logs into...

[air] Deprecate MlflowTrainableMixin, move to setup_mlflow() function

Ok, this seems to be an issue of the following: > The question is, should we use a config-API setup_mlflow(config) or and explicit API setup_mlflow(experiment_name, experiment_id, tracking_uri, …) Currently the...

nucleus sampler problem?

The problem seems to come from repetition_penalty. Compared to other implementations the outcome differy quite alot. The implementation however seems to be fine, at least for the penalty itself.

nucleus sampler problem?

Some more information is in the following git project issue: https://github.com/the-crypt-keeper/can-ai-code/issues/75 To reproduce the whole process: 1. git clone the-crypt-keeper/can-ai-code 2. Create the prompts with `prepare.py --template prompts/Wizard-Coder.txt` 3. Create...

Support peft's LoRa for HF transformer models.

Any plan to support loras directly? Would be great to switch between loras :)