Max Tian

6 comments by Max Tian

I am trying to implement something that would allow new observations in the live trading environment. The process would be (within the environment): 1. Grab live data 2. Append it to...

[Here](https://gist.github.com/maxmatical/49c32c96c24635243cccfc3164083767) is a gist using the `BatchLossFilter` callback with the standard blurr training script. I'm currently experimenting with `BatchLossFilter` since it had some traction on twitter a while back, plus...

Is this only the case with a softmax output? So I would do shap_values[0] for class 1, shap_values[2] for class 2, and so on?

@tjruwase just want to follow up on using the `moving_average` to create an EMA model 1. does the EMA model also have to have the same zero config as the...

hi @tjruwase i believe there may be some misunderstanding about my questions 1. my question is more on whether it is required to call `deepspeed.initialize` on the EMA model (as...

thinking more about it, i can see maybe some concerns with initializing the `ema_model` with `copy.deepcopy` the scenario i'm thinking of is if i initialize `ema_model` and keep it on...
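
For readers unfamiliar with the pattern discussed in the last three comments, here is a minimal sketch of creating an EMA copy with `copy.deepcopy` and updating it. It deliberately uses plain Python dicts of floats rather than DeepSpeed or PyTorch objects, and `ema_update` is a hypothetical helper written for illustration, not the `moving_average` function referenced above. It says nothing about ZeRO configs or device placement, which are the open questions in the thread.

```python
import copy


def ema_update(ema_params, params, beta=0.99):
    """Hypothetical helper: blend the current params into the EMA copy in place.

    ema <- beta * ema + (1 - beta) * current
    """
    for name, value in params.items():
        ema_params[name] = beta * ema_params[name] + (1.0 - beta) * value


# Stand-in for model weights (an assumption; real models hold tensors).
params = {"w": 1.0, "b": 0.0}

# deepcopy gives the EMA copy its own storage, so later training
# updates to `params` do not silently change `ema_params`.
ema_params = copy.deepcopy(params)

# Pretend one training step changed a weight, then update the EMA.
params["w"] = 2.0
ema_update(ema_params, params, beta=0.5)
```

The point of the deep copy is that the two sets of weights are independent: only the explicit `ema_update` call moves the EMA values toward the current ones.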