Ian Lin comments

Results 41 comments of


                                            Ian Lin

Exp4.P cannot handle delay reward

linucb is also not, but we modify it to make it do it...

Exp4.P cannot handle delay reward

The current implementation of exp3 also doesn't support delay reward

Exp4.P cannot handle delay reward

#108 is the solution for exp3

default p_min

How do we define T?

default p_min

I think we should fix N. What happens if we have more than T rounds?

default p_min

I think the actions and experts should both be fixed... I don't think Exp4.P can handle changes of actions and experts reasonably... This is a big change, any idea? @yangarbiter...

default p_min

After retraining the experts, I don't think the weight can still work, and the new weight of a new action is also a problem.

Problems running the examples

@taweihuang

probability normalization in exp4.p

I think this only transform `query_vector` to be `np.ndarray`?

probability normalization in exp4.p

It transform `query_vector` to `ndarray` because every values in `query_vector` is `np.float64`, so that division make `query_vector` to be `ndarray`. It's quite unexpected.