Katsuki Ohto

35 issue results from Katsuki Ohto

Combining #245 and #246, it seems natural to implement it like this.

…urn players. There is a case (training vs other agents) where, as a result, m['policy'][m['turn'][0]] is None.
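A minimal sketch of guarding against that case when assembling training data; the `moments` list and its structure mirror the snippet above, and all names here are assumptions for illustration rather than the repository's actual code:

```
# Sketch: skip moments where the turn player's policy output is missing,
# e.g. because that step was played by another (non-training) agent.
usable_moments = [
    m for m in moments
    if m['policy'][m['turn'][0]] is not None
]
```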

For training that involves steps where no value output exists, lambda needs to be set to 1 locally at those steps. I am not sure whether V-Trace is correct in this case.
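A minimal sketch of what "set lambda to 1 locally" could mean when computing lambda-returns; the function, argument names, and the per-step `has_value` mask are assumptions for illustration, not the repository's actual training code:

```
def lambda_returns(rewards, values, has_value, gamma=1.0, lam=0.7):
    # rewards[t]:   reward observed at step t
    # values[t]:    value estimate V(s_t); only meaningful where has_value[t] is True
    # has_value[t]: whether the model actually output a value at step t
    T = len(rewards)
    returns = [0.0] * T
    next_return = 0.0   # lambda-return of the following step; 0 past the end
    next_value = 0.0    # V(s_{t+1}); 0 past the end
    for t in reversed(range(T)):
        # G_t = r_t + gamma * ((1 - lam) * V(s_{t+1}) + lam * G_{t+1})
        returns[t] = rewards[t] + gamma * ((1.0 - lam) * next_value + lam * next_return)
        next_return = returns[t]
        if has_value[t]:
            next_value = values[t]
        else:
            # No value output at this step: fall back to lambda = 1 locally,
            # i.e. bootstrap from the return itself instead of the missing value.
            next_value = next_return
    return returns
```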

I have never dealt with the `fineno()` interface. Is it useful?

Do we delete OUTCOME, or use OUTCOME as the first dimension of REWARD if it is defined?
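A tiny sketch of the second option, with OUTCOME folded in as the first component of REWARD; the names and layout are assumptions for illustration, not the actual specification:

```
# Sketch: OUTCOME becomes the first component of REWARD.
outcome = 1.0                       # win/draw/loss signal, formerly OUTCOME
step_rewards = [0.0, 0.1]           # any additional dense rewards
reward = [outcome] + step_rewards   # REWARD whose first dimension is the outcome
```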

Do we have to report not only terminal outcomes but also total rewards?

Is `prepare_env()` really necessary?

Turn-based batch creation and zero-sum averaging are different, independent options. Moreover, both should default to False for safety.
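A small sketch of how the two options could be exposed, both off by default; the key names are assumptions, not the actual configuration schema:

```
# Sketch: two independent flags, both defaulting to False for safety.
train_args = {
    'turn_based_batch': False,   # build batches only from the turn player's steps
    'zero_sum_average': False,   # subtract the cross-player mean from rewards
}
```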

Example:

```
# SimpleConv2dModel2, Learner, and args are assumed to be defined elsewhere in the project.
from torch.optim import Adam

net = SimpleConv2dModel2()
optim = Adam(net.parameters(), lr=1e-4)
learner = Learner(args=args, net=net, optim=optim, remote=False)
learner.run()
```

This PR requires #170 (return a model instance by calling net()).