libranet
libranet copied to clipboard
what is the start_ind_random and the end_mask_random?
if recycle_ind < parameters['ACTION_NUMBER'] - 1: start_mask_random = ( (count_rem + net.A[recycle_ind] >= 0) & (start_ind_random == -1) ) start_ind_random[start_mask_random] = recycle_ind
**end_mask_random** = ( count_rem + net.A[recycle_ind] < parameters['Interval_N'] )
**end_ind_random**[end_mask_random] = recycle_ind
maskselect_end = (sort[recycle_ind]==parameters['ACTION_NUMBER']-1)
action_sort = sort[recycle_ind]
A_sort = np.squeeze(net.A_mat[action_sort])
_ind_max = (( (count_rem + A_sort < parameters['Interval_N']) & (count_rem + A_sort >= 0) | maskselect_end) & (mask_max_find==0) ) & (mask_last==0)
action_max[_ind_max] = action_max[_ind_max] + sort[recycle_ind] [_ind_max] #?
mask_max_find = mask_max_find + ( (count_rem + A_sort < parameters['Interval_N']) & (count_rem + A_sort >= 0) | maskselect_end ).astype(np.int8)
action_random = (start_ind_random + (end_ind_random + 2 - start_ind_random ) * np.random.rand(h, w)).astype(np.int8)
Why don't we generate action_random at random, but start_ ind_ Random and end_ ind_ random?