Topic-Seg-Label
Topic-Seg-Label copied to clipboard
Delayed Reward
The delayed reward defined by def HSMI(batch_obs, action_list): line 12 of pn_tool seems to only contain the element that encourages lower similarity between adjacent segments. Unless I am mistaken, compared the equation presented in the paper it seems the element encouraging higher similarity within a segment is missing.