Stock-Trading-Environment icon indicating copy to clipboard operation
Stock-Trading-Environment copied to clipboard

how to understand the reward calculation

Open chris881 opened this issue 3 years ago • 1 comments

How to understand this? what is the exact purpose? thanks l lot

delay_modifier = (self.current_step / MAX_STEPS) reward = self.balance * delay_modifier

chris881 avatar Sep 07 '21 16:09 chris881

Hi, can you tell me about the pkg and lib versions? TIA @chris881

TashinAhmed avatar Sep 28 '21 09:09 TashinAhmed