Stock-Trading-Environment
Stock-Trading-Environment copied to clipboard
how to understand the reward calculation
How to understand this? what is the exact purpose? thanks l lot
delay_modifier = (self.current_step / MAX_STEPS) reward = self.balance * delay_modifier
Hi, can you tell me about the pkg and lib versions? TIA @chris881