CoderM8n
I refer to the following environment class:
class StockTradingEnv(gym.Env):
    def __init__(
        self,
        config,
        initial_account=1e6,
        gamma=0.99,
        turbulence_thresh=99,
        min_stock_rate=0.1,
        max_stock=1e2,
        initial_capital=1e6,
        buy_cost_pct=1e-3,
        sell_cost_pct=1e-3,
        reward_scaling=2**-11,
        initial_stocks=None,
    ):
        price_ary = config["price_array"]
        tech_ary = config["tech_array"]...
Why is the state space for the StockTradingEnv a 291-dimensional space? How is the state_space derived? What is the rationale? stock_dimension = len(train.tic.unique()) state_space = 1 + 2*stock_dimension +...
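To make the question concrete, here is a minimal sketch of one layout that would add up to 291. The third term of the formula is cut off in the post above, so the per-stock indicator term below is an assumption, as are the helper name `state_space_size` and the example values of 29 tickers and 8 indicators; none of them are taken from the environment's source.

```python
def state_space_size(stock_dimension: int, indicators_per_stock: int) -> int:
    """Count state entries under an ASSUMED layout (hypothetical helper)."""
    cash = 1                      # the leading "1": one account-balance entry
    prices = stock_dimension      # first half of "2*stock_dimension": one price per ticker
    holdings = stock_dimension    # second half: one share count per ticker
    # ASSUMPTION: the truncated term is a fixed number of indicators per ticker.
    indicators = indicators_per_stock * stock_dimension
    return cash + prices + holdings + indicators


# Hypothetical example values: 29 tickers, 8 indicators each.
size = state_space_size(stock_dimension=29, indicators_per_stock=8)
print(size)
```

If the actual environment uses a different third term, only the `indicators` line would need to change.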
I notice that you have two stock trading environments: (1) stocktrading and (2) stocktrading_np. What is the difference between them?
Where are the agents stored? In mmcv util? I cannot find mmcv util; could you please help me? Is the PPO agent used in the tutorial from Stable Baselines or RLlib?