softlearning
softlearning copied to clipboard
SQL algorithm is not working
Seems there are some error with SQL algorithm. I can run SAC but not SQL
ERROR trial_runner.py:567 -- Trial id=a3651_00000-seed=1911: Error processing event.
Traceback (most recent call last):
File "/opt/rh/rh-python36/root/usr/lib/python3.6/site-packages/ray/tune/trial_runner.py", line 515, in _process_trial
result = self.trial_executor.fetch_result(trial)
File "/opt/rh/rh-python36/root/usr/lib/python3.6/site-packages/ray/tune/ray_trial_executor.py", line 488, in fetch_result
result = ray.get(trial_future[0], timeout=DEFAULT_GET_TIMEOUT)
File "/opt/rh/rh-python36/root/usr/lib/python3.6/site-packages/ray/worker.py", line 1428, in get
raise value.as_instanceof_cause()
ray.exceptions.RayTaskError(TypeError): ray::ExperimentRunner.train() (pid=24394, ip=10.22.134.202)
File "python/ray/_raylet.pyx", line 484, in ray._raylet.execute_task
File "python/ray/_raylet.pyx", line 438, in ray._raylet.execute_task.function_executor
File "/opt/rh/rh-python36/root/usr/lib/python3.6/site-packages/ray/tune/trainable.py", line 336, in train
result = self.step()
File "/home/app/softlearning/examples/development/main.py", line 101, in step
diagnostics = next(self.train_generator)
File "/home/app/softlearning/softlearning/algorithms/rl_algorithm.py", line 182, in _train
timestep=self._total_timestep)
File "/home/app/softlearning/softlearning/algorithms/rl_algorithm.py", line 346, in _do_training_repeats
for i in range(self._n_train_repeat)
File "/home/app/softlearning/softlearning/algorithms/rl_algorithm.py", line 346, in
/home/app/softlearning/softlearning/algorithms/sql.py:338 _do_updates *
Qs_values, Qs_losses = self._update_Q(
/home/app/softlearning/softlearning/algorithms/sql.py:205 _update_Q *
Q_observations = {
TypeError: 'NoneType' object is not iterable
Hey @ivan-ji-walmart, thanks for opening this issue! Sorry about this. Some of my recent changes must have broken the SQL implementation and has left unnoticed since I haven't been using SQL in a long time. I'll try to get this fixed in the upcoming weeks, but in the meanwhile, I'd highly recommend using the official reference implementation found at https://github.com/haarnoja/softqlearning.