Inverse-Reinforcement-Learning
Implementations of selected inverse reinforcement learning algorithms.
Hi Matthew! This repo is just great: it works, and it's transparent and modular! I only found two differences between Ziebart's thesis and your implementation. Can you let me know if...
Hello! I am a graduate student from China. In your demo the feature_matrix is not provided, so given only the state space, how do I obtain the feature_matrix? I would be grateful if you could answer. Apart...
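A minimal sketch of the simplest answer, assuming only a discrete state space of size `n_states` is available: use a one-hot (indicator) feature per state, so the feature matrix has one row per state.

```python
# A minimal sketch, assuming a discrete state space of size n_states
# (e.g. a 5x5 gridworld); not taken from the repo's own demo code.
import numpy as np

n_states = 25
feature_matrix = np.eye(n_states)  # row s is the feature vector of state s
```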
I am currently running the code and am getting an error: I reshaped the feature matrix with reshaped_to_2d_reshape to (num_of_states, num_of_dimensions) and am still getting this error. I am not sure...
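For reference, a small sketch of the shape convention being discussed (illustrative only, not the code from the issue): the feature matrix is expected to be 2-d with shape (num_of_states, num_of_dimensions).

```python
# Illustrative reshape to the expected (num_of_states, num_of_dimensions) layout.
import numpy as np

grid_features = np.random.rand(5, 5, 3)        # (rows, cols, feature dimension)
feature_matrix = grid_features.reshape(-1, 3)  # -> (25, 3)
assert feature_matrix.shape == (25, 3)
```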
I need help trying to remove this warning: WARNING (theano.configdefaults): g++ not available, if using conda: `conda install m2w64-toolchain` C:\Users\Sankalp Chauhan\AppData\Local\Programs\Python\Python37\lib\site-packages\theano\configdefaults.py:560: UserWarning: DeprecationWarning: there is no c++ compiler. This is...
Could you list some references that explain how you formulated the block matrix form of the linear program for solving large...
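For context, a sketch of how the linear-programming IRL of Ng & Russell ("Algorithms for Inverse Reinforcement Learning", ICML 2000) can be stacked into one block matrix for a generic LP solver. The variable names (`transition_probability`, `policy`, `discount`, `Rmax`, `l1`) and the exact block layout are assumptions for illustration and may differ from the repository's own linear-programming code.

```python
# Sketch: stack the Ng & Russell LP-IRL constraints into block-matrix form
# for scipy's generic LP solver. Decision vector x = [R, t, u], where t_s
# stands in for the per-state min term and u bounds |R| for the L1 penalty.
import numpy as np
from scipy.optimize import linprog


def lp_irl_sketch(transition_probability, policy, discount, Rmax=1.0, l1=1.05):
    """transition_probability: (n_states, n_actions, n_states); policy: optimal action per state."""
    n_states, n_actions, _ = transition_probability.shape

    # P*: row s is the transition distribution under the optimal action in s.
    P_star = np.array([transition_probability[s, policy[s]] for s in range(n_states)])
    inv_term = np.linalg.inv(np.eye(n_states) - discount * P_star)

    eye = np.eye(n_states)
    zeros = np.zeros((n_states, n_states))
    A_rows = []

    for s in range(n_states):
        for a in range(n_actions):
            if a == policy[s]:
                continue
            # d . R = (P_{a*}(s,:) - P_a(s,:)) (I - gamma P*)^{-1} R
            d = (P_star[s] - transition_probability[s, a]) @ inv_term
            e_s = np.zeros(n_states)
            e_s[s] = 1.0
            # t_s <= d . R   written as   -d.R + t_s <= 0
            A_rows.append(np.concatenate([-d, e_s, np.zeros(n_states)]))
            # d . R >= 0     written as   -d.R <= 0
            A_rows.append(np.concatenate([-d, np.zeros(2 * n_states)]))

    # |R| <= u as two blocks:  R - u <= 0  and  -R - u <= 0.
    A_rows.append(np.hstack([eye, zeros, -eye]))
    A_rows.append(np.hstack([-eye, zeros, -eye]))

    A_ub = np.vstack(A_rows)
    b_ub = np.zeros(A_ub.shape[0])

    # Maximise 1.t - l1 * 1.u  ==  minimise -1.t + l1 * 1.u.
    c = np.concatenate([np.zeros(n_states), -np.ones(n_states), l1 * np.ones(n_states)])
    bounds = ([(-Rmax, Rmax)] * n_states    # R bounded by Rmax
              + [(None, None)] * n_states   # t free
              + [(0, None)] * n_states)     # u >= 0

    result = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds)
    return result.x[:n_states]              # recovered reward vector
```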
Can you give some pointers on how you designed the feature matrix? You kept it as 25*25 (in the case of Gridworld), where each state is represented separately. According to my...
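For context, a hedged sketch contrasting the two common choices for a 5x5 gridworld (the names are illustrative, not the repo's API): a 25x25 one-hot matrix that represents every state separately, versus a low-dimensional coordinate encoding. The one-hot form can represent any tabular reward but cannot generalise across states; coordinate features are smaller but constrain the reward to be a function of the coordinates.

```python
# Two illustrative feature designs for a 5x5 gridworld.
import numpy as np

n = 5
one_hot_features = np.eye(n * n)                        # shape (25, 25): one state per row
coord_features = np.array([(s % n, s // n)              # shape (25, 2): (column, row) per state
                           for s in range(n * n)], dtype=float)
```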
I was wondering why, in irl/maxent.py line 71, you unpack three things from a 2-d array; I think this is a small error?
In the backward pass of MaxEnt (Algorithm 9.1 in Ziebart's thesis), MaxEnt uses a softmax calculation to update the `V` function (the soft value function), but maxent.py seems to call value_iteration.optimal_value, which...
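For reference, a hedged sketch of the softmax backward pass the issue refers to (soft value iteration): the hard max over actions is replaced by a log-sum-exp. The argument names and the stationary, discounted form used here are assumptions, not the repository's code.

```python
# Sketch of soft value iteration: V(s) = logsumexp_a Q(s, a) instead of max_a Q(s, a).
import numpy as np
from scipy.special import logsumexp


def soft_value_iteration(transition_probability, reward, discount, n_iterations=100):
    """transition_probability: (n_states, n_actions, n_states); reward: (n_states,)."""
    n_states, n_actions, _ = transition_probability.shape
    V = np.zeros(n_states)
    for _ in range(n_iterations):
        # Q(s, a) = R(s) + gamma * sum_s' P(s' | s, a) V(s')
        Q = reward[:, None] + discount * transition_probability @ V
        # Soft backup over actions (the "softmax" in the backward pass).
        V = logsumexp(Q, axis=1)
    return V
```

The contrast with value_iteration.optimal_value is exactly this backup: the hard `max` over actions gives the optimal value function, while the log-sum-exp gives the soft value function used by MaxEnt.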
The GridWorld and ObjectWorld environments are both tabular environments, in which the states are discrete and finite. We can easily write down the feature matrix by listing all possible...
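A small sketch of that "list every state" construction for a tabular environment; `feature_vector` is a hypothetical per-state feature function, not an API from this repo.

```python
# Build the feature matrix by enumerating every discrete state and stacking
# its feature vector; works for any tabular environment.
import numpy as np


def build_feature_matrix(n_states, feature_vector):
    """Stack the feature vector of each enumerated state into one matrix."""
    return np.array([feature_vector(s) for s in range(n_states)])


# e.g. one-hot features for a 25-state gridworld:
feature_matrix = build_feature_matrix(25, lambda s: np.eye(25)[s])
```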