OSRL Some questions about the technology used in the CDT paper

Some questions about the technology used in the CDT paper

Open Eternity-Wang opened this issue 1 year ago • 4 comments

Hi, can you please explain in detail the reason for relabelling infeasible target return pairs (Data augmentation by return relabeling in the paper)? I'm very confused about its relationship with outlier filtering, which is mentioned in the D.2 section of the paper.

Sep 13 '24 14:09 Eternity-Wang

OSRL OSRL copied to clipboard

Some questions about the technology used in the CDT paper

OSRL
OSRL copied to clipboard