OSRL

Some questions about the technology used in the CDT paper

Open Eternity-Wang opened this issue 1 year ago • 4 comments

Hi, could you please explain in detail why infeasible target return pairs are relabeled ("Data augmentation by return relabeling" in the paper)? I'm confused about how this relates to the outlier filtering described in Section D.2 of the paper.

Eternity-Wang, Sep 13 '24 14:09