concept-erasure
Applying this at decoding time
Hi, thanks for the repository and paper. Is it possible to apply this to generation tasks in language models, and not just classification? I am very interested in this aspect. Also, just to confirm: the scrubber is applied during inference and doesn't modify model parameters, right? It only modifies hidden representations?