C2D
C2D copied to clipboard
tabular data/ noisy instances
Hi, thanks for sharing your implementation. I have two questions about it:
- Does it also work on tabular data?
- Is it possible to identify the noisy instances (return the noisy IDs or the clean set)?
Thanks!
Self-supervision for tabular data is hard. If you manage to get one, C2D ought to work. As for identification of noisy instances, you can just try to run inference on training set: the samples with high loss are likely to be noisy.