xiaokening
xiaokening
After I use cleanlab to find near_duplicate_issue of my data, I don't know how to deal with the data whose is_near_duplicate_issue is True(these data are indeed near duplicated) to improve...
the shape of pred_probs is (351133, 1522), the shape of embedding is (351133, 45812) the code snippet is below: `lab = Datalab(data_for_train, label_name="vps")` `lab.find_issues(pred_probs=pred_probs,features=embeddings,issue_types={"outlier": {}, "near_duplicate": {}, "non_iid": {}})` `lab.report()`...
# Ask a Question I got the following error when I try to use Shape Inference.Can anyone help me,Thanks!  
## Description When I train xtransformer with pecos model, a training error occurs in the matcher stage. the size of dataset is 108457, Hierarchical label tree: [32, 1102]。In the matcher...