graphein
graphein copied to clipboard
Structure-informed dataset splitting
trafficstars
Create good train/val/test sets based on SCOP/CATH classifications. Sequence-based approaches (e.g. identity thresholding or BLAST) are bad practice and should not be encouraged.