ProteinFlow icon indicating copy to clipboard operation
ProteinFlow copied to clipboard

Calling ProteinEntry.from_pickle(<path>).to_pdb(<target_path>) on the entire dataset reveals errors

Open ardagoreci opened this issue 1 year ago • 2 comments

Hi Liza,

I noticed when trying to create a W&B table visualization for the entire dataset that converting the pickle files into pdbs reveals multiple bugs.

Firstly, I got a "UnpicklingError: unpickling stack underflow" from the line "protein_entry = ProteinEntry.from_pickle(pickle_path)" It did not happen with every protein, so when I handled that exception I realized that PDBParser could not properly parse a few of the generate pdb files, throwing out an error in the line "structure = parser.get_structure(pdb_id, target_path)"

Screen Shot 2024-01-27 at 23 26 16 Screen Shot 2024-01-27 at 23 26 55

ardagoreci avatar Jan 27 '24 23:01 ardagoreci