boltz icon indicating copy to clipboard operation
boltz copied to clipboard

Processing raw data error

Open wensm77 opened this issue 1 year ago • 3 comments

hi! When I use your code to process cif files, I get an error. I checked the code, you did not pass in ‘covalents’ when you finally returned it. image image

  File "/home/wensm/boltz/scripts/process/mmcif.py", line 1121, in parse_mmcif
    return ParsedStructure(data=data, info=info)
TypeError: ParsedStructure.__init__() missing 1 required positional argument: 'covalents'

wensm77 avatar Dec 24 '24 03:12 wensm77

And why is msa_id set to null? This causes the msa file to be unable to be matched during trainnig.

# rcsb.py line 136
    for i, chain in enumerate(structure.chains):
        key = f"{pdb_id}_{chain['entity_id']}"
        chain_info.append(
            ChainInfo(
                chain_id=i,
                chain_name=chain["name"],
                msa_id="",  # FIX
                mol_type=int(chain["mol_type"]),
                cluster_id=clusters.get(key, -1),
                num_residues=int(chain["res_num"]),
            )
        )

wensm77 avatar Dec 24 '24 08:12 wensm77

Running into the same issue!

navvye avatar Feb 04 '25 10:02 navvye

same issue here

kevincheng7 avatar Oct 30 '25 02:10 kevincheng7