yeast-GEM icon indicating copy to clipboard operation
yeast-GEM copied to clipboard

Feat-stoichiometry coefficients for subunits in the enzyme complex

Open feiranl opened this issue 3 years ago • 10 comments

Description of the issue:

  • Stoichiometry coefficients are missing in current GEM but available in Complex Portal and PDB database. Should we include this information and which field is suitable to store such info?

feiranl avatar Jan 31 '22 21:01 feiranl

Maybe firstly store such information in a tsv file firstly? @feiranl @edkerk

hongzhonglu avatar Mar 29 '22 08:03 hongzhonglu

That's a good idea. It's not that straightforward where this information should be stored. In the long run it should ideally be kept in the model file, and not just as a separate TSV. But this raises three issues:

  • Neither RAVEN nor COBRA MATLAB structures have a dedicated field for this. Of course we could define a new model field/structure, but this should be done smartly.
  • It should also be properly recorded in the SBML file. I don't think the FBCv2 package has a way to record this.
  • RAVEN functions should be modified to consider this new field when making modifications to the model structure. For instance, if a reaction associated to a complex is deleted, then this complex should also be deleted. Similar as how removeReactions also removes genes and annotations when required.

This is not of the highest priority, but would be good to start a discussion on how to overcome especially the first two issues.

edkerk avatar Mar 29 '22 08:03 edkerk

After we have the curated datasets. we can put it under this folder https://github.com/SysBioChalmers/yeast-GEM/tree/main/data/databases?

hongzhonglu avatar Mar 29 '22 08:03 hongzhonglu

  • Stoichiometry coefficients are missing in current GEM but available in Complex Portal and PDB database.

I'm really unsure about which coefficients this discussion is about, could you give an example?

mihai-sysbio avatar Mar 29 '22 08:03 mihai-sysbio

The number of protein subunits that make up a protein complex. It is not always just 1 copy of each subunit. See for instance pyruvate dehydrogenase: https://www.ebi.ac.uk/complexportal/complex/CPX-3207:

image

This would for instance make a difference if the model would be turned into an ec-model, but there are likely also other use cases.

edkerk avatar Mar 29 '22 08:03 edkerk

This would for instance make a difference if the model would be turned into an ec-model, but there are likely also other use cases.

Thanks for the example @edkerk. Wouldn't then this be more useful to be dealt with by a future version of GECKO instead? I'm thinking that it would make more sense for this information to be following the same structure regardless of the model.

mihai-sysbio avatar Mar 29 '22 08:03 mihai-sysbio

But that is if GECKO is the only purpose. GECKO should be modified to be able to deal with such information anyway (it currently assumes 1 copy per subunit), but should that information only be provided in GECKO, or distributed as part of the generic yeast-GEM, also ready for other applications?

edkerk avatar Mar 29 '22 08:03 edkerk

generic yeast-GEM, also ready for other applications

Seems there no direct applications of this information in yeast-GEM?

hongzhonglu avatar Mar 29 '22 09:03 hongzhonglu

I'm very hesitant to "copy" external data in any repository, unless it would be useful for a good chunk of the userbase.

also ready for other applications?

I see (but I don't know how often this would be the case). Perhaps what can be stored in this repository is a script that fetches the data via an API, and maybe does some reformatting, but not the data itself, since that will by default get stale so it requires work to keep up to date.

mihai-sysbio avatar Mar 29 '22 09:03 mihai-sysbio

I agree with Mihail. But here it is not just "copy". We get this stoi info from ComplexPortal and information from PDB database. The script extracts protein structures from PDB database, mapping the subunit through sequence alignment and then find the stoi.

feiranl avatar Mar 29 '22 09:03 feiranl