EVcouplings icon indicating copy to clipboard operation
EVcouplings copied to clipboard

Improve mean-field integration

Open thomashopf opened this issue 7 years ago • 9 comments

While investigating #104 I found a couple of issues with the mean-field protocol and couplings objects that would be great to address:

  1. The mean_field couplings protocol should allow to specify the pseudocount weight as a parameter (given the inference method itself has the option already)

  2. It looks like there may be some sign problem with the mutation effects computed using mean-field couplings models (damaging > 0 instead of < 0)

  3. Inference of the independent model shoul be based on pseudocounted f_i matrix (this is the way I originally implemented it, before we moved over to the l2-based inference that is currently present in the CouplingsModel object. Ideally, this would be based on detecting the model is from mean-field upon loading, and returning a MeanFieldCouplingsObject that has the appropriate behavior. For now, the fix from #104 is to specify a default lambda_h of 0.01 and apply l2-regularized inference instead.

@sophiamersmann if you have any time to look into this would be greatly appreciated!

thomashopf avatar Jan 26 '18 16:01 thomashopf

Hi @thomashopf , yes, I will have a look at it as soon as possible.

sophiamersmann avatar Feb 05 '18 17:02 sophiamersmann

Great, thanks!

thomashopf avatar Feb 09 '18 16:02 thomashopf

Thanks @sophiamersmann for implementing all the changes!

Whenever someone has time it would be great to verify on one of the beta-lactamase datasets that the mutation effect calculation with mean-field works as intended and we get a correlation around or above r=0.7 (negative sign -> damaging mutation) - leaving the ticket open until then.

If we added the model files to the testing files and create a test around this, this would serve as an additional comprehensive final check if the CouplingsModel class and the mutate stage are intact (partly solving the catch 22 how to get orthogonal values for testing these calculations).

thomashopf avatar Mar 12 '18 17:03 thomashopf

@sophiamersmann would it be too much asking to integrate the tests in the pipeline? That would really be the cherry on top 🌈 😃

sacdallago avatar Apr 06 '18 14:04 sacdallago

Sure! Do you want me to use any specific dataset?

sophiamersmann avatar Apr 10 '18 09:04 sophiamersmann

@sophiamersmann you can use our test data:

http://marks.hms.harvard.edu/evcouplings_test_cases.tar.gz

This contains alignments and runs for a monomer and complex. let me know if you need to add some additional files and I will add them to the archive.

b-schubert avatar Apr 26 '18 15:04 b-schubert

sorry for not getting back here 😮 I read it and then things piled up

sacdallago avatar Apr 26 '18 17:04 sacdallago

Don't worry Christian. Thanks, Benni. I'll get it done as soon as possible. :)

sophiamersmann avatar May 03 '18 09:05 sophiamersmann

Fixed a sign error in b88245011a714950e7c033a4ec0ebb1f442c71b3, now mutation effects have the correct sign. But somehow they still look odd overall so leaving open until completely verified

thomashopf avatar Apr 21 '20 07:04 thomashopf