How to deal with FAIMS data?
Hello everyone,
I am confused about how to set up the experimental design for FAIMS data.
I have raw data acquired at 2 compensation voltages (-45 V and -70 V). My experimental design is shown below.
But the output.csv is not what I expected. For example, it reports the intensity of sample001 separately for -45 V and -70 V. What I expected is one intensity per sample, not split by voltage.
So is there anything wrong with my experimental design, and how can I fix it? Thanks.
Hi!
That is a good question. Your approach actually could work. But you should probably declare the different voltages as different fractions.
1 1 s1v1.mzML 1
1 2 s1v2.mzML 1
2 1 s2v1.mzML 1
2 2 s2v2.mzML 1
...
(if you post your design as a Markdown table, I can edit it for you)
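For illustration, here is a minimal Python sketch of how such a design could be generated, with one fraction group per sample and one fraction per compensation voltage. The column names are my assumption of the one-table OpenMS experimental design layout, and the file naming scheme, the sample labels, and the choice of one condition per sample are hypothetical, so please check them against the pipeline documentation and your own files.

```python
import csv
import io

# Assumed column names of the one-table OpenMS experimental design;
# verify them against the pipeline documentation before use.
COLUMNS = ["Fraction_Group", "Fraction", "Spectra_Filepath", "Label",
           "Sample", "MSstats_Condition", "MSstats_BioReplicate"]


def build_design(samples, voltages):
    """Return one design row per (sample, voltage) pair.

    Each sample gets its own fraction group; each voltage becomes a fraction.
    """
    rows = []
    for group, _sample in enumerate(samples, start=1):
        for fraction, _voltage in enumerate(voltages, start=1):
            rows.append({
                "Fraction_Group": group,
                "Fraction": fraction,
                # Hypothetical file naming scheme, for illustration only.
                "Spectra_Filepath": f"s{group}v{fraction}.mzML",
                "Label": 1,
                "Sample": group,
                # Assumption: one condition per sample, a single bio replicate.
                "MSstats_Condition": group,
                "MSstats_BioReplicate": 1,
            })
    return rows


def to_tsv(rows):
    """Serialize the design rows as tab-separated text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=COLUMNS, delimiter="\t",
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


design = build_design(["001", "008", "010"], [-45, -70])
print(to_tsv(design))
```

The point of the sketch is only that both voltages of a sample share the same Fraction_Group and Sample, and differ in Fraction.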
Thank you for your reply. I used your approach, as shown below.

My command is shown here:
/home/host/software/nextflow run nf-core/proteomicslfq \
--input '/path/to/rawdata/.raw' \
--database '/path/to/fasta/.fasta' \
-profile docker \
--max_memory 600.GB \
--max_cpus 120 \
--expdesign 'expdesign.tsv' \
--add_decoys
But it gives me an error.

The error output of my command is shown here.
Do you know how to fix the error?
Thank you so much.
Hello,
Do you really want to compare across FAIMS voltages?
If not, try setting the conditions to 001, 008, 010. In your case, the bio replicate should always be 1 (unless the voltages used different cell material, of course).
It finally works. Thanks! But you misunderstood me: I do not want to compare intensities between voltages. I actually want one intensity per sample. Can you help me?
Great! Actually, I assumed that already.
If not, try to ..
So, have a look. You should already have intensities per sample.
Here is my out.csv. It still gives intensities separated by voltage for each sample. I also find duplicated protein names between the voltages of a single sample; for example, 4000 proteins are duplicated between sample 1 at -45 V and sample 1 at -70 V, and their intensities differ between the 2 references. So which reference should I accept for the intensity of a duplicated protein? Thanks.

Hmm yes, this does not look correct. Can you show your current experimental design?
Here!
Can you try with consecutive numbers starting from 1 in each column? That means fractions 1,2,1,2,... and conditions 1,1,2,2,...
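In case it helps, the renumbering could be sketched in Python like this. It maps arbitrary labels to consecutive integers starting from 1, in order of first appearance. The function name `renumber` and the input lists are only illustrative; the labels are the sample names and voltages mentioned in this thread.

```python
def renumber(labels):
    """Map arbitrary labels to consecutive integers starting from 1,
    in order of first appearance."""
    mapping = {}
    numbered = []
    for label in labels:
        if label not in mapping:
            mapping[label] = len(mapping) + 1
        numbered.append(mapping[label])
    return numbered


# One entry per spectra file, in the row order of the design.
conditions = renumber(["001", "001", "008", "008", "010", "010"])
fractions = renumber([-45, -70, -45, -70, -45, -70])

print(conditions)  # [1, 1, 2, 2, 3, 3]
print(fractions)   # [1, 2, 1, 2, 1, 2]
```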
Still the same results.

I disagree. Now it seems like you have the correct annotation of conditions. What were you expecting?
Did you run the MSstats step? If so, have a look at msstats/msstats_output.csv. There the intensities should already be aggregated further.
Regarding your "duplication problem": duplicated proteins are not the problem; this will be common. The only thing that needs to be resolved is peptides with the same sequence, modifications, and charge state that appear at different voltages/fractions. A common approach for fractions is to take the highest-intensity feature.
But note that I would let MSstats do the work and just look at the MSstats results as mentioned in my previous comment.
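Just to make the "highest intensity feature" idea concrete, here is a small Python sketch. It is not what the pipeline or MSstats does internally; it only illustrates the rule on made-up data. The dictionary keys (`sample`, `fraction`, `sequence`, `modifications`, `charge`, `intensity`) and all values are hypothetical.

```python
def collapse_fractions(features):
    """Keep the highest-intensity feature per
    (sample, sequence, modifications, charge), across fractions."""
    best = {}
    for feature in features:
        key = (feature["sample"], feature["sequence"],
               feature["modifications"], feature["charge"])
        if key not in best or feature["intensity"] > best[key]["intensity"]:
            best[key] = feature
    return list(best.values())


# Made-up example: the same peptide ion seen at both voltages of sample s1.
features = [
    {"sample": "s1", "fraction": 1, "sequence": "PEPTIDEK",
     "modifications": "", "charge": 2, "intensity": 1.0e6},
    {"sample": "s1", "fraction": 2, "sequence": "PEPTIDEK",
     "modifications": "", "charge": 2, "intensity": 3.0e6},
    {"sample": "s1", "fraction": 1, "sequence": "PEPTIDEK",
     "modifications": "", "charge": 3, "intensity": 5.0e5},
    {"sample": "s2", "fraction": 1, "sequence": "PEPTIDEK",
     "modifications": "", "charge": 2, "intensity": 2.0e6},
]

collapsed = collapse_fractions(features)
for feature in collapsed:
    print(feature["sample"], feature["charge"], feature["intensity"])
```

Different charge states and different samples stay separate; only the same peptide ion seen in several fractions of one sample is reduced to a single value.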