hi Michael
Please forgive me for bringing this up again... In my defense: You said nothing is preventing users from converting .d folders to mzml and analyze diaPASEF data :)
I have tried a few things to see how different softwares currently handly diaPASEF. For this I generated a ~10ng, 5min human HEK active gradient diaPASEF run on a timsTOF Ultra.
file size as .d folder:
1.4GB
I have now tried 3 different ways to generate an mzml file:
- MSConvert, standard settings, so I think it came down to 4.5 million scan events which led to a final size of 25GB.
- MSConvert, Combine ion mobility scans 6.3GB
- timsconvert, standard settings 5.7GB
I searched the data with sage and for comparison I also included a 30min ddaPASEF run, which was directly analyzed from .d



the ddaPASEF seems to work nicely looking at the q value distribution of spectrum, peptide or protein level, but for all other searches the q value distribution looks odd to me.
I would be happy to share the data with you, or do more testing benchmarking. As I already mentioned there is currently no non-proprietary option to analyze diaPASEF data with large search spaces, so I am highly motivated to get this off the ground.
You also said earlier that it is not trivial to collapse / handle the ion mobility in tims data.
Would you be open to have a look at the msconvert ion mobility combine option and at the resulting data structure?
Maybe this can already solve the problem and sage "only" needs to be adapted to be able to handle this specific mzml ?
Oh and btw working with sage made me smile more than once! It is a lot of fun to work with a tool so fast! I still remember the days I had to run semitryptic searches in MaxQuant for a week xD
Best, Klemens
hi Michael
Please forgive me for bringing this up again... In my defense: You said nothing is preventing users from converting .d folders to mzml and analyze diaPASEF data :)
I have tried a few things to see how different softwares currently handly diaPASEF. For this I generated a ~10ng, 5min human HEK active gradient diaPASEF run on a timsTOF Ultra.
file size as .d folder:
1.4GB
I have now tried 3 different ways to generate an mzml file:
I searched the data with sage and for comparison I also included a 30min ddaPASEF run, which was directly analyzed from .d
the ddaPASEF seems to work nicely looking at the q value distribution of spectrum, peptide or protein level, but for all other searches the q value distribution looks odd to me.
I would be happy to share the data with you, or do more testing benchmarking. As I already mentioned there is currently no non-proprietary option to analyze diaPASEF data with large search spaces, so I am highly motivated to get this off the ground.
You also said earlier that it is not trivial to collapse / handle the ion mobility in tims data.
Would you be open to have a look at the msconvert ion mobility combine option and at the resulting data structure?
Maybe this can already solve the problem and sage "only" needs to be adapted to be able to handle this specific mzml ?
Oh and btw working with sage made me smile more than once! It is a lot of fun to work with a tool so fast! I still remember the days I had to run semitryptic searches in MaxQuant for a week xD
Best, Klemens