Skip to content

cemilcansaylan/oktoberfest

 
 

Repository files navigation

Oktoberfest: Rescoring, Collision Energy Calibration and Spectral Library Generation for Proteomics

PyPI Python Version License Read the documentation at https://oktoberfest.readthedocs.io/ CI Status Codecov pre-commit Ruff

Oktoberfest is a python tool for collision energy calibration, rescoring search results and generating spectral libraries for proteomics research within the Prosit ecosystem. It offers an end to end pipeline that takes search results, predicts peptide properties using koina, plots summaries and quality control figures and performs FDR estimation with either mokapot or percolator.

How to cite

Please always cite the main publication

[1] Picciani M, Gabriel W, Giurcoiu VG et al. (2023), Oktoberfest: Open-source spectral library generation and rescoring pipeline based on Prosit, Proteomics.

Please cite the Prosit model(s) you are using

When using Prosit

[2] Gessulat S, Schmidt T, Zolg DP et al. (2019), PROSIT: Proteome-wide prediction of peptide tandem mass spectra by deep learning, Nature Methods.

[3] Wilhelm M, Zolg DP, Graber M et al. (2021), Deep learning boosts sensitivity of mass spectrometry-based immunopeptidomics, Nature Communications.

When using Prosit-TMT

[4] Gabriel W, The M, Zolg D et al. (2022), Prosit-TMT: Deep Learning Boosts Identification of TMT-Labeled Peptides, Analytical Chemistry.

When using Prosit-timsTOF

[5] Adams C, Gabriel W, Laukens K et al. (2024), Fragment ion intensity prediction improves the identification rate of non-tryptic peptides in timsTOF, Nature Communications.

When using Prosit-XL

[6] Kalhor M, Saylan C, Picciani M et al. (2025), Prosit-XL: enhanced cross-linked peptide identification by fragment intensity prediction to study protein interactions and structures, Nature Communications.

When using Prosit-MultiFrag

[7] Levin N, Saylan CC, Lapin J et al. (2026), Integration of alternative fragmentation techniques into standard LC-MS workflows using a single deep learning model enhances proteome coverage, Nature Methods.

Please cite in case you use quantification

[8] The M, Samaras P, Kuster B, Wilhelm, M. (2022), Reanalysis of ProteomicsDB using an accurate, sensitive, and scalable false discovery rate estimation approach for protein groups, Molecular & Cellular Proteomics.

About

Rescoring and spectral library generation pipeline for proteomics.

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 66.5%
  • Jupyter Notebook 31.7%
  • Other 1.8%