|国家预印本平台
首页|NovoBoard: a comprehensive framework for evaluating the false discovery rate and accuracy of de novo peptide sequencing

NovoBoard: a comprehensive framework for evaluating the false discovery rate and accuracy of de novo peptide sequencing

NovoBoard: a comprehensive framework for evaluating the false discovery rate and accuracy of de novo peptide sequencing

来源:bioRxiv_logobioRxiv
英文摘要

Abstract De novo peptide sequencing is a fundamental research area in mass spectrometry (MS) based proteomics. However, those methods have often been evaluated using a couple of simple metrics that do not fully reflect their overall performance. Moreover, there has not been an established method to estimate the false discovery rate (FDR) and the significance of de novo peptide-spectrum matches (PSMs). Here we propose NovoBoard, a comprehensive framework to evaluate the performance of de novo peptide sequencing methods. The framework consists of diverse benchmark datasets (including tryptic, nontryptic, immunopeptidomics, and different species), and a standard set of accuracy metrics to evaluate the fragment ions, amino acids, and peptides of the de novo results. More importantly, a new approach is designed to evaluate de novo peptide sequencing methods on target-decoy spectra and to estimate their FDRs. Our results thoroughly reveal the strengths and weaknesses of different de novo peptide sequencing methods, and how their performances depend on specific applications and the types of data. Our FDR estimation also shows that some tools may perform better than the others in distinguishing between de novo PSMs and random matches, and can be used to assess the significance of de novo PSMs.

Li Ming、Tran Ngoc Hieu、Mao Zeping、Zhang Qing、Shan Baozhen、Xin Lei、Qiao Rui、Li Wenting、Pan Shengying

David R. Cheriton School of Computer Science, University of WaterlooBioinformatics Solutions Inc.Bioinformatics Solutions Inc.||David R. Cheriton School of Computer Science, University of WaterlooBioinformatics Solutions Inc.Bioinformatics Solutions Inc.Bioinformatics Solutions Inc.Bioinformatics Solutions Inc.Bioinformatics Solutions Inc.Bioinformatics Solutions Inc.

10.1101/2024.04.16.589668

生物科学研究方法、生物科学研究技术生物化学分子生物学

Li Ming,Tran Ngoc Hieu,Mao Zeping,Zhang Qing,Shan Baozhen,Xin Lei,Qiao Rui,Li Wenting,Pan Shengying.NovoBoard: a comprehensive framework for evaluating the false discovery rate and accuracy of de novo peptide sequencing[EB/OL].(2025-03-28)[2025-04-26].https://www.biorxiv.org/content/10.1101/2024.04.16.589668.点此复制

评论