About AbNovoBench

Advancing antibody research through standardized benchmarking and collaborative innovation

Our Mission

To establish a comprehensive, standardized, and reproducible benchmarking system dedicated to evaluating monoclonal antibody de novo sequencing.

Our Vision

To empower researchers with high-quality datasets, consistent evaluation tools, and pre-trained models—accelerating antibody therapeutic development without the need for retraining from scratch.

Our Impact

By offering the largest publicly available antibody sequencing dataset and ready-to-use evaluation pipeline, AbNovoBench helps researchers identify the most suitable models for their needs and drive innovation in antibody informatics.

Project Overview

About AbNovoBench

Empowering antibody research through unified training, large-scale curated datasets, and comprehensive, reproducible benchmarking.

Background

Monoclonal antibodies (mAbs) have become indispensable tools in modern research and biomedicine, yet accurate and complete sequence information identification remains challenging. Traditional mRNA-based methods rely on the availability of hybridoma clones and fail to detect post-translational modifications critical for antibody function. Mass spectrometry (MS)-based de novo sequencing provides a powerful alternative, enabling direct readout of secreted antibody sequences. However, benchmarking de novo tools in this domain has been limited by a lack of standardized datasets and unified evaluation protocols.

The Challenge

Most existing deep learning models for peptide sequencing were not specifically designed for monoclonal antibody applications. The field has faced three major obstacles: (1) inconsistent training datasets across studies that hinder fair comparisons; (2) insufficient availability of curated antibody-specific test data; and (3) the absence of a standardized, reproducible, and publicly accessible benchmarking framework tailored to antibodies.

Our Solution

AbNovoBench addresses these challenges by offering the largest high-quality benchmarking dataset for monoclonal antibody sequencing to date, comprising over 1.6 million peptide-spectrum matches (PSMs) from 131 antibodies across six species and 11 proteases. In addition, it provides eight fully annotated monoclonal antibody datasets with known full-length amino acid sequences, specifically designed for evaluating downstream assembly performance. The platform integrates a unified training set, standardized evaluation metrics, and automated scoring systems to support comprehensive, reproducible, and fair comparisons across different de novo peptide sequencing algorithms and assembly strategies.

Mass Spectrometry

Machine Learning

Proteomics

Bioinformatics

Key Statistics

1.6M+

Peptide-Spectrum Matches

131

Monoclonal Antibodies

Species Covered

Protease Types

Known Monoclonal Antibodies

0.33TB

raw data

Research Team

Meet the researchers and developers behind AbNovoBench

👨‍🔬

Ningshao Xia

Academician of Chinese Academy of Engineering

Leading expert in vaccine development and infectious diseases, with extensive research in viral hepatitis, AIDS, influenza, and diagnostic development.

👨‍💼

Quan Yuan

Changjiang Young Scholar

Recipient of the Changjiang Young Scholars Program by the Ministry of Education, specializing in hepatitis research and vaccine development.

👨‍💻

Rongshan Yu

IET Fellow; IEEE Senior Member

Distinguished researcher in computational biology and bioinformatics, with expertise in machine learning applications for biological data analysis.

Collaborators & Partners

Institutions and organizations supporting AbNovoBench

🏛️

National Institute for Data Science in Health and Medicine, Xiamen University

Research Institute

🔬

State Key Laboratory of Vaccines for Infectious Diseases, Xiamen University

Laboratory

🔬

Xiang An Biomedicine Laboratory, Xiamen University

Laboratory

🏫

School of Public Health, Xiamen University

Academic

🏫

School of Life Sciences, Xiamen University

Academic

🏫

School of Informatics, Xiamen University

Academic

🏥

The First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University

Medical

🧬

Aginome Scientific, Xiamen

Industry

Get in Touch

Connect with our team for collaborations and inquiries

Distributed Research Team

luolingplay@gmail.com wenbin_jiang1@163.com

abnovobench.com

Contribute

Join our open-source community and help improve AbNovoBench

GitHub Repository

• Contribute datasets and models

• Report bugs and feature requests

• Improve documentation

• Share research findings

View on GitHub