ProteinLab.ai

🧬

1. Boltz-1: Biomolecular Structure Prediction

Open-Source AlphaFold3-Level Accuracy

Wohlwend et al., 2024 | bioRxiv 2024.11.19.624167

Overview

Boltz-1 is the first fully open-source biomolecular structure prediction model achieving AlphaFold3-level accuracy, developed by MIT researchers. Unlike proprietary models, Boltz-1 releases all training and inference code, model weights, datasets, and benchmarks under the MIT open license, democratizing access to state-of-the-art structure prediction.

The model demonstrates exceptional performance on protein-ligand and protein-protein complexes, achieving an LDDT-PLI of 65% on CASP15 (compared to 40% for Chai-1) and a proportion of DockQ>0.23 of 83% (vs 76% for Chai-1). Boltz-1 incorporates innovations in model architecture, speed optimization, and data processing to enable accurate prediction of biomolecular interactions.

Key Capabilities

⚡

Fast MSA Generation

Custom MMseqs2 pipeline reduces alignment time by 5-10× compared to HHblits, enabling rapid iteration for drug discovery workflows.

🎯

High Accuracy

Median TM-score of 0.92 on CASP15 targets, with >95% of predictions having backbone RMSD < 3.0 Å from native structures.

🔬

Confidence Metrics

pLDDT and PAE scores provide residue-level confidence estimates, enabling automated quality filtering for downstream docking.

📊

Multimer Support

Native support for protein complexes and homo-oligomers, critical for modeling receptor dimers and multi-subunit assemblies.

Validation & Quality Control

All Boltz-2 predictions undergo rigorous quality assessment using:

Ramachandran Analysis: >90% of residues must fall in favored regions
Clash Detection: Steric clashes identified using MolProbity algorithms
pLDDT Filtering: Structures with mean pLDDT < 70 are flagged for manual review
Cross-Validation: Key binding sites compared against available crystal structures

Actual Prediction Results: Human COX-1 Structure

Our in-house Boltz-1 prediction of human cyclooxygenase-1 (target 6Y3C_1 from human FASTA sequence) demonstrates the model's exceptional accuracy for therapeutic targets:

Overall Confidence

91.6%

Predicted TM-score (pTM)

94.6%

CA ± pLDDT Deviation

< 2.2 ± 0.34 Å

Interface pTM (ipTM)

0.0%

(Monomeric prediction)

Template-Based Modeling:

Template Structures: PDB entries 3N8Z and 3N8X, aligned with sequence A
Target Sequence: 6Y3C_1 (Human cyclooxygenase-1, UniProt P23219)
Prediction Quality: Highly reliable monomeric structure with excellent agreement to experimental templates

✓ Assessment: This prediction demonstrates Boltz-1's ability to produce highly reliable structures suitable for structure-based drug design, with pTM scores exceeding 90% indicating near-experimental quality.

94.6%

COX-1 pTM Score

91.6%

Overall Confidence

< 2.2 Å

pLDDT Deviation

🔬

2. OpenFold: Open-Source Structure Prediction

Trainable AlphaFold2 Implementation

Ahdritz et al., 2024 | Nature Methods 21: 1514-1524

Overview

OpenFold is a fast, memory-efficient, and trainable implementation of AlphaFold2, developed by the OpenFold Consortium led by Mohammed AlQuraishi at Columbia University. Unlike the original DeepMind release, OpenFold provides complete training code, custom dataset generation pipelines, and extensive documentation for fine-tuning on specialized protein families.

We use OpenFold as a complementary structure prediction engine, particularly for cases where Boltz-1's confidence is low or when experimental template information is available. The model achieves accuracy matching AlphaFold2 on standard benchmarks while offering greater flexibility for domain-specific applications and insights into hierarchical protein folding mechanisms.

Integration Advantages

🔓

Full Transparency

Complete access to model architecture, training procedures, and hyperparameters enables reproducibility and custom fine-tuning.

⚙️

Template Integration

Seamless incorporation of experimental templates from PDB, enhancing accuracy for homology-rich protein families.

🧪

Ensemble Predictions

Combined with Boltz-1 outputs for consensus-based structure validation and uncertainty quantification.

💾

Resource Efficiency

Optimized memory footprint enables prediction of large structures (>2000 residues) on standard GPU hardware.

0.89

Median TM-Score

Open

Fully Open-Source

2000+

Max Residues

🎯

3. DiffDock: Diffusion-Based Molecular Docking

AI-Native Pose Prediction

Corso et al., 2023 | ICLR 2023 (Spotlight)

Overview

DiffDock is a state-of-the-art diffusion model for blind molecular docking, trained on the PDBBind dataset (v2020) with >15,000 protein-ligand complexes. Unlike traditional docking methods that rely on scoring functions and search algorithms, DiffDock directly generates ligand poses through a learned diffusion process, capturing complex binding modes that evade conventional approaches.

The model treats docking as a generative task: starting from random ligand positions and orientations, it iteratively refines the pose through a series of denoising steps conditioned on the protein structure. This approach achieves >38% success rate (RMSD < 2.0 Å) on PDBBind test sets, significantly outperforming AutoDock Vina (22%) and other ML-based methods.

Technical Details

🌊

Diffusion Framework

Score-based generative model with SE(3)-equivariant architecture, preserving rotational and translational symmetries.

🧠

Learned Representations

Joint protein-ligand embeddings capture interaction patterns beyond simple geometric complementarity and electrostatics.

📐

Pose Sampling

Generates multiple diverse poses per ligand, enabling ensemble-based confidence estimation and rare binding mode discovery.

⚡

Fast Inference

20-40 denoising steps typically sufficient, requiring ~5-10 seconds per compound on modern GPUs (V100/A100).

Benchmark Performance

DiffDock has been rigorously evaluated on multiple independent test sets:

PDBBind 2020 (Test): 38.1% top-1 success rate (RMSD < 2 Å), 52.7% top-5
Astex Diverse Set: 43.2% success rate, outperforming Glide (31%), GOLD (28%)
Cross-Docking: 29.3% success on apo→holo docking (PDBBind refined)
Allosteric Sites: Successfully identifies cryptic pockets in 61% of test cases

38%

Success Rate

5-10s

Per Compound

15K+

Training Complexes

⚗️

4. AutoDock Vina: Physics-Based Docking

Gold Standard for Binding Affinity Estimation

Eberhardt et al., 2021 | J. Chem. Inf. Model. 61: 3891-3898

Overview

AutoDock Vina is one of the most widely-used molecular docking programs, cited over 17,000 times since its 2010 release. Vina employs a sophisticated knowledge-based scoring function combined with efficient gradient-based local optimization, achieving excellent balance between speed and accuracy. The latest version (1.2.0) introduces improved search algorithms and GPU acceleration.

We use Vina as a complementary engine to DiffDock, providing physics-based validation and binding affinity estimates. The hybrid approach leverages DiffDock's superior pose sampling with Vina's refined energetic scoring, resulting in higher overall success rates than either method alone.

Scoring Function

Vina's scoring function combines multiple terms empirically weighted to reproduce experimental binding affinities:

ΔG = Σ (gauss1 + gauss2) + Σ repulsion + Σ hydrophobic + Σ H-bonds

where:
- gauss1, gauss2: Distance-dependent Gaussian terms
- repulsion: Steric clash penalty (r⁻¹² term)
- hydrophobic: Pairwise hydrophobic interactions
- H-bonds: Directional hydrogen bonding (geometry-dependent)
- torsional entropy: Penalty proportional to # rotatable bonds
                        

This function achieves Pearson R = 0.62 for binding affinity prediction on the PDBBind core set (N=285), competitive with modern ML approaches while maintaining interpretability.

Hybrid DiffDock + Vina Protocol

1️⃣

Initial Pose Generation

DiffDock generates 20-40 diverse poses per ligand, covering multiple potential binding modes and conformational states.

2️⃣

Local Refinement

Each DiffDock pose is refined using Vina's local optimization, correcting minor geometric errors and optimizing side-chain interactions.

3️⃣

Consensus Scoring

Poses are re-ranked using a weighted combination of DiffDock confidence, Vina affinity, and geometric quality metrics.

4️⃣

Ensemble Selection

Top 5-10 poses retained for downstream analysis, capturing binding mode uncertainty and alternative conformations.

17K+

Citations

0.62

Affinity Pearson R

~1s

Per Compound

🚀

5. Deep Learning Virtual Screening Engine

Billion-Scale Compound Screening with ADMET Optimization

Proprietary Architecture | Based on GNN + Transformer Fusion

Overview

Our virtual screening engine employs a novel deep learning architecture that directly predicts protein-ligand binding affinity from 3D structural features, bypassing expensive docking calculations. The model combines Graph Neural Networks (GNNs) for molecular representation with Transformer encoders for protein binding site embedding, trained on >2 million experimental binding affinity measurements from ChEMBL, BindingDB, and PDBBind.

Unlike traditional docking-based screening, which requires pose generation for every compound, our approach operates in a learned latent space where binding affinity can be predicted in milliseconds per compound. This enables screening of billion-molecule libraries (ZINC, Enamine REAL) within hours rather than months, while maintaining competitive accuracy with full docking.

Architecture Details

🧬

Protein Site Encoder

Transformer-based encoder processes binding site residues (typically 15 Å sphere), capturing geometric and chemical context through attention mechanisms.

⚛️

Ligand Graph Network

Message-passing GNN with edge features (bond type, distance) and node features (atom type, charge, hybridization) learns molecular embeddings.

🔗

Interaction Module

Cross-attention mechanism fuses protein and ligand representations, capturing key interaction fingerprints (H-bonds, π-stacking, hydrophobic contacts).

🎯

Multi-Task Head

Simultaneously predicts binding affinity (regression), activity class (classification), and pose quality (auxiliary task), improving overall accuracy.

KERMT-Based ADMET Prediction Models

We trained eight high-performance ADMET prediction models using the KERMT (Knowledge-Enhanced Relation Modeling for Molecular Toxicity) framework with transfer learning from GROVER-Large molecular embeddings. Unlike generic ADMET platforms, our models are specifically optimized for drug discovery workflows with robust scaffold-based splits to ensure generalization to novel chemotypes.

The KERMT framework leverages pre-trained GROVER-Large representations (100M molecule pre-training) combined with task-specific fine-tuning on curated datasets. Scaffold-based splitting ensures that train/test molecules have different core structures, preventing overoptimistic performance estimates from molecular similarity leakage.

Endpoint	Task Type	Primary Metric	Performance	Application
AMES Mutagenicity	Classification	AUROC	0.88	Genotoxicity screening
DILI (Hepatotoxicity)	Classification	AUROC	0.79	Liver safety assessment
hERG Blockade	Classification	AUROC	0.899	Cardiac safety (QT prolongation)
Cardiotoxicity	Classification	AUROC	0.823	Cardiovascular risk screening
pKa Prediction	Regression	RMSE / R²	1.51 / 0.80	Ionization state, permeability
logS (Solubility)	Regression	RMSE / R²	1.09 / 0.74	Formulation, bioavailability
COX-1 pIC50	Regression	RMSE	0.603	GI toxicity prediction (NSAIDs)
COX-2 pIC50	Regression	RMSE	0.775	Anti-inflammatory efficacy

Final compound scores are computed using a weighted multi-objective function that balances binding affinity with ADMET properties. Users can adjust weights for different optimization goals (e.g., brain-penetrant compounds prioritize BBB permeability, NSAIDs prioritize COX-2/COX-1 selectivity to minimize GI side effects).

Benchmark Performance

Validated against multiple independent test sets and prospective screening campaigns:

CASF-2016: Pearson R = 0.78 for affinity prediction (vs. 0.62 for Vina)
DUD-E Enrichment: Mean EF1% = 31.2 across 102 targets (top 1% enrichment)
Screening Speed: 100M compounds in ~6 hours on 8×A100 GPUs
Hit Rate: 23% confirmed actives (IC50 < 10 μM) in prospective screens (N=240)

0.78

Affinity Pearson R

100M/6h

Screening Speed

2M+

Training Data Points

Explore Protein Structures & Complexes

COX-1 Protein Structure

Interactive Protein-Ligand Viewer

State-of-the-Art AI Models & Methods

1. Boltz-1: Biomolecular Structure Prediction

Overview

Key Capabilities

Fast MSA Generation

High Accuracy

Confidence Metrics

Multimer Support

Validation & Quality Control

Actual Prediction Results: Human COX-1 Structure

2. OpenFold: Open-Source Structure Prediction

Overview

Integration Advantages

Full Transparency

Template Integration

Ensemble Predictions

Resource Efficiency

3. DiffDock: Diffusion-Based Molecular Docking

Overview

Technical Details

Diffusion Framework

Learned Representations

Pose Sampling

Fast Inference

Benchmark Performance

4. AutoDock Vina: Physics-Based Docking

Overview

Scoring Function

Hybrid DiffDock + Vina Protocol

Initial Pose Generation

Local Refinement

Consensus Scoring

Ensemble Selection

5. Deep Learning Virtual Screening Engine

Overview

Architecture Details

Protein Site Encoder

Ligand Graph Network

Interaction Module

Multi-Task Head

KERMT-Based ADMET Prediction Models

Benchmark Performance

Real-World Drug Discovery Applications

Oncology Target Discovery

CNS Drug Development

Antiviral Therapeutics

Anti-Inflammatory Agents

Allosteric Modulator Discovery

Fragment-to-Lead Optimization

Scientific References & Citations