Frequently Asked Questions

General Questions

What is AlphaFold 3?

AlphaFold 3 is a deep learning system for predicting the 3D structures of biomolecular complexes, including proteins, nucleic acids, ligands, and post-translational modifications. It represents a significant advancement over AlphaFold 2, extending structure prediction beyond proteins to diverse biomolecular interactions.The model was published in Nature: Accurate structure prediction of biomolecular interactions with AlphaFold 3.

How is AlphaFold 3 different from AlphaFold 2?

Key differences include:

Broader scope: AlphaFold 3 can model proteins, RNA, DNA, ligands, ions, and post-translational modifications
Complex interactions: Predicts interactions between different biomolecule types
Improved accuracy: Enhanced performance on protein-ligand and protein-nucleic acid interactions
Covalent modifications: Supports modeling of covalent bonds and modifications
New architecture: Uses an updated neural network architecture optimized for diverse biomolecules

Can I use AlphaFold 3 for commercial purposes?

No, AlphaFold 3 is licensed for non-commercial use only under the CC-BY-NC-SA 4.0 license.

The AlphaFold 3 source code and model parameters are subject to non-commercial licensing terms. Review the terms of use before use.

For commercial applications, please contact the AlphaFold team at [email protected].

Is there a web server for AlphaFold 3?

Yes, AlphaFold 3 is available at alphafoldserver.com for non-commercial use.However, the server has limitations:

More limited set of ligands
Fewer covalent modifications supported
No ability to customize model parameters

The local installation provides full functionality and flexibility.

Model Access and Installation

How do I get access to the AlphaFold 3 model parameters?

To request access to the model parameters:

Complete the access request form
Wait for approval from Google DeepMind (typically 2-3 business days)
You will receive a download link via email

You may only use AlphaFold 3 model parameters if received directly from Google. Access is granted at Google DeepMind’s sole discretion.

What hardware do I need to run AlphaFold 3?

Minimum requirements:

GPU: NVIDIA GPU with at least 16GB VRAM (e.g., V100, A100, RTX 4090)
RAM: 32GB+ system RAM
Disk: Several TB for databases (500GB+ for minimal setup)
CPU: Multi-core processor for data pipeline

Recommended:

GPU: A100 80GB or H100 for large complexes
RAM: 64GB+ system RAM
Disk: 3-5TB SSD for full databases

The data pipeline (genetic searches) can run on CPU-only machines without a GPU.

Can I run AlphaFold 3 without Docker?

While Docker is the officially supported method, technically you can run AlphaFold 3 without Docker by:

Manually installing all dependencies
Setting up the Python environment
Configuring paths for databases and models

However, this is not recommended because:

Complex dependency management
Potential version conflicts
Lack of official support

Docker ensures a consistent, reproducible environment.

How long does installation take?

Installation time varies:

Docker build: 30-60 minutes (depending on network speed)
Database download: Several hours to days (depending on which databases and network speed)
Model parameters: 15-30 minutes (depending on network speed)

Total setup time: Typically 1-3 days for a complete installation.

Running Predictions

How long does a prediction take?

Prediction time depends on several factors:Data pipeline (genetic searches):

Simple proteins: 1-4 hours
Complex proteins: 4-24 hours
Very large complexes: 24-48+ hours

Inference (GPU):

Small proteins (<200 residues): 5-15 minutes
Medium proteins (200-500 residues): 15-45 minutes
Large complexes (>500 residues): 45 minutes - 2 hours

You can run data pipeline and inference separately using --run_data_pipeline and --run_inference flags.

Can I run multiple predictions in parallel?

Yes, with caveats:

Data pipeline: Can run multiple in parallel if you have enough CPU and RAM
Inference: Limited by GPU memory - typically 1-2 predictions per GPU depending on complex size

For maximum efficiency:

Run data pipeline on CPU-only machines in parallel
Run inference sequentially or on multiple GPUs

What is a model seed and how many should I use?

A model seed controls the random initialization of the neural network. Using multiple seeds generates diverse predictions.Recommendations:

Standard use: 5 seeds ("modelSeeds": [1, 2, 3, 4, 5])
Quick testing: 1 seed
High confidence needed: 10+ seeds

The model ranks all predictions, with the best one typically selected.

Can I stop and resume a prediction?

Data pipeline: Cannot be easily resumed - must restart if interruptedInference: Cannot be resumed - must rerun if interrupted

Save your data pipeline outputs to avoid rerunning expensive genetic searches.

Best practices:

Run data pipeline separately with --run_data_pipeline=true --run_inference=false
Save MSA and template outputs
Run inference later with pre-computed data

Input and Output

What types of molecules can AlphaFold 3 model?

AlphaFold 3 supports:

Proteins: Any amino acid sequence
RNA: Nucleotide sequences
DNA: Nucleotide sequences
Ligands: Small molecules (via SMILES or CCD codes)
Ions: Common ions (via CCD codes)
Post-translational modifications: Glycans, phosphorylation, etc.
Covalent bonds: Disulfide bridges, peptide bonds to ligands, etc.

See the Input Format documentation for details.

How do I interpret the confidence scores?

AlphaFold 3 provides several confidence metrics:pLDDT (per-residue confidence):

90: Very high confidence
70-90: Generally reliable
50-70: Low confidence
<50: Should not be interpreted

Ranking score:

Higher scores indicate better overall model quality
Compares different model seeds
Score of -99 or lower indicates severe problems

Always inspect structures visually and validate predictions experimentally when possible.

What output files does AlphaFold 3 produce?

For each prediction:

CIF files: 3D structure coordinates (mmCIF format)
JSON files: Confidence scores and metadata
Ranking file: Comparison of different model seeds
Data pipeline outputs: MSAs, templates (if run)

See the Output Format documentation for details.

Can I use AlphaFold 3 for protein design?

AlphaFold 3 is primarily a structure prediction tool, not a design tool. However:Possible uses:

Validate designed sequences by predicting their structures
Screen multiple design candidates
Predict effects of mutations

Not designed for:

Inverse folding (sequence design from structure)
Optimizing sequences for specific functions

Consider tools specifically designed for protein design (e.g., ProteinMPNN, RFdiffusion).

Ligands and Small Molecules

How do I add a ligand to my prediction?

Ligands can be specified in two ways:1. Using SMILES:

{
  "ligand": {
    "id": ["L"],
    "smiles": "CC(C)CC1=CC=C(C=C1)C(C)C(=O)O"
  }
}

2. Using CCD codes:

{
  "ligand": {
    "id": ["L"],
    "ccdCodes": ["ATP"]
  }
}

See the Input Format documentation for complete examples.

What if my ligand isn't in the CCD?

For non-standard ligands:

Use SMILES notation: Define the molecule with a SMILES string
Include stereochemistry: Ensure stereochemistry is explicit in SMILES
Specify covalent bonds: Use bondedAtomPairs if needed

See the Community Tools page for tools like JAAG that help generate ligand inputs.

How do I model glycans?

Glycan modeling requires:

Correct CCD codes: Use SNFG nomenclature
Bonded atom pairs: Specify glycosidic bonds
Proper JSON structure: Follow the glycan format

See the paper Modeling glycans with AlphaFold 3 for:

Step-by-step tutorials
Ready-to-run scripts
Comprehensive CCD tables
Discussion of limitations

Tools like JAAG automate glycan input generation.

Troubleshooting

Why is my prediction quality poor?

Common causes:

Input errors: Check sequences and ligand definitions
Hardware issues: See Known Issues for GPU-specific problems
Inherent uncertainty: Some targets are genuinely difficult to predict
Insufficient data: Limited homologs in genetic databases
Complex interactions: Novel or unusual binding modes

See the Troubleshooting page for detailed solutions.

Where can I get help?

Support resources:

Documentation: Check the official documentation
Known Issues: Review the Known Issues page
GitHub: Search the issues tracker
Community: See Community Tools for community resources
Email: Contact [email protected]

When reporting issues, include:

Hardware configuration
Error messages
Input JSON
Steps to reproduce

Citation and Publication

How do I cite AlphaFold 3?

See the Citation page for the complete citation information.

Any publication using AlphaFold 3 source code, model parameters, or outputs must cite the AlphaFold 3 paper.

Can I share AlphaFold 3 predictions publicly?

Getting Started

Core Concepts

User Guides

Advanced Usage

Resources

Frequently Asked Questions

General Questions

Model Access and Installation

Running Predictions

Input and Output

Ligands and Small Molecules

Troubleshooting

Citation and Publication

Build docs developers (and LLMs) love

Getting Started

Core Concepts

User Guides

Advanced Usage

Resources

​General Questions

​Model Access and Installation

​Running Predictions

​Input and Output

​Ligands and Small Molecules

​Troubleshooting

​Citation and Publication

Build docs developers (and LLMs) love

General Questions

Model Access and Installation

Running Predictions

Input and Output

Ligands and Small Molecules

Troubleshooting

Citation and Publication