Entity Types
AlphaFold 3 supports four primary entity types in the sequences array:
- Protein: Amino acid sequences with modifications and templates
- RNA: RNA sequences with modifications and MSA
- DNA: DNA sequences with modifications
- Ligand: Small molecules via CCD codes or SMILES
Protein Entities
Proteins are the most feature-rich entity type, supporting MSA, templates, and modifications.
Basic Structure
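As a minimal sketch of a protein entry, the snippet below builds one in Python and serializes it to JSON. The sequence is made up, the phosphoserine modification (CCD code SEP) is only an illustration, and wrapping the entity under a "protein" key is an assumption based on the upstream AlphaFold 3 input format rather than something stated on this page.

```python
import json

# Illustrative protein entry for the sequences array (made-up sequence).
protein_entity = {
    "protein": {
        "id": "A",
        "sequence": "MKTAYSAKQR",
        "modifications": [
            # Phosphoserine (CCD code SEP) on the serine at 1-based position 6.
            {"ptmType": "SEP", "ptmPosition": 6},
        ],
    }
}

print(json.dumps(protein_entity, indent=2))
```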
Field Specifications
- id: Uppercase letter(s) giving a unique chain ID. Use an array for homomers, e.g. ["A", "B", "C"].
- sequence: Amino acid sequence using standard 1-letter codes.
- modifications: Post-translational modifications. Each entry has ptmType (CCD code) and ptmPosition (1-based).
- description: Optional textual description (version 4+).
- unpairedMsa: A3M-format MSA, or an empty string. Mutually exclusive with unpairedMsaPath.
- unpairedMsaPath: Path to an A3M MSA file (absolute, or relative to the input JSON).
- pairedMsa: A3M-format paired MSA. Using unpairedMsa instead is recommended.
- pairedMsaPath: Path to a paired MSA file.
- templates: Structural templates in mmCIF format with alignment mappings.
Homodimer Example
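A homodimer can be sketched by giving one protein entry an array of chain IDs, as described in the field list above. The sequence is a placeholder, and the "protein" wrapper key is assumed from the upstream input format.

```python
import json

# One sequence, two chain IDs: both chains share the same definition.
homodimer = {
    "protein": {
        "id": ["A", "B"],
        "sequence": "MKTAYSAKQR",  # placeholder sequence
    }
}

print(json.dumps(homodimer, indent=2))
```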
With Custom MSA
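The sketch below supplies an inline A3M alignment through unpairedMsa. The alignment is hand-written for illustration, and the helper first_a3m_sequence is hypothetical: it only checks that the first A3M record matches the query. Setting pairedMsa to an empty string and templates to an empty list alongside the custom MSA is an assumption made here to keep the entry fully explicit.

```python
import json

query = "MKTAYSAKQR"  # placeholder query sequence

# Tiny hand-written A3M alignment; the first record is the query itself.
unpaired_msa = (
    ">query\n"
    f"{query}\n"
    ">hit_1\n"
    "MKTAYTAKQR\n"
)


def first_a3m_sequence(a3m: str) -> str:
    """Return the sequence of the first record in an A3M string."""
    lines = [line.strip() for line in a3m.splitlines() if line.strip()]
    parts = []
    for line in lines[1:]:  # skip the first header line
        if line.startswith(">"):
            break
        parts.append(line)
    return "".join(parts)


if first_a3m_sequence(unpaired_msa) != query:
    raise ValueError("first A3M record should match the query sequence")

protein_with_msa = {
    "protein": {
        "id": "A",
        "sequence": query,
        "unpairedMsa": unpaired_msa,
        "pairedMsa": "",   # assumption: no paired MSA supplied
        "templates": [],   # assumption: no templates supplied
    }
}

print(json.dumps(protein_with_msa, indent=2))
```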
RNA Entities
RNA sequences support modifications and MSA.
Field Specifications
- id: Uppercase letter(s) for chain ID
- sequence: RNA sequence using only A, C, G, U
- modifications: Each has modificationType (CCD code) and basePosition (1-based)
- unpairedMsa: A3M format MSA
- unpairedMsaPath: Path to MSA file
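The RNA fields above can be sketched with a small builder that rejects characters outside A, C, G, U and out-of-range modification positions. The function make_rna_entity is a hypothetical helper, the "rna" wrapper key is assumed from the upstream input format, and the modification codes (2MG, 5MC) are used purely as examples.

```python
import json

RNA_ALPHABET = set("ACGU")


def make_rna_entity(chain_id, sequence, modifications=()):
    """Build an RNA entry after basic sanity checks (hypothetical helper)."""
    unknown = set(sequence) - RNA_ALPHABET
    if unknown:
        raise ValueError(f"invalid RNA characters: {sorted(unknown)}")
    for mod in modifications:
        pos = mod["basePosition"]  # 1-based
        if not 1 <= pos <= len(sequence):
            raise ValueError(f"basePosition {pos} is outside the sequence")
    return {
        "rna": {
            "id": chain_id,
            "sequence": sequence,
            "modifications": list(modifications),
        }
    }


rna_entity = make_rna_entity(
    "R",
    "AGCU",
    [
        {"modificationType": "2MG", "basePosition": 2},  # modified G
        {"modificationType": "5MC", "basePosition": 3},  # modified C
    ],
)
print(json.dumps(rna_entity, indent=2))
```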
DNA Entities
DNA sequences support modifications but not MSA or templates.
Field Specifications
- id: Uppercase letter(s) for chain ID
- sequence: DNA sequence using only A, C, G, T
- modifications: Each has modificationType (CCD code) and basePosition (1-based)
Ligand Entities
Ligands can be specified three ways:
- CCD Codes
- SMILES
- User CCD
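As a rough sketch of the first two options, the snippet below defines one ligand by CCD code and one by SMILES. The "ligand" wrapper key is assumed from the upstream input format, and both molecules are arbitrary examples. Building the entries in Python and serializing with json.dumps also takes care of escaping any backslash in the SMILES string. A user-defined CCD component would be referenced through the CCD code list in the same way as a standard one.

```python
import json

# Ligand given by a standard CCD code (ATP as an example).
ccd_ligand = {"ligand": {"id": "L", "ccdCodes": ["ATP"]}}

# Ligand given by SMILES. The Python literal "F/C=C\\F" is the SMILES
# string F/C=C\F, which contains a single backslash.
smiles_ligand = {"ligand": {"id": "S", "smiles": "F/C=C\\F"}}

for entry in (ccd_ligand, smiles_ligand):
    # json.dumps escapes the backslash so the output is valid JSON.
    print(json.dumps(entry))
```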
Field Specifications
- id: Uppercase letter(s) for ligand ID
- ccdCodes: List of CCD codes (standard or custom). Mutually exclusive with smiles.
- smiles: SMILES definition. Mutually exclusive with ccdCodes.
SMILES JSON Escaping
Backslashes in SMILES must be escaped in JSON. Use jq or Python to escape them properly.
Ions as Ligands
Ions are treated as ligands. For example, a magnesium ion:
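A minimal sketch of such an entry, assuming the same "ligand" wrapper key used in the upstream input format and an arbitrary chain ID:

```python
import json

# Magnesium ion expressed as a ligand with the CCD code MG.
magnesium = {"ligand": {"id": "M", "ccdCodes": ["MG"]}}

print(json.dumps(magnesium))
```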
Structural Templates
Templates are only supported for proteins.
- mmcif: Single-chain protein template in mmCIF format. Mutually exclusive with mmcifPath.
- mmcifPath: Path to an mmCIF file (may be gzip, xz, or zstd compressed).
- queryIndices: 0-based indices in the query sequence.
- templateIndices: 0-based indices in the template sequence (account for unresolved residues).
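The index mapping can be sketched as two parallel lists of equal length. In this made-up example the template has two unresolved residues, so query positions 3 and 4 map to template positions 5 and 6. The file path is hypothetical, and the field names queryIndices and templateIndices are assumed from the upstream input format.

```python
import json

# Query residue i aligns to template residue j (both 0-based).
query_indices = [0, 1, 2, 3, 4]
template_indices = [0, 1, 2, 5, 6]  # template positions 3 and 4 are skipped

if len(query_indices) != len(template_indices):
    raise ValueError("index lists must have the same length")

template = {
    "mmcifPath": "templates/example.cif.gz",  # hypothetical path
    "queryIndices": query_indices,
    "templateIndices": template_indices,
}

protein_with_template = {
    "protein": {
        "id": "A",
        "sequence": "MKTAY",  # placeholder sequence
        "templates": [template],
    }
}

print(json.dumps(protein_with_template, indent=2))
```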
Covalent Bonds
Define bonds between or within entities using bondedAtomPairs. Each atom in a pair is addressed as:
[entityId, residueId, atomName]
- Entity ID: Chain ID from the id field
- Residue ID: 1-based position within the chain
- Atom name: From the CCD definition
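The addressing scheme above can be sketched with a small helper that builds each atom triple and rejects non-positive residue IDs. The helper atom is hypothetical, and the ligand atom name C1 is a placeholder, since real names come from the ligand's CCD definition. Placing bondedAtomPairs at the top level of the input JSON is an assumption based on the upstream format.

```python
import json


def atom(entity_id: str, residue_id: int, atom_name: str) -> list:
    """Return [entityId, residueId, atomName] with a 1-based residue check."""
    if residue_id < 1:
        raise ValueError("residue IDs are 1-based")
    return [entity_id, residue_id, atom_name]


# One bond: the SG atom of residue 3 in chain A (assumed to be a cysteine)
# to atom C1 (placeholder name) of the single-residue ligand L.
bonded_atom_pairs = [
    [atom("A", 3, "SG"), atom("L", 1, "C1")],
]

print(json.dumps({"bondedAtomPairs": bonded_atom_pairs}))
```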
Glycan Example
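One way a small N-linked glycan might be written is sketched below: a two-sugar ligand (two NAG components) attached to an asparagine side chain. The protein sequence is made up, and the sketch assumes that a multi-component ligand is listed as several CCD codes under one ID, with residue IDs counting the components in order.

```python
import json

glycan_input = {
    "sequences": [
        # Placeholder protein with an asparagine at 1-based position 2.
        {"protein": {"id": "A", "sequence": "ANGTQ"}},
        # Two-residue glycan: NAG (residue 1) followed by NAG (residue 2).
        {"ligand": {"id": "G", "ccdCodes": ["NAG", "NAG"]}},
    ],
    "bondedAtomPairs": [
        # Asparagine side-chain nitrogen to C1 of the first sugar.
        [["A", 2, "ND2"], ["G", 1, "C1"]],
        # O4 of the first sugar to C1 of the second sugar.
        [["G", 1, "O4"], ["G", 2, "C1"]],
    ],
}

print(json.dumps(glycan_input, indent=2))
```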
Complete Example
Here’s a comprehensive input demonstrating multiple entity types:
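A combined input covering protein, RNA, DNA, ligand, and ion entities could be sketched as follows. All sequences are placeholders. The top-level fields (name, modelSeeds, dialect, version) and the entity wrapper keys are assumptions taken from the upstream AlphaFold 3 input format, and chain_ids is a hypothetical helper used only to check that chain IDs are unique.

```python
import json

complete_input = {
    "name": "example_complex",  # hypothetical job name
    "modelSeeds": [1],
    "sequences": [
        {
            "protein": {
                "id": ["A", "B"],  # homodimer
                "sequence": "MKTAYSAKQR",
                "description": "placeholder protein",  # version 4+
                "modifications": [{"ptmType": "SEP", "ptmPosition": 6}],
            }
        },
        {"rna": {"id": "R", "sequence": "AGCU"}},
        {"dna": {"id": "D", "sequence": "GATTACA"}},
        {"ligand": {"id": "L", "ccdCodes": ["ATP"]}},
        {"ligand": {"id": "M", "ccdCodes": ["MG"]}},  # ion as a ligand
    ],
    "dialect": "alphafold3",
    "version": 4,
}


def chain_ids(entry: dict) -> list:
    """Return the chain IDs of one sequences entry as a list."""
    (entity,) = entry.values()  # each entry wraps exactly one entity
    ids = entity["id"]
    return [ids] if isinstance(ids, str) else list(ids)


all_ids = [cid for entry in complete_input["sequences"] for cid in chain_ids(entry)]
if len(all_ids) != len(set(all_ids)):
    raise ValueError("chain IDs must be unique across all entities")

print(json.dumps(complete_input, indent=2))
```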