Compilation Stages

Overview

The MCC compiler pipeline can be stopped at any intermediate stage using stage-specific flags. This is useful for:

Debugging compiler issues
Inspecting intermediate representations
Testing individual pipeline stages
Understanding the compilation process

Stage Flags

All stage flags are mutually exclusive - you can only specify one at a time.

—lex

mcc --lex input.c

Stop after lexing (tokenization). The compiler will:

Preprocess the input file
Tokenize the source into a stream of tokens
Stop and exit

Use cases:

Debugging lexer issues
Verifying token recognition
Testing preprocessor integration

Example:

$ mcc --lex hello.c
# Exits after tokenization
# No output files produced

—parse

mcc --parse input.c

Stop after parsing. The compiler will:

Preprocess and tokenize
Parse tokens into an Abstract Syntax Tree (AST)
Stop and exit

Use cases:

Debugging parser issues
Verifying syntax correctness
Inspecting AST structure (with tracing enabled)

Example:

$ mcc --parse hello.c
# Exits after parsing
# AST is built but not lowered

With tracing:

$ RUST_LOG=mcc_syntax=trace mcc --parse hello.c
# Shows detailed AST construction

—tacky

mcc --tacky input.c

Stop after lowering to TACKY (Three Address Code). The compiler will:

Preprocess, tokenize, and parse
Perform semantic analysis and type checking
Lower the AST to Three Address Code intermediate representation
Stop and exit

Use cases:

Debugging semantic analysis
Inspecting the intermediate representation
Verifying lowering correctness
Testing optimization passes (if implemented)

Example:

$ mcc --tacky factorial.c
# Exits after TACKY generation
# IR is generated but not converted to assembly

What is TACKY? TACKY (Three Address Code) is an intermediate representation where each instruction has at most three operands. It’s easier to optimize and analyze than the AST. Example transformation:

// Source C code
int x = a + b * c;

// TACKY IR (conceptual)
t1 = b * c
t2 = a + t1
x = t2

—codegen

mcc --codegen input.c

Stop after code generation. The compiler will:

Complete all stages up through TACKY
Generate assembly instructions from TACKY IR
Stop before rendering assembly text
Exit

Use cases:

Debugging code generation
Inspecting assembly instruction structure
Testing register allocation
Verifying instruction selection

Example:

$ mcc --codegen fibonacci.c
# Exits after assembly generation
# Assembly instructions created but not rendered to text

Note: This stops after generating the internal assembly representation but before converting it to textual assembly code. To see the assembly text, use the -S flag instead (see Assembly Output).

Assembly Output

-S

mcc -S input.c

Generate assembly file and keep it after compilation. Unlike the stage flags, -S runs the full pipeline but preserves the assembly file. Behavior:

Runs complete compilation pipeline
Generates assembly text file (.s extension)
Keeps the assembly file instead of deleting it
Continues to assemble and link to produce executable

Output: Creates input.s alongside the executable Example:

$ mcc -S hello.c
$ ls
hello  hello.c  hello.s

$ cat hello.s
	.text
	.globl main
main:
	pushq	%rbp
	movq	%rsp, %rbp
	movl	$42, %eax
	popq	%rbp
	ret

With custom output:

mcc -S -o myprogram hello.c
# Creates: myprogram (executable) and myprogram.s (assembly)

Combining Stage Flags with Other Options

Stage flags can be combined with other options:

With Output Specification

mcc --parse -o parsed_output main.c

The output flag is accepted but may not produce a file since parsing doesn’t generate output artifacts.

With Color Control

mcc --codegen --color always program.c

Useful for capturing colorized diagnostics from intermediate stages.

With Target Specification

mcc --codegen --target x86_64-unknown-linux-gnu main.c

The target affects code generation, so specifying it with --codegen is meaningful.

Debugging with Stages

Narrow Down Compilation Errors

If compilation fails, use stage flags to identify which stage is problematic:

# Check if parsing succeeds
mcc --parse problem.c

# Check if lowering succeeds
mcc --tacky problem.c

# Check if codegen succeeds
mcc --codegen problem.c

Inspect Intermediate Output

Combine stage flags with tracing to see internal state:

# See detailed parsing output
RUST_LOG=mcc_syntax=debug mcc --parse input.c

# See lowering details
RUST_LOG=mcc=debug mcc --tacky input.c

# See codegen details
RUST_LOG=mcc=debug mcc --codegen input.c

Compare Assembly Output

# Generate assembly with MCC
mcc -S myprogram.c

# Compare with GCC output
gcc -S -o myprogram.gcc.s myprogram.c

# Diff the assembly
diff myprogram.s myprogram.gcc.s

Stage Progression

The complete stage pipeline:

Input (.c file)
    ↓
[Preprocessing]  → Preprocessed source
    ↓
[Lexing]         → Token stream        (--lex stops here)
    ↓
[Parsing]        → Abstract Syntax Tree (--parse stops here)
    ↓
[Type Checking]  → Validated AST
    ↓
[Lowering]       → TACKY IR            (--tacky stops here)
    ↓
[Code Gen]       → Assembly IR         (--codegen stops here)
    ↓
[Rendering]      → Assembly text (.s)  (-S saves this)
    ↓
[Assembling]     → Object file (.o)
    ↓
[Linking]        → Executable

Exit Behavior

When using stage flags:

Success: Exit code 0 if the stage completes without errors
Failure: Non-zero exit code with diagnostics if the stage fails
No output files: Stage flags typically don’t produce output files (except -S)

Basic Compilation - Standard compilation workflow
Command Options - Complete flag reference

Get Started

Core Concepts

Command Reference

Compilation Stages

API Reference

Testing

Development

Compilation Stages

Overview

Stage Flags

—lex

—parse

—tacky

—codegen

Assembly Output

-S

Combining Stage Flags with Other Options

With Output Specification

With Color Control

With Target Specification

Debugging with Stages

Narrow Down Compilation Errors

Inspect Intermediate Output

Compare Assembly Output

Stage Progression

Exit Behavior

Build docs developers (and LLMs) love

Get Started

Core Concepts

Command Reference

Compilation Stages

API Reference

Testing

Development

​Overview

​Stage Flags

​—lex

​—parse

​—tacky

​—codegen

​Assembly Output

​-S

​Combining Stage Flags with Other Options

​With Output Specification

​With Color Control

​With Target Specification

​Debugging with Stages

​Narrow Down Compilation Errors

​Inspect Intermediate Output

​Compare Assembly Output

​Stage Progression

​Exit Behavior

​Related Commands

Build docs developers (and LLMs) love

Overview

Stage Flags

—lex

—parse

—tacky

—codegen

Assembly Output

-S

Combining Stage Flags with Other Options

With Output Specification

With Color Control

With Target Specification

Debugging with Stages

Narrow Down Compilation Errors

Inspect Intermediate Output

Compare Assembly Output

Stage Progression

Exit Behavior

Related Commands