
Overview

Optimization configuration controls Heretic’s automatic parameter search process. Heretic uses Optuna with a TPE (Tree-structured Parzen Estimator) sampler to co-minimize refusal count and KL divergence from the original model.
Heretic’s optimization is fully automatic. You typically don’t need to change these settings, but they’re available for fine-tuning performance or experimenting with the abliteration process.

Optimization Trials

Heretic runs multiple abliteration trials to find optimal parameters. Each trial tests a different combination of ablation weights and directions.

n_trials

The total number of optimization trials to run.
# Default: 200 trials
n_trials = 200

# Quick test (may not find optimal parameters)
n_trials = 50

# Extensive search (slower but may find better results)
n_trials = 400
Typical values:
  • 50-100: Quick experiments, testing configuration changes
  • 200 (default): Good balance of quality and speed
  • 300-400: Extensive search for challenging models
More trials take longer but may find better abliteration parameters. The default of 200 trials works well for most models.

n_startup_trials

The number of initial trials that use random sampling for exploration, before the TPE optimizer takes over.
# Default: 60 startup trials
n_startup_trials = 60

# Less exploration
n_startup_trials = 30

# More exploration
n_startup_trials = 100
Relationship to n_trials:
  • Startup trials should be approximately 25-30% of total trials
  • After startup trials complete, TPE uses the results to guide further exploration
  • Default: 60 startup trials out of 200 total (30%)
The startup phase explores the parameter space randomly. After that, Optuna’s TPE sampler uses Bayesian optimization to intelligently search for better parameters.
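Following the 25-30% guideline above, the two settings are best scaled together (the numbers here are illustrative, not a recommended preset):

```toml
# Extensive search, keeping the exploration ratio at ~30%
n_trials = 400
n_startup_trials = 120
```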

Batch Processing

Batch size controls how many input sequences are processed in parallel. Larger batches are faster but require more VRAM.

batch_size

Number of input sequences to process simultaneously.
# Automatic batch size detection (recommended)
batch_size = 0

# Manual batch size
batch_size = 32
Values:
  • 0 (default): Automatic detection - Heretic benchmarks your system at startup to find the optimal batch size
  • 1-128: Manual batch size - Use when you know your hardware’s limits
Leave this at 0 unless you’re experiencing memory issues. Heretic’s automatic detection usually finds the optimal value.

max_batch_size

The maximum batch size to try when automatically determining the optimal batch size.
# Default maximum
max_batch_size = 128

# Conservative limit (less VRAM available)
max_batch_size = 64

# Aggressive limit (large VRAM, small model)
max_batch_size = 256
When to adjust:
  • Decrease if you have limited VRAM or run into OOM errors during benchmarking
  • Increase if you have high VRAM and want to process larger batches for speed
Setting max_batch_size too high can cause out-of-memory errors during the initial benchmark phase.
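The automatic search can be pictured as growing the batch size until the hardware limit or max_batch_size is hit. A minimal sketch of that idea (this is an illustration, not Heretic's actual benchmarking code):

```python
# Illustrative sketch: double the candidate batch size until it no longer
# fits in memory or exceeds the configured cap, and keep the last size
# that worked.
def find_batch_size(fits, max_batch_size=128):
    """`fits(n)` reports whether a batch of n sequences fits in VRAM."""
    best = 1
    size = 1
    while size <= max_batch_size:
        if not fits(size):
            break
        best = size
        size *= 2
    return best

# Simulate a GPU that can hold at most 96 sequences at once.
print(find_batch_size(lambda n: n <= 96))  # 64
```

With a cap of 128, the search stops at 64: the next candidate (128) would exceed the simulated memory limit. Lowering max_batch_size simply truncates this search earlier, which is why a cap that is too high can trigger OOM errors during the benchmark.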

KL Divergence Parameters

KL divergence measures how much the abliterated model differs from the original model. Heretic balances minimizing refusals with minimizing KL divergence to preserve the model’s capabilities.

kl_divergence_scale

Assumed “typical” value of KL divergence for abliterated models. Used to ensure balanced co-optimization of KL divergence and refusal count.
# Default scale
kl_divergence_scale = 1.0

# More weight on preserving model quality
kl_divergence_scale = 0.5

# More weight on reducing refusals
kl_divergence_scale = 2.0
How it works:
  • This value normalizes KL divergence to be comparable to refusal count in the optimization objective
  • Higher values make the optimizer prioritize reducing refusals over preserving model behavior
  • Lower values make the optimizer prioritize preserving the original model’s behavior
The default value of 1.0 typically provides a good balance. You rarely need to change this unless you’re specifically trying to bias the optimization toward quality preservation or refusal reduction.

kl_divergence_target

The KL divergence threshold below which the optimizer focuses on refusal count instead of KL divergence.
# Default target
kl_divergence_target = 0.01

# Stricter quality preservation
kl_divergence_target = 0.005

# More lenient (faster to reach)
kl_divergence_target = 0.02
Purpose: Prevents the optimizer from exploring parameter combinations that barely change the model (“do nothing” abliterations). Below this threshold, the objective switches to primarily minimizing refusals.
Typical values:
  • 0.005-0.01: Strict quality preservation
  • 0.01 (default): Good balance
  • 0.02-0.05: More aggressive abliteration
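As a rough illustration, kl_divergence_scale and kl_divergence_target can be read as shaping a single trial score. The sketch below is a hypothetical reading of the descriptions above; Heretic's actual objective may combine the terms differently:

```python
# Hypothetical sketch of how the two KL settings could combine into one
# optimization score (not Heretic's actual objective function).
def trial_score(refusals, kl, scale=1.0, target=0.01):
    # Below the target, KL is treated as "good enough": only refusals count.
    if kl < target:
        return float(refusals)
    # Otherwise, KL is divided by the assumed typical value so that it is
    # comparable in magnitude to the refusal count.
    return refusals + kl / scale

# A trial that barely changes the model is scored on refusals alone...
print(trial_score(refusals=40, kl=0.005))  # 40.0
# ...while a heavier ablation pays a normalized KL penalty.
print(trial_score(refusals=5, kl=0.8))     # 5.8
```

Under this reading, raising the scale shrinks the KL penalty (favoring refusal reduction) and lowering the target forces trials to preserve the original model more closely before KL stops counting, which matches the guidance given for each setting.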

Advanced Abliteration Parameters

These options control how the abliteration process modifies the model’s weight matrices.

orthogonalize_direction

Whether to orthogonalize refusal directions relative to the “good” direction before ablation.
# Standard ablation (default)
orthogonalize_direction = false

# Orthogonalized ablation
orthogonalize_direction = true
What it does:
  • When enabled, Heretic adjusts the refusal directions so that only the component orthogonal to the “good” direction is subtracted
  • This can help preserve beneficial model behaviors while removing refusals
When to enable:
  • If standard abliteration degrades model quality too much
  • When you want to be more conservative with modifications
  • For models where “good” and “bad” directions are not well-separated
This is an advanced feature. The default (false) works well for most models. Enable this if you’re experiencing significant quality degradation.
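The orthogonalization step amounts to standard vector projection: subtract from the refusal direction its component along the “good” direction. A minimal sketch using the textbook projection formula (Heretic's implementation details may differ):

```python
# Project the refusal direction onto the subspace orthogonal to the
# "good" direction, so ablation leaves the good direction untouched.
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def orthogonalize(refusal, good):
    """Remove from `refusal` its component along `good`."""
    scale = dot(refusal, good) / dot(good, good)
    return [r - scale * g for r, g in zip(refusal, good)]

r = [1.0, 2.0, 3.0]
g = [0.0, 1.0, 0.0]
r_orth = orthogonalize(r, g)
print(r_orth)  # [1.0, 0.0, 3.0] — no remaining component along g
```

Ablating with r_orth instead of r means the model's behavior along the good direction is, to first order, unaffected.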

row_normalization

How to apply row normalization to weight matrices during ablation.
# No normalization (default)
row_normalization = "none"

# Pre-normalization
row_normalization = "pre"

# Full normalization with magnitude preservation
row_normalization = "full"
Options:
  • "none" (default): No row normalization. Abliteration is applied directly to the weight matrices. Best for most models and standard use cases.
  • "pre": Rows of the weight matrices are normalized before the ablation is applied.
  • "full": Rows are normalized and their original magnitudes are preserved via a low-rank (LoRA) correction (see full_normalization_lora_rank below).
Row normalization options other than "none" are experimental. They may improve results for some models but can also degrade quality. Test carefully.

full_normalization_lora_rank

The rank of the LoRA adapter when using row_normalization = "full".
# Only relevant when row_normalization = "full"
row_normalization = "full"
full_normalization_lora_rank = 3

# Higher rank (more accurate, larger files)
full_normalization_lora_rank = 8

# Lower rank (smaller files, less accurate)
full_normalization_lora_rank = 1
Trade-offs:
  • Higher rank: More accurate preservation of row magnitudes, but larger output files and slower evaluation
  • Lower rank: Smaller files and faster evaluation, but less accurate approximation
  • Default (3): Good balance for most use cases
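For intuition on the file-size trade-off: by standard LoRA arithmetic, a rank-r adapter on an m × n matrix stores two factors of shapes (m, r) and (r, n), i.e. r·(m + n) values, versus m·n for the full matrix. A quick illustration (the 4096 × 4096 shape is an arbitrary example):

```python
# Standard LoRA parameter count: a rank-r adapter on an m x n matrix
# stores r * (m + n) values, versus m * n for the full matrix.
def lora_params(m, n, rank):
    return rank * (m + n)

m, n = 4096, 4096
full = m * n  # 16,777,216 values for the full matrix
for rank in (1, 3, 8):
    adapter = lora_params(m, n, rank)
    print(f"rank {rank}: {adapter} values ({adapter / full:.3%} of full)")
```

Per matrix the adapter is tiny even at rank 8, but the correction is applied across many matrices, so higher ranks still add up in output size and evaluation time.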

Study Checkpoints

Heretic automatically saves optimization progress to disk, allowing you to resume interrupted runs.

study_checkpoint_dir

Directory where optimization study progress is saved.
# Default checkpoint directory
study_checkpoint_dir = "checkpoints"

# Custom location
study_checkpoint_dir = "/path/to/my/checkpoints"
What’s saved:
  • Trial history and results
  • Best parameters found so far
  • Optimization state (for resuming)
If Heretic is interrupted, it will automatically resume from the last checkpoint when you run it again with the same model and configuration.

Complete Example Configurations

# Fast optimization for testing
n_trials = 50
n_startup_trials = 15

# Automatic batch sizing
batch_size = 0
max_batch_size = 64

# Standard KL divergence settings
kl_divergence_scale = 1.0
kl_divergence_target = 0.01

# No advanced features
orthogonalize_direction = false
row_normalization = "none"

Performance Tips

To speed up optimization:
  1. Reduce n_trials (try 100 instead of 200)
  2. Increase max_batch_size if you have VRAM headroom
  3. Use quantization: quantization = "bnb_4bit"
  4. Reduce evaluation dataset sizes (see Evaluation Configuration)
To improve result quality:
  1. Increase n_trials (try 300-400)
  2. Lower kl_divergence_scale (try 0.5)
  3. Lower kl_divergence_target (try 0.005)
  4. Enable orthogonalize_direction = true
  5. Use larger evaluation datasets
If you run out of memory:
  1. Lower max_batch_size (try 32 or 64)
  2. Set a manual batch_size at a safe value
  3. Enable quantization in the model loading config
  4. Reduce max_response_length (see Evaluation Configuration)
Troubleshooting

Problem: All trials produce similar results
  • Increase n_startup_trials for more exploration
  • Check if your refusal markers are appropriate
Problem: High KL divergence but still many refusals
  • Increase kl_divergence_scale to allow more aggressive ablation
  • Try orthogonalize_direction = true
Problem: Low refusals but model quality degraded
  • Decrease kl_divergence_scale to prioritize quality
  • Lower kl_divergence_target to require better preservation
  • Try row_normalization = "full"

