Bridge the gap between research hypotheses and table enrichment. Define WHAT to research about each company before running enrichment.

Trigger Phrases

“data points”, “enrichment columns”, “column design”, “what to research”, “data point builder”, “build columns”, “segmentation columns”, “personalization columns”

When to Use

  • After market-research has produced a hypothesis set
  • Before list-enrichment: this skill designs the columns; that skill runs them
  • When the user says “what should we research about these companies?”

Two Modes

Mode 1: Segmentation

Goal: Design columns that score or confirm hypothesis fit per company.
Input: Hypothesis set (from market-research or context file)
Process:
  1. Read the hypothesis set
  2. For each hypothesis, propose 1-2 columns that would confirm or deny fit
  3. Discuss with user - refine, add, remove
  4. Output final column_configs
Example: If hypothesis is “Database blind spot - 80-90% of targets invisible to standard tools”:
  • Column: “Data Infrastructure Maturity” (select: [“No CRM”, “Basic CRM”, “Full stack”])
  • Column: “Digital Footprint Score” (grade: 1-5)
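As a hypothetical sketch, the “Digital Footprint Score” column could be expressed in the config format defined below (the key name and prompt wording are illustrative, not prescribed):

```json
{
  "kind": "agent",
  "name": "Digital Footprint Score",
  "key": "digital_footprint_score",
  "value": {
    "agent_type": "research_pro",
    "prompt": "Assess the digital footprint of {input}: website depth, blog/news activity, review-site presence, social channels. Return a grade 1-5 where 1 = minimal footprint and 5 = extensive. If the domain is unreachable, return 'N/A'.",
    "output_format": "grade"
  }
}
```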

Mode 2: Personalization

Goal: Design columns that capture company-specific hooks for email personalization.
Input: Target list + what the user wants to personalize on
Process:
  1. Ask what hooks matter for this campaign (leadership quotes, recent launches, hiring signals, tech stack, etc.)
  2. Propose 2-4 columns with prompts
  3. Discuss with user - refine
  4. Output final column_configs
Example: For personalization hooks:
  • Column: “Recent Product Launch” (text: describe any product launched in last 6 months)
  • Column: “Leadership Public Statement” (text: find a public quote from CEO/CTO about [topic])
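The “Leadership Public Statement” column might look like this in config form (key name is illustrative; keep the [topic] placeholder until the user supplies the campaign topic):

```json
{
  "kind": "agent",
  "name": "Leadership Public Statement",
  "key": "leadership_quote",
  "value": {
    "agent_type": "research_pro",
    "prompt": "Find a public quote from the CEO or CTO of {input} about [topic]. Include the quote, who said it, and where it was published. If none found, return 'N/A'.",
    "output_format": "text"
  }
}
```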

Interactive Column Design

Do NOT just generate columns silently. Walk through this with the user.
1. Present the framework

Show the user the two modes and ask which applies (or both).
2. Propose initial columns

Based on hypotheses or user input, propose 3-5 columns. For each, show:
Column: [name]
Type: [output_format]
Agent: [research_pro | llm]
Prompt: [the actual prompt text]
Why: [what this tells us for segmentation/personalization]
3. Refine together

Ask:
  • “Any columns to add?”
  • “Any to remove or merge?”
  • “Should any prompts be more specific?”
4. Confirm column budget

Guidance:
  • 3-5 columns is the sweet spot
  • 6-7 is acceptable if each serves a clear purpose
  • 8+ adds noise - push back and suggest merging
5. Output column_configs

Generate the final column configs as a JSON array ready for list-enrichment.

Column Config Format

[
  {
    "kind": "agent",
    "name": "Column Display Name",
    "key": "column_key_snake_case",
    "value": {
      "agent_type": "research_pro",
      "prompt": "Research prompt using {input} for domain...",
      "output_format": "text"
    }
  }
]
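Before handing configs to list-enrichment, a quick sanity check can catch the most common mistakes (missing {input}, non-snake_case keys, unknown agent types or formats). This is a minimal sketch, not part of the skill itself; the allowed values are taken from the tables below, and everything else is an assumption:

```python
# Sanity-check a column_configs array before the list-enrichment handoff.
# Allowed agent types and output formats mirror the guideline tables.
ALLOWED_AGENTS = {"research_pro", "llm", "research_reasoning", "linkedin"}
ALLOWED_FORMATS = {"text", "grade", "select", "multiselect", "json"}

def validate_column_configs(configs):
    """Return a list of human-readable problems (empty list = OK)."""
    problems = []
    seen_keys = set()
    for i, col in enumerate(configs):
        where = f"column {i} ({col.get('name', '?')})"
        if col.get("kind") != "agent":
            problems.append(f"{where}: kind must be 'agent'")
        key = col.get("key", "")
        if not key or key != key.lower() or " " in key:
            problems.append(f"{where}: key should be snake_case")
        if key in seen_keys:
            problems.append(f"{where}: duplicate key '{key}'")
        seen_keys.add(key)
        value = col.get("value", {})
        if value.get("agent_type") not in ALLOWED_AGENTS:
            problems.append(f"{where}: unknown agent_type")
        if value.get("output_format") not in ALLOWED_FORMATS:
            problems.append(f"{where}: unknown output_format")
        if "{input}" not in value.get("prompt", ""):
            problems.append(f"{where}: prompt is missing {{input}}")
    return problems
```

Run it on the final array and fix anything it flags before enrichment.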

Column Design Guidelines

Agent Type Selection

| Data point type | Agent type | Why |
| --- | --- | --- |
| Factual data from the web (funding, launches, news) | research_pro | Needs web research |
| Classification from company profile | llm | Profile data is enough |
| Nuanced judgment (maturity, fit score) | research_reasoning | Needs chain-of-thought |
| People/org structure | linkedin | LinkedIn-specific |

Output Format Selection

| Data point type | Format | When |
| --- | --- | --- |
| Free-form research | text | Open-ended questions |
| Score/rating | grade | 1-5 scale assessments |
| Category | select | Mutually exclusive buckets |
| Multiple tags | multiselect | Non-exclusive tags |
| Structured data | json | Multiple related fields |
| Yes/no with evidence | json | {"match": bool, "evidence": str} |
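For the yes/no-with-evidence pattern, a hypothetical column (names and prompt are illustrative) could embed the expected JSON shape directly in the prompt:

```json
{
  "kind": "agent",
  "name": "Hiring Signal Match",
  "key": "hiring_signal",
  "value": {
    "agent_type": "research_pro",
    "prompt": "Is {input} actively hiring for data or analytics roles? Return JSON: {\"match\": true|false, \"evidence\": \"one-sentence justification or 'none found'\"}.",
    "output_format": "json"
  }
}
```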

Prompt Writing Tips

  • Always include {input} for the company domain
  • Be specific about output format in the prompt itself
  • Include fallback: “If not found, return N/A” or “If unclear, return ‘Unknown’”
  • For select/multiselect: list the labels in the prompt too
  • For hypothesis scoring: reference the specific hypothesis in the prompt
  • Keep prompts under 200 words
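Applying all of these tips to the “Data Infrastructure Maturity” column from Mode 1 (agent_type llm follows the agent-type table, since profile data suffices for classification; the exact wording is a sketch):

```json
{
  "kind": "agent",
  "name": "Data Infrastructure Maturity",
  "key": "data_infra_maturity",
  "value": {
    "agent_type": "llm",
    "prompt": "Based on the company profile for {input}, classify its data infrastructure maturity. Return exactly one of: 'No CRM', 'Basic CRM', 'Full stack'. If the profile is not informative enough, return 'Unknown'.",
    "output_format": "select"
  }
}
```

Note how the prompt includes {input}, states the output format, lists the select labels verbatim, and defines a fallback.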

Example Columns

{
  "kind": "agent",
  "name": "Hypothesis 1 Fit",
  "key": "h1_fit",
  "value": {
    "agent_type": "research_reasoning",
    "prompt": "Grade how well {input} matches this hypothesis: 'Database blind spot - relies on manual outreach to non-indexed partners'. Look for: (1) fragmented supplier/partner base, (2) limited online presence of targets, (3) reliance on events/intros. Return 1-5 where 5 = perfect fit.",
    "output_format": "grade"
  }
}

{
  "kind": "agent",
  "name": "Recent Product Launch",
  "key": "recent_launch",
  "value": {
    "agent_type": "research_pro",
    "prompt": "Find any product or feature launched by {input} in the last 6 months. Include: product name, launch date, brief description. If none found, return 'N/A'.",
    "output_format": "text"
  }
}

Output Handoff

After column design is complete:
  1. Present the final column_configs JSON to the user
  2. Tell the user: “These configs are ready for list-enrichment. Run that skill with your table ID and these columns.”
  3. If the user wants to run immediately, hand off to list-enrichment workflow
