Visualization - GraphRAG

GraphRAG generates GraphML files that can be visualized using graph visualization tools like Gephi. This guide walks you through the complete process of creating beautiful, insightful visualizations of your knowledge graph.

Overview

Visualizing your knowledge graph helps you:

Understand structure: See how entities and relationships are organized
Identify clusters: Discover communities and themes in your data
Debug issues: Spot problems with entity extraction or relationships
Communicate insights: Share visual representations with stakeholders

Prerequisites

Before visualizing your graph, you need:

A completed GraphRAG index with GraphML snapshots enabled
Gephi installed on your system
The Leiden Algorithm plugin for Gephi

Enable GraphML snapshots

GraphML snapshots must be enabled during indexing to generate visualization files.

Configure settings.yaml

Open your settings.yaml and ensure GraphML snapshots are enabled:

settings.yaml

snapshots:
  graphml: true

This setting is enabled by default in newly initialized projects.

Run the index

graphrag index --root ./my-project

This generates a graph.graphml file in your output directory.

Locate the GraphML file

After indexing completes, find the file at:

./my-project/output/graph.graphml

If you’ve already run indexing without GraphML enabled, you’ll need to re-run the index with the setting enabled.

Visualization workflow

Follow these steps to create a professional graph visualization:

1. Import into Gephi

Launch Gephi

Open Gephi on your system.

Import the GraphML file

Go to File → Open
Navigate to your output folder
Select graph.graphml
Click Open

You’ll see a basic visualization of your graph nodes and edges.

2. Install Leiden Algorithm plugin

The Leiden algorithm detects communities in your graph, which is essential for meaningful visualization.

Open Plugin Manager

Go to Tools → Plugins

Find Leiden Algorithm

Click the Available Plugins tab
Search for “Leiden Algorithm”
Check the box next to it
Click Install

Restart Gephi

Restart Gephi after installation completes

3. Run statistics

Generate statistics that will help you visualize the graph structure.

Calculate Average Degree

In the Statistics panel (right side):

Find Average Degree
Click Run
Click Close on the report dialog

Run Leiden Algorithm

In the Statistics panel:

Find Leiden Algorithm
Click Run
Configure settings:
- Quality function: Modularity
- Resolution: 1.0
Click OK
Close the report when complete

The Leiden algorithm identifies communities (clusters) in your graph. Higher resolution values create more, smaller communities.

4. Color nodes by cluster

Color-code nodes based on their community membership.

Open Appearance panel

Find the Appearance panel in the upper left

Configure node colors

Click the Nodes tab
Click Partition (not Ranking)
Click the color palette icon in the upper right
Select Cluster from the dropdown

Generate color palette

Click Palette…
Click Generate…
Uncheck Limit number of colors
Click Generate
Click OK

Apply colors

Click Apply to color your graph

Your graph should now show different colored clusters representing communities in your knowledge graph.

5. Resize nodes by degree centrality

Make important nodes (with many connections) larger.

Select ranking mode

In the Appearance panel:

Ensure Nodes is selected
Click Ranking (not Partition)
Click the sizing icon (three circles of different sizes)

Configure node sizes

Select Degree from the dropdown
Set Min size: 10
Set Max size: 150

Apply sizing

Click Apply

Nodes with more connections will now appear larger, making hubs visually prominent.

6. Layout the graph

Arrange nodes spatially to reveal structure.

Step 1: OpenORD layout

Select OpenORD

In the Layout panel (lower left), select OpenORD

Configure settings

Set the following stage iterations:

Liquid: 50
Expansion: 50
Cooldown: 0
Crunch: 0
Simmer: 0

Run layout

Click Run
Watch the progress bar
Click Stop when complete

OpenORD does initial positioning. The graph may still look messy - that’s expected.

Step 2: ForceAtlas2 layout

Select ForceAtlas2

In the Layout panel, select Force Atlas 2

Configure settings

Adjust the following settings:

Scaling: 15
Dissuade Hubs: ✓ checked
LinLog mode: ✗ unchecked
Prevent Overlap: ✓ checked

Run layout

Click Run
Watch as nodes settle into position
Click Stop when nodes stop moving significantly

This may take several minutes for large graphs. Be patient!

Your graph should now have a clean, organized layout with distinct communities.

7. Add labels (optional)

Display entity names on the visualization.

Show labels

In the bottom toolbar of the graph view, click the Show node labels button (“T” icon)

Configure label appearance

Click the label settings button
Adjust:
- Font size: Based on your preference
- Show labels: For visible nodes only
- Label color: Black or contrasting color

Resize labels (optional)

You can also size labels by node degree for better readability

For large graphs, consider showing labels only for important nodes (high degree) to avoid clutter.

Understanding your visualization

Graph elements

Nodes

Represent entities extracted from your documents (people, organizations, concepts)

Edges

Represent relationships between entities

Colors

Indicate communities - groups of closely related entities

Size

Indicates centrality - how many connections an entity has

Interpreting patterns

Dense clusters: Topics or themes with many interconnected entities Bridge nodes: Entities connecting different communities (often important cross-cutting concepts) Peripheral nodes: Mentioned infrequently or in isolation Star patterns: Central entities with many direct connections (key people, organizations, or concepts)

Export as image

Open Preview

Click the Preview tab at the top of Gephi

Configure preview

Adjust settings for best appearance:

Preset: Default
Background color: White
Show labels: As desired

Export

Click Export: SVG/PDF/PNG
Choose format (PNG for presentations, SVG for editing)
Set resolution (high for publications)
Save file

Export interactive version

Gephi can export interactive web visualizations:

Install the Sigma Exporter plugin
Go to File → Export → Sigma.js template
Configure and export to create an interactive HTML visualization

Advanced techniques

Filter by community

Focus on specific communities:

Open Filters panel

Find the Filters panel on the right

Add partition filter

Expand Attributes → Partition
Drag Cluster to the Queries area

Select communities

Check/uncheck clusters to show/hide communities

Apply filter

Click Filter to update the visualization

Size by other metrics

You can size nodes by different centrality measures:

Betweenness centrality: Nodes that connect different parts of the graph
Closeness centrality: Nodes close to all others
Eigenvector centrality: Nodes connected to other important nodes

Run these under Statistics → Network Overview, then use them in the Appearance panel.

Multi-level analysis

GraphRAG’s Leiden algorithm creates hierarchical communities. To visualize different levels:

Run Leiden multiple times with different resolution parameters
Create separate visualizations for each level
Compare to see how communities nest within each other

Troubleshooting

Graph appears as a dense ball

Solution:

Run ForceAtlas2 longer (it may take time to untangle)
Increase Scaling parameter to 20-30
Enable Prevent Overlap
Try Fruchterman Reingold layout instead

Nodes are all the same color

Solution:

Ensure you ran the Leiden Algorithm
Check that you selected Partition not Ranking
Verify Cluster appears in the dropdown
Re-run Leiden if needed

Cannot find graph.graphml file

Solution:

Verify snapshots.graphml: true in settings.yaml
Re-run indexing with snapshots enabled
Check the storage.base_dir setting for output location

Gephi crashes or runs slowly

Solution:

Increase Gephi’s memory allocation
Filter the graph to show fewer nodes
Use a more powerful machine for large graphs
Consider sampling your data before indexing

Example workflow summary

Here’s the complete process at a glance:

Enable GraphML

snapshots:
  graphml: true

Run indexing

graphrag index --root ./my-project

Import to Gephi

Open output/graph.graphml

Install Leiden plugin

Tools → Plugins → Leiden Algorithm

Run statistics

Average Degree
Leiden Algorithm (Modularity, Resolution 1.0)

Apply appearance

Color by Cluster (Partition)
Size by Degree (10-150)

Layout

OpenORD (Liquid 50, Expansion 50)
ForceAtlas2 (Scaling 15, Dissuade Hubs, Prevent Overlap)

Export

Preview → Export as PNG/SVG

Next steps

Configuration

Learn about all configuration options

Best practices

Optimize your GraphRAG workflow

Query methods

Learn different search approaches

Data model

Understand the output schema

Get Started

Core Concepts

Indexing

Query Engine

Prompt Tuning

Configuration

Guides

​Overview

​Prerequisites

​Enable GraphML snapshots

​Visualization workflow

​1. Import into Gephi

​2. Install Leiden Algorithm plugin

​3. Run statistics

​4. Color nodes by cluster

​5. Resize nodes by degree centrality

​6. Layout the graph

​Step 1: OpenORD layout

​Step 2: ForceAtlas2 layout

​7. Add labels (optional)

​Understanding your visualization

​Graph elements

Nodes

Edges

Colors

Size

​Interpreting patterns

​Export and share

​Export as image

​Export interactive version

​Advanced techniques

​Filter by community

​Size by other metrics

​Multi-level analysis

​Troubleshooting

​Example workflow summary

​Next steps

Configuration

Best practices

Query methods

Data model

Build docs developers (and LLMs) love

Overview

Prerequisites

Enable GraphML snapshots

Visualization workflow

1. Import into Gephi

2. Install Leiden Algorithm plugin

3. Run statistics

4. Color nodes by cluster

5. Resize nodes by degree centrality

6. Layout the graph

Step 1: OpenORD layout

Step 2: ForceAtlas2 layout

7. Add labels (optional)

Understanding your visualization

Graph elements

Interpreting patterns

Export and share

Export as image

Export interactive version

Advanced techniques

Filter by community

Size by other metrics

Multi-level analysis

Troubleshooting

Example workflow summary

Next steps