Streaming

Overview

Streaming mode enables incremental parsing of markdown content as it arrives, making it ideal for processing LLM outputs or any scenario where content is generated progressively. The parser maintains internal state across calls and only returns blocks that have been finalized.

When streaming is disabled (default), the parser finalizes all open blocks at the end of the input. With streaming enabled, blocks are only emitted once they become stable and closed.

How streaming works

The parser implements streaming by tracking which blocks are “closed” versus “open”:

Closed blocks: Fully parsed and stable, ready to be emitted
Open blocks: Still being built, may change as more content arrives

Blocks close when the parser encounters content that definitively ends them, such as:

A blank line after a paragraph
A closing fence for a code block
New block-level content that can’t continue the current block

Internal implementation

The parser uses an isClosed flag on each internal block node to track state (see markdown-parser.ts:42-44):

if (!stream) {
  // Close the latest spine of the root node
  closeRightmostPath(this.root);
}

When streaming is disabled, the closeRightmostPath function walks the rightmost child chain and marks all open blocks as closed before returning.

Basic usage

Complete parsing
Streaming mode

import { MarkdownParser } from "markdown-parser";

const parser = new MarkdownParser();

// Parse complete markdown (streaming: false by default)
const nodes = parser.parse("# Hello World\nThis is a paragraph.");
// Returns:
// [
//   { type: "heading", level: 1, children: [...] },
//   { type: "paragraph", children: [...] }
// ]

import { MarkdownParser } from "markdown-parser";

const parser = new MarkdownParser();

// First chunk - heading is closed, paragraph still open
const nodes1 = parser.parse("# Hello World\nThis", { stream: true });
// Returns: [{ type: "heading", level: 1, children: [...] }]

// Second chunk - first paragraph closes, second still open
const nodes2 = parser.parse(" is a paragraph\n\nThis is", { stream: true });
// Returns: [{ type: "paragraph", children: [...] }]

// Final chunk - close all remaining blocks
const nodes3 = parser.parse(" another.\n", { stream: false });
// Returns: [{ type: "paragraph", children: [...] }]

When to use streaming

LLM outputs

Process AI-generated markdown as it streams from the model, providing real-time rendering

Network streams

Parse markdown content arriving over HTTP streams or WebSockets without buffering

Large files

Process large markdown files chunk by chunk to reduce memory usage

Real-time editing

Update parsed output as users type without re-parsing the entire document

Block closure rules

Different block types close under different conditions:

Paragraphs and tables

Close when encountering:

A blank line
The start of another block element (heading, code block, etc.)

// markdown-parser.ts:181-195
if (child.type === "paragraph" || child.type === "table") {
  if (isLineEmpty(line)) {
    child.isClosed = true;
    return;
  }
}

Fenced code blocks

Close when encountering a closing fence with the same marker and at least as many characters:

// markdown-parser.ts:126-139
if (child.type === "fenced-code-block") {
  if (isCodeFenceEnd(line, { marker, numOfMarkers })) {
    child.isClosed = true;
    return;
  }
  child.lines.push(sliceLeadingIndent(line, child.indentLevel));
  return;
}

Indented code blocks

Close when encountering a non-indented, non-empty line:

// markdown-parser.ts:142-157
if (child.type === "indented-code-block") {
  if (isIndentedCodeLine(line)) {
    child.lines.push(sliceLeadingIndent(line, 4));
    return;
  } else if (isLineEmpty(line)) {
    child.lines.push("");
    return;
  }
  child.isClosed = true;
  break;
}

HTML blocks

Close based on their specific end pattern or blank line interruption:

// markdown-parser.ts:159-178
if (child.type === "html-block") {
  if (child.canBeInterruptedByBlankLine && isLineEmpty(line)) {
    child.isClosed = true;
    return;
  }
  if (child.endPattern?.test(line.trim())) {
    child.isClosed = true;
    return;
  }
}

Headings and thematic breaks

Always closed immediately upon creation (single-line elements):

// markdown-parser.ts:252-253
{
  type: "heading",
  isClosed: true,
  // ...
}

Link reference definitions

When streaming is enabled, link references may not resolve immediately if their definition arrives in a later chunk.

From the README:

CommonMark specification allows link reference definitions to appear after the links that use them. Therefore, when streaming is enabled, it is important to consider that a link reference might not resolve, since its definition could arrive in a later chunk of the input.

The parser only processes reference definitions for finalized blocks (see markdown-parser.ts:519-553):

private parseReferenceLinkDefinitions(parent) {
  for (let i = 0; i < parent.children.length; i++) {
    const block = parent.children[i];
    
    // Only parse closed/finalized blocks
    if (!block.isClosed) break;
    
    if (block.type === "paragraph") {
      const result = parseLinkReferenceDefinitions(block.lines);
      for (const definition of result.definitions) {
        if (!this.referenceDefinitions.has(label)) {
          this.referenceDefinitions.set(label, { href, title });
        }
      }
    }
  }
}

State management

The parser maintains state across streaming calls using:

Root node: Contains all parsed blocks (see markdown-parser.ts:13-18)
Next node index: Tracks which blocks have been emitted (see markdown-parser.ts:19)
Reference definitions: Stores link references across chunks (see markdown-parser.ts:20-21)
Line splitter: Handles line boundaries across chunks (see markdown-parser.ts:13)

export class MarkdownParser {
  private splitter = new LineSplitter();
  private root: RootNode_internal = {
    type: "root",
    children: [],
    parent: null,
  };
  private nextNodeIndex = 0;
  private referenceDefinitions: Map<string, { href: string; title?: string }> = new Map();
}

The parser only returns blocks that have been fully finalized since the last call, preventing duplicate emissions.

Performance considerations

Memory usage: Streaming mode keeps internal state for open blocks, but emits and discards closed blocks, preventing unbounded memory growth. Latency: Blocks are emitted as soon as they close, minimizing delay between input and output. Incremental processing: The line-by-line parsing approach (see markdown-parser.ts:25-27) ensures minimal reprocessing:

for (const line of this.splitter.split(input, { stream })) {
  this.parseLine(line.replace(/\0/g, "\uFFFD"));
}

Best practices

Create a single parser instance

Reuse the same MarkdownParser instance across streaming calls to maintain state:

const parser = new MarkdownParser();

// Multiple streaming calls reuse the same parser
stream.on('data', (chunk) => {
  const nodes = parser.parse(chunk, { stream: true });
  renderNodes(nodes);
});

Finalize with stream: false

Always call with stream: false at the end to close remaining open blocks:

stream.on('end', () => {
  const finalNodes = parser.parse("", { stream: false });
  renderNodes(finalNodes);
});

Handle partial references

Be aware that link references may not resolve until their definition is parsed:

// Chunk 1: [link][ref]
// -> Link may not resolve yet

// Chunk 2: [ref]: https://example.com
// -> Previous link now resolves

Get Started

Core Concepts

Guides

Overview

How streaming works

Internal implementation

Basic usage

When to use streaming

LLM outputs

Network streams

Large files

Real-time editing

Block closure rules

Link reference definitions

State management

Performance considerations

Best practices

Build docs developers (and LLMs) love

Get Started

Core Concepts

Guides

​Overview

​How streaming works

​Internal implementation

​Basic usage

​When to use streaming

LLM outputs

Network streams

Large files

Real-time editing

​Block closure rules

​Link reference definitions

​State management

​Performance considerations

​Best practices

Build docs developers (and LLMs) love

Overview

How streaming works

Internal implementation

Basic usage

When to use streaming

Block closure rules

Link reference definitions

State management

Performance considerations

Best practices