Line Sort/Dedupe

Overview

The Line Sort/Dedupe tool sorts and deduplicates text lines. Supports ascending/descending sort, case-insensitive deduplication with first-seen casing preservation, and combined sort+dedupe operations.

Use Cases

Text Cleanup: Remove duplicate lines from lists
Data Preparation: Sort lines alphabetically for diff/merge
Log Analysis: Deduplicate and sort log entries
List Management: Clean up email lists, usernames, IDs
Configuration Files: Sort and dedupe config entries
SQL Output: Clean up query results

Input Format

Plain text with one item per line:

banana
apple
cherry
apple
date

user123
user456
USER123
user789
user456

ERROR: Connection failed
WARNING: Low memory
ERROR: Connection failed
INFO: Process started
WARNING: Low memory

Actions

Sort Ascending (default)

Sort lines alphabetically A→Z:

apple
apple
banana
cherry
date

Sort Descending

Sort lines alphabetically Z→A:

date
cherry
banana
apple
apple

Dedupe

Remove duplicates, preserve first-seen casing, maintain original order:

banana
apple
cherry
date

Dedupe + Sort

Remove duplicates, then sort alphabetically:

apple
banana
cherry
date

Output Format

Processed text with one item per line:

apple
banana
cherry
date

Metadata:

4 lines | 1 duplicate(s)

Deduplication Rules

Case Insensitive

Comparison ignores case:

apple
Apple
APPLE

→ Treated as same value

First-Seen Casing

Preserves casing of first occurrence:

Apple
apple
APPLE

→ Output: Apple (first-seen)

Whitespace Handling

Lines are trimmed before comparison:

  apple  
apple
 apple

→ Treated as same value → Output: apple (first trimmed version)

Empty Lines

Empty lines (after trimming) are ignored:

apple

banana
  
cherry

→ Empty lines removed

Examples

zebra
apple
mango
banana

apple
banana
cherry
date

banana
apple
cherry
apple
date
banana

banana
apple
cherry
apple
date
banana

Apple
banana
APPLE
Cherry
Banana
apple

apple

banana
  
cherry

apple

ERROR: Connection timeout
WARNING: Low disk space
ERROR: Connection timeout
INFO: Service started
WARNING: Low disk space
ERROR: Connection timeout

Implementation Details

From lib/tools/engine.ts:968-1004:

case 'line-sort': {
  if (!input) return { output: '', meta: '0 lines | 0 duplicate(s)' };
  const lines = input.split('\n');

  // Build a first-seen Map iteratively
  const buildFirstSeenMap = (src: string[]): Map<string, string> => {
    const seen = new Map<string, string>();
    for (const l of src) {
      const key = l.trim().toLowerCase();
      if (key && !seen.has(key)) seen.set(key, l.trim());
    }
    return seen;
  };

  // Apply dedupe option before sorting when explicitly requested
  const workingLines = options.dedupe === true
    ? [...buildFirstSeenMap(lines).values()]
    : lines;

  let result: string[];
  switch (action) {
    case 'sort-asc':
    case 'default':
      result = [...workingLines].sort((a, b) => a.localeCompare(b));
      break;
    case 'sort-desc': 
      result = [...workingLines].sort((a, b) => b.localeCompare(a)); 
      break;
    case 'dedupe': 
      result = [...buildFirstSeenMap(lines).values()]; 
      break;
    case 'dedupe-sort': 
      result = [...buildFirstSeenMap(lines).values()].sort((a, b) => a.localeCompare(b)); 
      break;
    default: 
      result = [...workingLines].sort((a, b) => a.localeCompare(b));
  }

  const nonEmpty = lines.filter((l) => l.trim()).length;
  const uniqueCount = new Set(lines.map((l) => l.trim().toLowerCase()).filter(Boolean)).size;
  const dupes = nonEmpty - uniqueCount;
  return { output: result.join('\n'), meta: `${result.length} lines | ${dupes} duplicate(s)` };
}

First-Seen Casing:

const buildFirstSeenMap = (src: string[]): Map<string, string> => {
  const seen = new Map<string, string>();
  for (const l of src) {
    const key = l.trim().toLowerCase();
    if (key && !seen.has(key)) seen.set(key, l.trim());
  }
  return seen;
};

Normalize key: trim + lowercase
Only store first occurrence
Preserve original casing in value

Sorting:

lines.sort((a, b) => a.localeCompare(b))  // Ascending
lines.sort((a, b) => b.localeCompare(a))  // Descending

Uses localeCompare for proper Unicode sorting
Supports accented characters and non-English alphabets

Edge Cases

Single Line

apple

→ Output: apple → Meta: 1 lines | 0 duplicate(s)

All Duplicates

apple
apple
apple

→ Output: apple → Meta: 1 lines | 2 duplicate(s)

Empty Input

(empty)

→ Output: (empty) → Meta: 0 lines | 0 duplicate(s)

Only Whitespace

→ Output: (empty) → Meta: 0 lines | 0 duplicate(s)

Performance

Lines: Handles up to 100K lines efficiently
Algorithm: O(n log n) for sorting, O(n) for deduplication
Memory: Stores all lines in memory (approx 1MB per 10K lines)

Comparison with Unix Tools

sort + uniq

sort file.txt | uniq

= Line Sort/Dedupe with “Dedupe + Sort” action

sort -u

sort -u file.txt

= Line Sort/Dedupe with “Dedupe + Sort” action

sort -r

sort -r file.txt

= Line Sort/Dedupe with “Sort Descending” action

The Line Sort/Dedupe tool uses first-seen casing preservation for deduplication, which differs from Unix uniq (which requires sorted input and preserves last-seen casing).

For maximum duplicate removal, use the “Dedupe + Sort” action. For preserving original line order, use the “Dedupe” action.

Very large inputs (over 1 million lines) may cause performance issues or browser memory limits. For massive datasets, use command-line tools like sort and uniq instead.

Encoding & Encryption

Escaping & Entities

Preview & Comparison

Format Converters

Code Beautifiers

Generators

Data Transformers

Overview

Use Cases

Input Format

Actions

Sort Ascending (default)

Sort Descending

Dedupe

Dedupe + Sort

Output Format

Deduplication Rules

Case Insensitive

First-Seen Casing

Whitespace Handling

Empty Lines

Examples

Implementation Details

Edge Cases

Single Line

All Duplicates

Empty Input

Only Whitespace

Performance

Comparison with Unix Tools

sort + uniq

sort -u

sort -r

Build docs developers (and LLMs) love

Encoding & Encryption

Escaping & Entities

Preview & Comparison

Format Converters

Code Beautifiers

Generators

Data Transformers

​Overview

​Use Cases

​Input Format

​Actions

​Sort Ascending (default)

​Sort Descending

​Dedupe

​Dedupe + Sort

​Output Format

​Deduplication Rules

​Case Insensitive

​First-Seen Casing

​Whitespace Handling

​Empty Lines

​Examples

​Implementation Details

​Edge Cases

​Single Line

​All Duplicates

​Empty Input

​Only Whitespace

​Performance

​Comparison with Unix Tools

​sort + uniq

​sort -u

​sort -r

Build docs developers (and LLMs) love

Overview

Use Cases

Input Format

Actions

Sort Ascending (default)

Sort Descending

Dedupe

Dedupe + Sort

Output Format

Deduplication Rules

Case Insensitive

First-Seen Casing

Whitespace Handling

Empty Lines

Examples

Implementation Details

Edge Cases

Single Line

All Duplicates

Empty Input

Only Whitespace

Performance

Comparison with Unix Tools

sort + uniq

sort -u

sort -r