Deduplication
Remove duplicate items from your list while preserving the order of first occurrences.
None (default)
No deduplication is performed. All items are preserved exactly as entered, including duplicates.
Case-sensitive
Removes exact duplicates. ABC123 and abc123 are treated as different items.
Example:
- Input
- Output (case-sensitive)
Case-insensitive
Removes duplicates using lowercase comparison. ABC123 and abc123 are treated as the same item.
Example:
- Input
- Output (case-insensitive)
Case-insensitive deduplication preserves the original casing of the first occurrence. Only subsequent duplicates (regardless of case) are removed.
Implementation
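A minimal sketch of the three modes, assuming items arrive as an array of strings. The names `DedupMode` and `dedupe` are hypothetical and are not taken from SplitBox's source.

```typescript
// Hypothetical names: DedupMode and dedupe are illustrative, not SplitBox's API.
type DedupMode = "none" | "case-sensitive" | "case-insensitive";

function dedupe(items: string[], mode: DedupMode): string[] {
  if (mode === "none") {
    return [...items]; // keep everything, duplicates included
  }
  const seen = new Set<string>();
  const result: string[] = [];
  for (const item of items) {
    // Compare on a lowercased key in case-insensitive mode, on the raw item otherwise.
    const key = mode === "case-insensitive" ? item.toLowerCase() : item;
    if (!seen.has(key)) {
      seen.add(key);
      result.push(item); // the first occurrence keeps its original casing
    }
  }
  return result;
}

const sample = ["ABC123", "abc123", "XYZ789", "ABC123"];
console.log(dedupe(sample, "case-sensitive"));   // ["ABC123", "abc123", "XYZ789"]
console.log(dedupe(sample, "case-insensitive")); // ["ABC123", "XYZ789"]
```

Using a `Set` of comparison keys alongside a separate result array is what keeps first-occurrence order and original casing intact: the key decides whether an item is new, while the untouched item is what gets stored.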
Validation
Filter items based on format rules. Invalid items are removed before splitting, and SplitBox shows you examples of what was filtered out.
None (default)
No validation is performed. All non-empty items are accepted.
Alphanumeric
Only accept items containing letters, numbers, underscores, and hyphens. Regex pattern: /^[A-Za-z0-9_-]+$/
Example:
- Input
- Output (alphanumeric)
Spaces are not allowed in alphanumeric mode. Use underscores or hyphens instead.
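Email
Only accept items that contain exactly one @, at least one dot after it, and no whitespace. Regex pattern: /^[^\s@]+@[^\s@]+\.[^\s@]+$/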
Example:
- Input
- Output (email)
Custom regex
Provide your own regular expression pattern for custom validation rules. Example use cases:
UUID validation
Pattern: ^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$
Only accepts hyphenated UUIDs written with lowercase hex digits, like 550e8400-e29b-41d4-a716-446655440000
Phone number validation
Pattern: ^\+?[1-9]\d{1,14}$
Accepts international phone numbers in E.164 format, with an optional leading +
Implementation
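A minimal sketch of how the validation modes could be applied, assuming items arrive as an array of strings and that empty lines have already been dropped. The names `ValidationMode`, `PATTERNS`, and `validateItems` are hypothetical; only the regular expressions come from this section.

```typescript
// Hypothetical names: ValidationMode, PATTERNS, and validateItems are illustrative only.
type ValidationMode = "none" | "alphanumeric" | "email" | "custom";

// The two built-in patterns quoted in this section.
const PATTERNS = {
  alphanumeric: /^[A-Za-z0-9_-]+$/,
  email: /^[^\s@]+@[^\s@]+\.[^\s@]+$/,
};

function validateItems(
  items: string[],
  mode: ValidationMode,
  customPattern?: string,
): { valid: string[]; invalid: string[] } {
  let pattern: RegExp | null = null;
  if (mode === "alphanumeric" || mode === "email") {
    pattern = PATTERNS[mode];
  } else if (mode === "custom") {
    if (customPattern === undefined) {
      throw new Error("custom mode needs a pattern");
    }
    pattern = new RegExp(customPattern); // throws SyntaxError on a malformed pattern
  }
  const valid: string[] = [];
  const invalid: string[] = [];
  for (const item of items) {
    // With mode "none" there is no pattern, so every item is accepted.
    if (pattern === null || pattern.test(item)) {
      valid.push(item);
    } else {
      invalid.push(item); // kept so that examples of filtered items can be shown
    }
  }
  return { valid, invalid };
}

const uuidPattern = "^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$";
const checked = validateItems(
  ["550e8400-e29b-41d4-a716-446655440000", "not-a-uuid"],
  "custom",
  uuidPattern,
);
console.log(checked.valid);   // ["550e8400-e29b-41d4-a716-446655440000"]
console.log(checked.invalid); // ["not-a-uuid"]
```

Returning the rejected items alongside the accepted ones is a design choice made here so that a caller can report examples of what was filtered out.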
Preprocessing statistics
SplitBox tracks and reports what was removed during preprocessing:
Preprocessing complete:
- Started with 1,523 items
- Removed 47 empty lines
- Removed 12 invalid items (examples: bad@item, invalid#id, wrong!format)
- Removed 83 duplicates
- Final count: 1,381 items ready to split
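The counts in a report like this can be collected while the list is cleaned. The sketch below is one way to do it, under several assumptions that are not confirmed by this page: the step order (drop empty lines, then validate, then deduplicate) is read off the order of the report lines, items are trimmed before checking, deduplication is case-insensitive, and all names (`PreprocessStats`, `preprocess`, `maxExamples`) are hypothetical.

```typescript
// Hypothetical names and step order; see the assumptions listed above.
interface PreprocessStats {
  started: number;
  emptyRemoved: number;
  invalidRemoved: number;
  invalidExamples: string[];
  duplicatesRemoved: number;
  finalCount: number;
}

function preprocess(
  lines: string[],
  isValid: (item: string) => boolean,
  maxExamples = 3,
): { items: string[]; stats: PreprocessStats } {
  // Step 1 (assumed): trim each line and drop the ones that end up empty.
  const nonEmpty = lines.map((line) => line.trim()).filter((line) => line !== "");

  // Step 2 (assumed): validate, remembering a few rejected items as examples.
  const valid: string[] = [];
  const invalidExamples: string[] = [];
  let invalidRemoved = 0;
  for (const item of nonEmpty) {
    if (isValid(item)) {
      valid.push(item);
    } else {
      invalidRemoved += 1;
      if (invalidExamples.length < maxExamples) {
        invalidExamples.push(item);
      }
    }
  }

  // Step 3 (assumed): case-insensitive deduplication, keeping the first occurrence.
  const seen = new Set<string>();
  const items: string[] = [];
  for (const item of valid) {
    const key = item.toLowerCase();
    if (!seen.has(key)) {
      seen.add(key);
      items.push(item);
    }
  }

  return {
    items,
    stats: {
      started: lines.length,
      emptyRemoved: lines.length - nonEmpty.length,
      invalidRemoved,
      invalidExamples,
      duplicatesRemoved: valid.length - items.length,
      finalCount: items.length,
    },
  };
}

const report = preprocess(
  ["TXN_1", "", "txn_1", "bad@item", "TXN_2"],
  (item) => /^[A-Za-z0-9_-]+$/.test(item),
);
console.log(report.items); // ["TXN_1", "TXN_2"]
console.log(report.stats.finalCount); // 2
```

The counts always reconcile: the starting count minus the three removal counts equals the final count, just as 1,523 - 47 - 12 - 83 = 1,381 in the report above.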
Processing order
Preprocessing happens in this exact order:
Common workflows
Clean transaction IDs
- Validation: Alphanumeric
- Deduplication: Case-insensitive
Email list cleanup
- Validation: Email
- Deduplication: Case-insensitive
UUID processing
- Validation: Custom regex (UUID pattern)
- Deduplication: Case-sensitive
API token batching
- Validation: Alphanumeric
- Deduplication: Case-sensitive
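The four workflows above can be written down as data. This is only an illustrative sketch: SplitBox's actual settings format is not shown on this page, so the `WorkflowPreset` shape, the `WORKFLOWS` name, and the option strings are all hypothetical.

```typescript
// Hypothetical shape: SplitBox's real settings format is not shown in this section.
type Validation = "none" | "alphanumeric" | "email" | "custom";
type Deduplication = "none" | "case-sensitive" | "case-insensitive";

interface WorkflowPreset {
  validation: Validation;
  deduplication: Deduplication;
  customPattern?: string; // only meaningful when validation is "custom"
}

const WORKFLOWS: Record<string, WorkflowPreset> = {
  "Clean transaction IDs": { validation: "alphanumeric", deduplication: "case-insensitive" },
  "Email list cleanup": { validation: "email", deduplication: "case-insensitive" },
  "UUID processing": {
    validation: "custom",
    deduplication: "case-sensitive",
    customPattern: "^[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}$",
  },
  "API token batching": { validation: "alphanumeric", deduplication: "case-sensitive" },
};

console.log(Object.keys(WORKFLOWS).length); // 4
```

Case-sensitive deduplication is enough for the UUID workflow because the pattern only admits lowercase hex digits, so two accepted items can never differ by case alone.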