Overview
IPED uses a comprehensive parser framework built on top of Apache Tika to extract metadata and content from digital evidence. The parser architecture enables processing of diverse file formats, application artifacts, and embedded data structures commonly encountered in forensic investigations.Apache Tika Integration
IPED extends Apache Tika’s parser framework to provide specialized forensic capabilities:Standard Tika Parsers
Office documents, PDFs, images, videos, and common file formats
Custom IPED Parsers
Chat applications, P2P clients, browser artifacts, and mobile data
SQLite Processing
Automated extraction from SQLite databases with schema detection
Embedded Document Extraction
Recursive parsing of containers and embedded artifacts
Parser Categories
IPED implements specialized parsers across multiple forensic domains:Communication Artifacts
- Chat Applications: WhatsApp, Telegram, Skype, Discord, Threema
- Email Clients: PST, MBOX, DBX, MSG formats
- Social Media: Facebook, Instagram, TikTok artifacts
Browser Forensics
- Chrome/Chromium: History, downloads, searches, cache
- Firefox: Places database, bookmarks, session data
- Safari: SQLite and plist-based artifacts
- Edge/IE: WebCacheV01.dat, index.dat files
P2P Applications
- BitTorrent: Torrent files, resume.dat structures
- eMule: known.met, part.met sharing records
- Shareaza: Library.dat file catalogs
- Ares: ShareH.dat and ShareL.dat databases
Mobile Artifacts
- UFDR Format: Cellebrite extraction support
- AD1 Format: AccessData image processing
- iOS: Plist files, SQLite databases
- Android: XML preferences, SQLite databases
Parser Architecture
Base Classes
Key Components
Identifies file types for parser selection (e.g.,
application/x-whatsapp-db)Provides access to searcher, item metadata, and configuration
Receives parsed content as HTML or structured events
Stores extracted properties using standardized property names
Handles recursive extraction of embedded artifacts
Metadata Extraction
IPED parsers extract standardized metadata properties:Communication Properties
P2P Properties
Browser Properties
Parser Configuration
Parsers support configuration through Tika annotations:HTML Report Generation
Most parsers generate HTML reports for visualization:HTML reports include CSS styling, hyperlinks to related items, and structured data presentation for analysis interface integration.
Item Linking
Parsers can link artifacts to case items using various strategies:Hash-Based Linking
Query-Based Linking
Parser Examples
See the following pages for detailed parser documentation:Chat Applications
WhatsApp, Telegram, Skype parsers
Web Browsers
Chrome, Firefox, Safari, Edge parsers
P2P Applications
BitTorrent, eMule, Shareaza, Ares parsers
Mobile Artifacts
UFDR and mobile data parsers
Performance Considerations
Optimization Features
- Deleted Record Recovery: Optional scanning of SQLite free pages
- Backup Merging: Combines multiple backup databases
- Batch Processing: Searches performed in batches to reduce overhead
- Parallel Processing: Multi-threaded parsing where applicable
Next Steps
Explore Chat Parsers
Learn about WhatsApp, Telegram, and Skype parsing
Browser Artifacts
Understand browser history and download extraction
P2P Forensics
Examine peer-to-peer application parsers