Available Examples
Transformers Research
12 foundational AI papers analyzed into 425 entities
FTX Collapse
9 Wikipedia articles about the cryptocurrency exchange collapse
Epstein/Maxwell Deposition
Court deposition with OCR-scanned PDF processing
What These Examples Show
Each example includes:
- Complete source documents: PDFs or text files you can inspect
- Full pipeline outputs: extractions, graph data, merge proposals, narratives
- Interactive graph viewer: open `output/graph.html` in any browser
- Real cost data: actual LLM API costs from running the pipeline
- Reproducible commands: re-run from existing extractions or start fresh
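Because the graph viewer is a plain HTML file, it can also be opened from a script instead of by hand. The following is a minimal sketch using only the Python standard library; it assumes the `output/graph.html` layout mentioned above, and the directory name in the usage note is hypothetical, not a path confirmed by the examples.

```python
import webbrowser
from pathlib import Path


def graph_viewer_uri(example_dir: str) -> str:
    """Build a file:// URI for an example's graph viewer.

    Assumes the viewer lives at <example_dir>/output/graph.html.
    """
    graph_path = Path(example_dir) / "output" / "graph.html"
    # resolve() makes the path absolute, which as_uri() requires.
    return graph_path.resolve().as_uri()


def open_graph_viewer(example_dir: str) -> bool:
    """Open the viewer in the default browser; returns False if no browser could be launched."""
    return webbrowser.open(graph_viewer_uri(example_dir))
```

Calling `open_graph_viewer("examples/transformers")` (a placeholder directory name) would launch the default browser on that example's graph, provided the file exists at that location.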
Quick Start
All examples can be explored immediately without installation.
Comparison Table
| Example | Documents | Final Entities | Relations | Domain | Total Cost |
|---|---|---|---|---|---|
| Transformers | 12 PDFs | 425 | 1,122 | Academic research | ~$0.72 |
| FTX | 9 text files | 373 | 1,184 | Finance/journalism | ~$0.28 |
| Epstein | 1 PDF (36 sections) | 190 | 387 | Legal/investigative | N/A |
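The table's totals can be normalized to compare the examples more directly. The sketch below copies the figures from the table and derives cost per document and relations per entity; the helper names are illustrative, and the costs are the approximate values listed above, so the derived numbers are approximate too.

```python
# Figures copied from the comparison table. Costs are the approximate
# totals listed there; None marks the example with no cost reported (N/A).
EXAMPLES = [
    {"name": "Transformers", "documents": 12, "entities": 425, "relations": 1122, "cost_usd": 0.72},
    {"name": "FTX", "documents": 9, "entities": 373, "relations": 1184, "cost_usd": 0.28},
    {"name": "Epstein", "documents": 1, "entities": 190, "relations": 387, "cost_usd": None},
]


def cost_per_document(example):
    """Approximate LLM cost per source document, or None when no cost is listed."""
    if example["cost_usd"] is None:
        return None
    return example["cost_usd"] / example["documents"]


def relations_per_entity(example):
    """Average number of relations per final entity."""
    return example["relations"] / example["entities"]


if __name__ == "__main__":
    for ex in EXAMPLES:
        cost = cost_per_document(ex)
        cost_text = "n/a" if cost is None else f"~${cost:.3f}"
        print(f"{ex['name']}: {cost_text} per document, "
              f"{relations_per_entity(ex):.2f} relations per entity")
```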
Running Examples Yourself
Each example supports two workflows:
Start from Existing Extractions (Free)
No LLM calls required; use the pre-extracted entities.
Re-run Full Pipeline
Extract fresh entities from source documents.
Next Steps
Install sift-kg
Get started with your own documents
Pipeline Guide
Understand the full extraction workflow