Quick start
Set up Quark and chat with your first document in minutes.
Architecture
Understand the ingestion pipeline, memory system, and vector search.
CLI usage
Learn all CLI commands for ingesting documents and querying them locally.
API reference
Integrate Quark into your application via the REST API.
How it works
Ingest your documents
Point Quark at a PDF. It extracts text and images using Unstructured.io and pdfplumber, then embeds everything into a Qdrant vector database.
Ask a question
Type your question in the CLI or send it to the
/chat/completions endpoint. Quark retrieves the most relevant chunks and re-ranks them for precision.Key features
Multimodal ingestion
Processes text, tables, and images from PDFs. Visual elements are described by a vision LLM and made searchable.
Dual-layer memory
Redis handles short-term session context. Mem0 stores long-term user preferences and history across sessions.
Re-ranked retrieval
VoyageAI embeddings combined with a re-ranking pass deliver highly relevant context to the LLM.
Grounded responses
The LLM is instructed to use only your document content and cite every source — no hallucinations.
CLI interface
A full-featured terminal UI with session management, ingest tracking, and chat history — no browser needed.
REST API
An Elysia-powered HTTP server exposes ingestion, retrieval, and session management endpoints for programmatic access.