This guide assumes you have already installed Syft Space. If not, install it first using Docker, Desktop App, or from source.
Step 1: Register your account
After installation, open Syft Space in your browser:Developer tokens are only needed for local deployments when publishing endpoints. See Installation for details.
Step 2: Add your data
Create a dataset to store your documents:Click Add Dataset
Fill in the dataset details:
- Name: my-docs
- Type: Local file (ChromaDB)
- Summary: My first dataset
Upload files
Upload PDF, TXT, or DOCX files. They’ll be automatically:
- Extracted and chunked
- Embedded with OpenAI
- Indexed in ChromaDB
Step 3: Connect a model
Configure an AI model to answer queries:Click Add Model
Fill in the model details:
- Name: gpt-4
- Type: OpenAI
- API Key: Your OpenAI API key
- Model: gpt-4 (or gpt-3.5-turbo for lower cost)
Step 4: Create an endpoint
Combine your dataset and model into a queryable endpoint:Click Add Endpoint
Fill in the endpoint details:
- Name: My Docs Assistant
- Slug: my-docs (must be unique)
- Dataset: my-docs
- Model: gpt-4
- Response Type: Both (summary + references)
Step 5: Test your endpoint
Query your endpoint using the built-in interface or API:- Web interface
- cURL
- Python
- Open your endpoint detail page
- Use the built-in query interface
- Ask a question: “What are the main topics in these documents?”
- View the AI-generated summary and source references
Step 6: Publish (optional)
Make your endpoint discoverable on SyftHub:Choose visibility
- Organization: Only accessible to your team
- Public: Discoverable on SyftHub marketplace
Confirm
Your endpoint is now live at syfthub.openmined.org
Publishing requires either a developer token (local) or public IP (cloud). Configure this in Settings → Network.
What’s next?
Core concepts
Understand how datasets, models, endpoints, and policies work
Creating datasets
Learn about dataset types and file ingestion
Building endpoints
Master endpoint configuration and testing
API reference
Explore the complete REST API
Example use case
Company knowledge base:- Dataset: Upload your internal documentation (wikis, guides, policies)
- Model: Connect GPT-4 for high-quality answers
- Endpoint: Create “company-kb” endpoint with summary + references
- Policies:
- Access: Only allow @company.com emails
- Rate limit: 100 requests/hour per user
- Publish: Share within organization