Search Endpoint

The Search API enables semantic search across all user memories using vector embeddings. Query text is embedded and matched against stored memory vectors to find the most similar memories ranked by relevance.

Search Memories

curl -X POST https://api.azen.sh/api/v1/memory/search \
  -H "azen-api-key: YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "query": "outdoor activities",
    "topK": 5
  }'

Performs semantic vector search across all user memories using the provided query text. Returns the most similar memories ranked by similarity score.

Request Body

query

string

required

The search query text to find similar memories. Must be at least 1 character long.Example: "outdoor activities"

topK

number

default:"5"

Maximum number of top results to returnMinimum: 1Maximum: 50Default: 5

Response

status

string

required

Response status indicatorValue: "success"

memories

array

required

Array of matching memory objects ordered by relevance (highest similarity first)

Show Memory Object

string

required

Unique identifier for the memory (UUID format)Example: "550e8400-e29b-41d4-a716-446655440000"

content

string

required

The decrypted text content of the memoryExample: "I love hiking in the mountains"

metadata

object | null

required

Optional metadata associated with the memoryExample: null

createdAt

string

required

ISO 8601 timestamp when the memory was createdExample: "2024-01-15T10:30:00.000Z"

embedded

boolean

required

Indicates whether the memory has been successfully embedded for vector searchExample: true

rawMatches

array

required

Raw similarity scores and IDs from vector search

Show RawMatch Object

string

required

Memory identifier (may include chunk identifier separated by ::)Example: "550e8400-e29b-41d4-a716-446655440000::0"

score

number

required

Similarity score between query and memory (higher is better)Example: 0.87

values

array

required

Vector values (typically empty array in responses)Example: []

{
  "status": "success",
  "memories": [
    {
      "id": "550e8400-e29b-41d4-a716-446655440000",
      "content": "I love hiking in the mountains",
      "metadata": null,
      "createdAt": "2024-01-15T10:30:00.000Z",
      "embedded": true
    },
    {
      "id": "550e8400-e29b-41d4-a716-446655440001",
      "content": "Rock climbing is my favorite weekend activity",
      "metadata": null,
      "createdAt": "2024-01-14T15:20:00.000Z",
      "embedded": true
    }
  ],
  "rawMatches": [
    {
      "id": "550e8400-e29b-41d4-a716-446655440000::0",
      "score": 1.00110459,
      "values": []
    },
    {
      "id": "550e8400-e29b-41d4-a716-446655440001::0",
      "score": 0.82,
      "values": []
    }
  ]
}

How It Works

Query Embedding: Your search query is converted into a vector embedding using the same model used for memory embeddings
Vector Search: The query vector is compared against all memory vectors in your organization’s namespace
Ranking: Results are ranked by cosine similarity score (higher scores indicate better matches)
Decryption: Matched memories are decrypted and returned with their similarity scores

Best Practices

Query Length: Queries don’t need to be exact matches. Natural language queries work well (e.g., “outdoor activities” will match “hiking”, “camping”, etc.)
Top K Selection: Start with the default topK: 5 and adjust based on your use case. Higher values return more results but may include less relevant matches
Score Interpretation: Scores typically range from 0 to 1+. Scores above 0.7 generally indicate good semantic similarity
Empty Results: If no memories are returned, it means either:
- No memories have been created yet
- Memories haven’t finished embedding yet (check embedded: false on memories)
- The query doesn’t semantically match any stored memories

Overview

Endpoints

Search Endpoint

Search Memories

Request Body

Response

How It Works

Best Practices

Build docs developers (and LLMs) love

Overview

Endpoints

​Search Memories

​Request Body

​Response

​How It Works

​Best Practices

Build docs developers (and LLMs) love

Search Memories

Request Body

Response

How It Works

Best Practices