This guide walks you through setting up SvaraAI locally and starting your first emotion-aware therapy conversation.

Prerequisites

Before you begin, ensure you have the following installed:
  • Node.js (v18 or higher)
  • npm or yarn
  • Git
  • A Hume AI account with API credentials

Get your API credentials

1. Sign up for Hume AI

Visit Hume AI and create an account if you don’t have one.
2. Generate API credentials

Navigate to your Hume AI dashboard and generate:
  • API Key (for voice interface authentication)
  • Secret Key (for backend OAuth2 token generation)
  • Config ID (for your custom EVI configuration)
3. Save your credentials

Keep these values handy; you’ll need them in the next section.

Clone and install

1. Clone the repository

Clone the SvaraAI repository to your local machine:
git clone https://github.com/whysooharsh/svaraAI.git
cd svaraAI
2. Install backend dependencies

Navigate to the backend directory and install dependencies:
cd Backend
npm install
The backend uses Express.js and includes the following key dependencies:
  • express - Web server framework
  • cors - Cross-origin resource sharing
  • dotenv - Environment variable management
3. Install frontend dependencies

In a new terminal, navigate to the frontend directory and install dependencies:
cd Frontend
npm install
The frontend uses React with Vite and includes:
  • @humeai/voice-react - Hume AI voice SDK
  • react and react-dom (v19)
  • framer-motion - Animation library
  • tailwindcss - Styling framework

Configure environment variables

SvaraAI requires environment variables for both the frontend and backend. In the Frontend directory, create a .env file with your Hume credentials:
Frontend/.env
# Hume AI Credentials
VITE_HUME_API_KEY=your_hume_api_key_here
VITE_HUME_CONFIG_ID=your_hume_config_id_here
Never commit your .env files to version control. The API keys provide full access to your Hume AI account.

Environment variable breakdown

| Variable | Location | Purpose |
| --- | --- | --- |
| VITE_HUME_API_KEY | Frontend & Backend | Authenticates voice interface connections |
| VITE_HUME_CONFIG_ID | Frontend | Specifies your custom EVI configuration |
| HUME_SECRET_KEY | Backend | Generates OAuth2 access tokens |
| GEMINI_API_KEY | Backend | Authenticates with the Google Gemini API |
| GEMINI_PROMPT | Backend | Template for therapeutic response generation |
| PORT | Backend | Backend server port (defaults to 5000) |
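The backend expects its own .env file in the Backend directory. A sketch using the variable names listed above (all values are placeholders; the exact prompt template is up to you):

```bash
# Backend/.env : placeholder values, replace with your own credentials
VITE_HUME_API_KEY=your_hume_api_key_here
HUME_SECRET_KEY=your_hume_secret_key_here
GEMINI_API_KEY=your_gemini_api_key_here
GEMINI_PROMPT=your_therapeutic_prompt_template_here
PORT=5000
```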

Start the development servers

SvaraAI runs two separate servers: a backend API and a frontend React application. Start the backend first:
cd Backend
npm run dev
You should see output similar to:
Server running on PORT 5000
Then, in a second terminal, start the frontend:
cd Frontend
npm run dev
The frontend development server (Vite) automatically proxies /api requests to the backend server at http://localhost:5000. This is configured in vite.config.ts.
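As a sketch, the relevant part of that file might look like this (the actual config may include more, such as the React plugin):

```typescript
// Frontend/vite.config.ts (illustrative sketch; the real file may differ)
import { defineConfig } from "vite";

export default defineConfig({
  server: {
    proxy: {
      // Forward /api/* requests from the Vite dev server (port 5173)
      // to the Express backend on port 5000.
      "/api": {
        target: "http://localhost:5000",
        changeOrigin: true,
      },
    },
  },
});
```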

Start your first conversation

1. Open the application

Navigate to http://localhost:5173/playground in your browser.
2. Grant microphone permissions

When prompted, allow the application to access your microphone. This is required for voice-based therapy sessions.
3. Connect to the voice interface

Click the connect button to establish a connection with Hume AI’s Empathic Voice Interface (EVI). The application will:
  1. Authenticate using your API key
  2. Load your custom configuration
  3. Initialize the emotion detection models
Frontend/src/components/startCall.tsx
const connectOptions: ConnectOptions = {
  auth: { type: "apiKey", value: apiKey },
  configId: configId,
};
await connect(connectOptions);
4. Start speaking

Once connected, simply start speaking. SvaraAI will:
  • Detect emotions in your voice using prosody analysis
  • Analyze language patterns for sentiment
  • Provide empathetic, context-aware responses
  • Track conversation insights for later review
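Each EVI message carries per-emotion prosody scores. As an illustration (this helper and the score values are hypothetical, not part of the SvaraAI codebase), picking the strongest detected emotions is a simple sort:

```typescript
// Hypothetical helper: given a map of emotion name -> prosody score
// (a shape similar to what Hume EVI attaches to each message),
// return the n highest-scoring emotions as [name, score] pairs.
type EmotionScores = Record<string, number>;

export function topEmotions(scores: EmotionScores, n: number): [string, number][] {
  return Object.entries(scores)
    .sort(([, a], [, b]) => b - a) // highest score first
    .slice(0, n);
}

// Example with made-up scores:
const top = topEmotions({ calmness: 0.62, joy: 0.31, anxiety: 0.74 }, 2);
// top is [["anxiety", 0.74], ["calmness", 0.62]]
```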

How it works

SvaraAI uses Hume AI’s Empathic Voice Interface to create emotion-aware conversations:
Frontend/src/components/chat.tsx
import { VoiceProvider } from "@humeai/voice-react";
import Messages from "./message";
import Controls from "./controls";
import StartCall from "./startCall";

export default function ChatInterface() {
  const apiKey = import.meta.env.VITE_HUME_API_KEY || "";
  const configId = import.meta.env.VITE_HUME_CONFIG_ID || "";

  return (
    <VoiceProvider>
      <Messages />
      <Controls />
      <StartCall apiKey={apiKey} configId={configId} />
    </VoiceProvider>
  );
}

Architecture overview

  1. Frontend - React application with Hume AI voice SDK
    • Real-time voice communication
    • Emotion visualization
    • Conversation history
  2. Backend - Express.js API server
    • OAuth2 token generation for secure API access
    • Audio analysis endpoints
    • Conversation data persistence
  3. Hume AI EVI - Emotion detection and response generation
    • Prosody analysis (voice tone, pitch, rhythm)
    • Language sentiment analysis
    • Context-aware empathetic responses

Available routes

SvaraAI provides several routes for different functionalities:
| Route | Description |
| --- | --- |
| / | Landing page with product information |
| /playground | Interactive voice therapy interface |
| /insights | View conversation analytics and emotion insights |

Next steps

Voice emotion detection

Learn how SvaraAI detects emotions from voice using Hume AI.

Conversation insights

Learn how to interpret emotion data and conversation analytics.

API reference

Explore backend endpoints for audio analysis and conversation management.

Architecture overview

Understand how SvaraAI’s components work together.

Troubleshooting

If you see “Missing API credentials. Check your .env file”, ensure:
  1. Your .env file exists in the correct directory
  2. Variable names match exactly (including VITE_ prefix)
  3. No quotes around the values
  4. You’ve restarted the development server after adding variables
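The first of those checks can be sketched as a tiny helper (hypothetical, not part of the codebase): given an environment object and the variable names the app expects, it reports which are missing or blank:

```typescript
// Hypothetical helper illustrating the credential check: returns the
// names of required variables that are absent or empty in `env`.
export function missingVars(
  env: Record<string, string | undefined>,
  required: string[]
): string[] {
  return required.filter((name) => {
    const value = env[name];
    return value === undefined || value.trim() === "";
  });
}

// Example: only the API key is set, so the config ID is reported missing.
const missing = missingVars(
  { VITE_HUME_API_KEY: "abc123" },
  ["VITE_HUME_API_KEY", "VITE_HUME_CONFIG_ID"]
);
// missing is ["VITE_HUME_CONFIG_ID"]
```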
Connection failures are usually caused by:
  1. Microphone permissions denied - Check browser settings
  2. Invalid API key - Verify credentials in Hume AI dashboard
  3. Backend server not running - Ensure Express server is running on port 5000
  4. Network issues - Check your internet connection
If the backend fails to start with a port conflict:
Server failed to start: Error: listen EADDRINUSE: address already in use :::5000
Either kill the process using port 5000 (for example, run lsof -ti :5000 | xargs kill on macOS/Linux) or change the PORT variable in your backend .env file.
The backend generates OAuth2 tokens using this flow:
Backend/utils/humeClient.ts
export const getHumeAccessToken = async (): Promise<string> => {
  const apiKey = process.env.VITE_HUME_API_KEY;
  const secretKey = process.env.HUME_SECRET_KEY;
  
  const authString = `${apiKey}:${secretKey}`;
  const encoded = Buffer.from(authString).toString('base64');
  
  const res = await fetch('https://api.hume.ai/oauth2-cc/token', {
    method: 'POST',
    headers: {
      'Authorization': `Basic ${encoded}`,
      'Content-Type': 'application/x-www-form-urlencoded',
    },
    body: new URLSearchParams({
      grant_type: 'client_credentials'
    }).toString(),
  });

  if (!res.ok) {
    throw new Error(`Hume token request failed: ${res.status}`);
  }

  // res.json() returns a promise, so await it before reading the token
  const data = await res.json();
  return data.access_token;
};
If authentication fails, verify both your API key and secret key are correct.

Get help

If you encounter issues not covered here, open an issue on the SvaraAI GitHub repository: https://github.com/whysooharsh/svaraAI/issues
