Voice-First AI Navigation

Build voice-first experiences where an AI agent can understand natural speech, navigate your app’s UI automatically, and execute frontend or backend functions in real time — all powered by seamless voice interaction, without clicks.

Get Started API Reference

Quick start

Get your voice-enabled application up and running in minutes

Install the packages

Install the backend and frontend packages for your platform.

npm install @navai/voice-backend @navai/voice-frontend

Set up your backend

Configure your Express server with NAVAI routes to handle client secrets and function execution.

import express from "express";
import { registerNavaiExpressRoutes } from "@navai/voice-backend";

const app = express();
app.use(express.json());

registerNavaiExpressRoutes(app);

app.listen(3000, () => {
  console.log("Server running on port 3000");
});

Set your OPENAI_API_KEY environment variable to enable the Realtime API integration.

Add voice to your React app

Use the useWebVoiceAgent hook to add voice interaction to your frontend.

import { useWebVoiceAgent } from "@navai/voice-frontend";

function App() {
  const { isConnected, connect, disconnect } = useWebVoiceAgent({
    backendUrl: "http://localhost:3000"
  });

  return (
    <div>
      <button onClick={isConnected ? disconnect : connect}>
        {isConnected ? "Disconnect" : "Start Voice"}
      </button>
    </div>
  );
}

Define voice-triggered functions

Create functions that can be executed via voice commands.

export const weatherFunction = {
  name: "get_weather",
  description: "Get the current weather for a location",
  parameters: {
    type: "object",
    properties: {
      location: { type: "string" }
    }
  },
  run: async ({ location }) => {
    return { temperature: 72, condition: "sunny" };
  }
};

Learn more in the function execution guide.

Explore by platform

Choose your development platform to get started

Web applications

Build voice-enabled React web apps with automatic UI navigation

Mobile apps

Add voice to React Native and Expo apps with WebRTC transport

Backend services

Set up secure backend routes for client secrets and function execution

Core features

Everything you need for voice-first experiences

Real-time voice interaction

UI navigation

Let users navigate your app with voice commands — no clicks required

Function execution

Execute frontend and backend functions dynamically via voice

Multi-platform support

Works with React, React Native, Expo, and any backend framework

Secure credentials

Ephemeral client secrets for secure OpenAI Realtime API access

Multilingual

Support multiple languages with customizable accents and tones

Ready to build voice-first?

Get started with the quickstart guide or explore the API reference to integrate NAVAI into your application.

View Quickstart Learn Core Concepts

Get Started

Core Concepts

Backend Integration

Frontend Integration

Mobile Integration

Guides

Voice-First AI Navigation

Quick start

Explore by platform

Web applications

Mobile apps

Backend services

Core features

Real-time voice interaction

UI navigation

Function execution

Multi-platform support

Secure credentials

Multilingual

Ready to build voice-first?

Build docs developers (and LLMs) love