Generating Sounds with AI

I’d like to think I’m someone with good taste in sounds. I listen to a lot of music, I play a lot of games, I interact with a lot of apps. I know what sounds I like, and I absolutely know what sucks. The problem is I’m not a sound engineer. But luckily we’re in 2026 and AI is a thing, so recently I’ve been using Cursor to help me generate custom sounds for my projects. It’s actually been a charm and saves me having to source sounds online like the old days.

The Web Audio API

It starts with the Web Audio API. This is the backbone of all the sounds. It’s what lets you generate audio programmatically in the browser without needing audio files. It’s been available across browsers since April 2021 and works on everything modern. The docs for it are extremely dense and kind of a bore to read through unless you’re someone who understands sound completely (not me). So what I did, is funnel it into Cursor and started conversing with it.

Prompting Cursor

The first few prompts were actually quite good. I was able to get a click sound from scratch by just describing what I wanted. However, I wanted to build a library of sounds, so I needed to be able to describe the sounds I wanted and have Cursor be able to generate them from scratch. To do this, I like to understand what makes a sound good or bad. This is a process of trial and error, but it’s a process that can be automated with AI. Cursor told me that I needed to understand the math behind the sounds I wanted to create. Mainly that some sounds required things like “filtered noise” or “oscillators” or “envelopes”. I had no idea what “filtered noise” meant, so I asked. Basically, noise is just random values. By itself it sounds harsh and unnatural, but when used right is rather tasteful.

Filtering noise with a bandpass filter keeps only a specific range of frequencies, which is what makes it sound like a click. When you filter noise, you can make it sound like a click, a whoosh, a thud, depending on what frequencies you keep.

I asked Cursor to show me different filter types on the same noise. lowpass made it muffled. highpass made it thin and harsh. bandpass in the right range made it sound like a click. Once I had something that resembled a click, it was wrong in ways I could describe but not fix. I wanted a click sound that was short and percussive, but it was either too long or too short. It was either too harsh or too muffled. It was either too high or too low. Finally, when I asked it to “fade naturally,” it introduced me to envelopes. Envelopes are how sounds change volume over time. It explained that real sounds don’t just stop, they decay. And they decay exponentially, not linearly.

Basically you can split it into two parts: the attack and the decay. Turning up the attack makes the sound louder and faster, while turning up the decay makes the sound quieter and slower.

I spent some time going back and forth with Cursor, tweaking the attack and decay until I had a click sound I actually liked. Once I understood the basic loop, I started building out more sounds. I described what I wanted and listened to the results. I described what was wrong and repeated the process until I had a sound I liked.

The Sound Library

Here’s an example of what the implementation looks like. This is a complete sound library built using the Web Audio API:

let audioContext: AudioContext | null = null;

function getAudioContext(): AudioContext {
  if (!audioContext) {
    audioContext = new AudioContext();
  }
  if (audioContext.state === "suspended") {
    audioContext.resume();
  }
  return audioContext;
}

export const sounds = {
  click: () => {
    try {
      const ctx = getAudioContext();
      const t = ctx.currentTime;

      // Create noise buffer
      const noise = ctx.createBufferSource();
      const buf = ctx.createBuffer(1, ctx.sampleRate * 0.008, ctx.sampleRate);
      const data = buf.getChannelData(0);
      for (let i = 0; i < data.length; i++) {
        data[i] = (Math.random() * 2 - 1) * Math.exp(-i / 50);
      }
      noise.buffer = buf;

      // Apply bandpass filter
      const filter = ctx.createBiquadFilter();
      filter.type = "bandpass";
      filter.frequency.value = 4000 + Math.random() * 1000;
      filter.Q.value = 3;

      // Control volume
      const gain = ctx.createGain();
      gain.gain.value = 0.5 + Math.random() * 0.15;

      // Connect and play
      noise.connect(filter);
      filter.connect(gain);
      gain.connect(ctx.destination);
      noise.start(t);
    } catch {}
  },

  success: () => {
    try {
      const ctx = getAudioContext();
      const t = ctx.currentTime;

      // Three ascending notes
      const notes = [523.25, 659.25, 783.99];
      const spacing = 0.08;

      notes.forEach((freq, i) => {
        const osc = ctx.createOscillator();
        const osc2 = ctx.createOscillator();
        const gain = ctx.createGain();
        const filter = ctx.createBiquadFilter();

        osc.type = "triangle";
        osc.frequency.value = freq;
        osc2.type = "sine";
        osc2.frequency.value = freq * 2;

        filter.type = "lowpass";
        filter.frequency.value = 3000;

        const start = t + i * spacing;
        const duration = 0.15;

        // Envelope: attack and decay
        gain.gain.setValueAtTime(0, start);
        gain.gain.linearRampToValueAtTime(0.25, start + 0.01);
        gain.gain.exponentialRampToValueAtTime(0.001, start + duration);

        osc.connect(gain);
        osc2.connect(gain);
        gain.connect(filter);
        filter.connect(ctx.destination);

        osc.start(start);
        osc2.start(start);
        osc.stop(start + duration);
        osc2.stop(start + duration);
      });
    } catch {}
  },
};

Key Concepts

Filtered Noise: Random audio data passed through filters to create percussive sounds like clicks and ticks. Oscillators: Tone generators that create sine, triangle, sawtooth, and square waves for melodic sounds. Envelopes: Control how volume changes over time using attack (fade in) and decay (fade out) parameters. Filters: Shape the frequency content of sounds:

lowpass - Keeps low frequencies, removes highs (muffled)
highpass - Keeps high frequencies, removes lows (thin)
bandpass - Keeps a specific frequency range (focused)

The Web Audio API gives you complete control over sound generation. You can create click, pop, toggle, whoosh, success, error, and warning sounds all programmatically.

Looking Back

I went into this knowing nothing about audio engineering and came out with a library of sounds. Sounds I understand and can tweak instead of random files I downloaded. The process was collaborative. I provided taste and AI provided implementation. Neither of us could have done it alone. What I learned is that sometimes not knowing the technicalities is actually a good thing. Using Cursor as a translator helped me turn feelings into logic.

When trying to create something new, it’s easy to get stuck in your own head and think you need to know everything about the thing you’re trying to create. This is not the case anymore.

You can get a long way with just a few questions and a lot of listening. Try it out for yourself and see what you can create.

Get Started

Design Principles

Animation & Motion

Visual Design

Sound & Audio

The Web Audio API

Prompting Cursor

The Sound Library

Key Concepts

Looking Back

Build docs developers (and LLMs) love

Get Started

Design Principles

Animation & Motion

Visual Design

Sound & Audio

​The Web Audio API

​Prompting Cursor

​The Sound Library

​Key Concepts

​Looking Back

Build docs developers (and LLMs) love

The Web Audio API

Prompting Cursor

The Sound Library

Key Concepts

Looking Back