Transcribe audio to text using OpenAI-compatible format
/v1/audio/transcriptions endpoint provides OpenAI-compatible audio transcription. It accepts multipart audio file uploads and returns transcribed text.
This endpoint:
gpt-4o-transcribe only)Bearer YOUR_API_KEYmultipart/form-data encoding.
"gpt-4o-transcribe".Any other value will return a 400 error with:400 Bad Request
Missing File:
400 Bad Request
Model Access Denied:
403 Forbidden
Rate Limit Exceeded:
429 Too Many Requests
Upstream Error:
502 Bad Gateway
No Accounts Available:
503 Service Unavailable
gpt-4o-transcribe.
This is enforced for OpenAI API compatibility. If you need to use a different transcription model, contact your administrator.
allowed_models configured, it must include gpt-4o-transcribe to use this endpoint.
API Key Configuration:
403 Forbidden and model_not_allowed error.
Valid Configuration:
/v1/models (note: gpt-4o-transcribe may not appear in the models list but is still accessible if allowed).
gpt-4o-transcribe.
Rate limit headers:
/v1/audio/transcriptions format with these specifics:
Similarities:
file and model parametersprompt parametertext fieldgpt-4o-transcribe is supported (OpenAI supports whisper-1 and variants)/backend-api/transcribe (internal format, no model parameter required)/v1/chat/completions (text generation with chat format)/v1/responses (text generation with responses format)