API Reference

Complete reference for all Verbatik REST API endpoints. All endpoints require authentication via API key.

Base URL

https://api.verbatik.com

Authentication

Authorization: Bearer vbt_your_api_key

Endpoints Overview

Method	Endpoint	Description
`GET`	`/api/v1/voices`	List pre-trained voices
`POST`	`/api/v1/tts`	Text-to-speech synthesis
`POST`	`/api/v1/voice-training`	Clone a voice from audio
`POST`	`/api/v1/voice-design`	Design a voice from description
`POST`	`/api/v1/voice-cloning`	Generate speech with a cloned voice
`GET`	`/api/v1/my-voices`	List your cloned/designed voices
`POST`	`/api/v1/text-to-music`	Generate music from text
`POST`	`/api/audio-upload`	Upload an audio file

GET /api/v1/voices

List available pre-trained TTS voices.

Query Parameters:

Parameter	Type	Description
`language`	string	Filter by language code (e.g., `en-US`).
`gender`	string	`Male`, `Female`, `Neutral`.
`search`	string	Search by name or language.

Response:

[
  {
    "id": "jenny-en-us",
    "name": "Jenny",
    "gender": "Female",
    "language_code": "en-US",
    "language_name": "English (United States)",
    "is_neural": true,
    "sample_url": "https://...",
    "styles": ["cheerful", "sad"]
  }
]

POST /api/v1/tts

Convert text to speech using pre-trained voices.

Header	Required	Description
`Content-Type`	Yes	`text/plain` or `application/ssml+xml`
`X-Voice-ID`	No	Voice slug. Default: `jenny-en-us`.
`X-Store-Audio`	No	`true` for URL response instead of binary.

Body: Plain text or SSML (max 25,000 characters). Cost: $0.025 per 1,000 characters.

POST /api/v1/voice-training

Clone a voice from an audio sample.

{
  "audio_url": "https://example.com/sample.mp3",
  "name": "My Voice",
  "noise_reduction": false,
  "volume_normalization": false,
  "accuracy": 0.8,
  "preview_text": "Hello, this is a preview."
}

Cost: $3.00 per voice.

Response:

{
  "success": true,
  "voice_id": "uuid",
  "name": "My Voice",
  "preview_url": "https://...",
  "cost_cents": 300,
  "balance_cents": 1700
}

POST /api/v1/voice-design

Create a voice from a text description.

{
  "prompt": "A warm, friendly female voice...",
  "name": "Friendly Voice",
  "preview_text": "Hello, this is a preview."
}

Cost: $3.00 per voice.

POST /api/v1/voice-cloning

Generate speech using a cloned or designed voice.

Header	Required	Description
`Content-Type`	Yes	`text/plain`
`X-Voice-ID`	Yes	Cloned voice UUID.
`X-Store-Audio`	No	`true` to store audio.
`X-Speed`	No	0.5–2.0 (default: 1).
`X-Volume`	No	0–10 (default: 1).
`X-Pitch`	No	-12 to 12 (default: 0).
`X-Emotion`	No	happy, sad, angry, fearful, disgusted, surprised, neutral.
`X-English-Normalization`	No	true/false.
`X-Voice-Modify-Pitch`	No	-100 to 100.
`X-Voice-Modify-Intensity`	No	-100 to 100.
`X-Voice-Modify-Timbre`	No	-100 to 100.
`X-Sample-Rate`	No	8000, 16000, 22050, 24000, 32000, 44100.
`X-Bitrate`	No	32000, 64000, 128000, 256000.
`X-Format`	No	mp3, pcm, flac.
`X-Language-Boost`	No	Language code for enhanced recognition.

Body: Plain text (max 5,000 characters). Cost: $0.08 per 1,000 characters.

GET /api/v1/my-voices

List all cloned and designed voices in your workspace.

Parameter	Type	Description
`status`	string	Filter: `pending`, `ready`, `failed`.

[
  {
    "id": "uuid",
    "name": "My Voice",
    "status": "ready",
    "preview_url": "https://...",
    "source_audio_url": "https://...",
    "created_at": "2025-01-15T10:30:00.000Z",
    "last_used_at": "2025-01-20T14:00:00.000Z"
  }
]

POST /api/v1/text-to-music

Generate music from text prompts.

{
  "prompt": "An upbeat electronic track",
  "tags": ["electronic", "upbeat"],
  "lyrics": "Feel the rhythm...",
  "num_songs": 1,
  "output_format": "mp3",
  "store_audio": true,
  "name": "My Track"
}

Cost: $0.20 per minute of audio.

POST /api/audio-upload

Upload an audio file for use with voice cloning.

Content-Type: multipart/form-data

Returns a URL for use with the voice-training endpoint.

Common Error Responses

Status	Description
`400`	Bad request — invalid parameters or missing required fields.
`401`	Unauthorized — invalid or missing API key.
`402`	Payment required — insufficient balance.
`403`	Forbidden — no access to this resource.
`404`	Not found — resource does not exist.
`429`	Rate limit exceeded — too many requests.
`500`	Internal server error — unexpected failure.

All errors follow this format:

{
  "success": false,
  "error": "Description of the error"
}

CORS

All endpoints support CORS:

Access-Control-Allow-Origin: *
Access-Control-Allow-Methods: GET, POST, OPTIONS
Access-Control-Allow-Headers: Content-Type, Authorization
Access-Control-Max-Age: 86400

API Reference

On this page