VeenaMax TTS

VeenaMAX transforms text into expressive, real-time speech across multiple Indian languages for seamless communication.


API

The API can be called from the programming language of your choice. The example below uses Python.

POST
import requests

api_key = "YOUR_API_KEY"
url = "https://api.segmind.com/v1/veena-max-tts"

# Request payload
data = {
    "text": "Segmind lagao, model chalao, itna tez ki result aane se pehle chai bhi tthandi na ho.",
    "speaker_id": "vinaya_assist",
    "normalize": True
}

headers = {'x-api-key': api_key}

response = requests.post(url, json=data, headers=headers)

# The response body is the generated audio (audio/mp3); save it to a file
with open("output.mp3", "wb") as f:
    f.write(response.content)
RESPONSE
audio/mp3
HTTP Response Codes
200 - OK: Audio generated
401 - Unauthorized: User authentication failed
404 - Not Found: The requested URL does not exist
405 - Method Not Allowed: The requested HTTP method is not allowed
406 - Not Acceptable: Not enough credits
500 - Server Error: The server had an issue processing the request
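
The response codes listed above can be checked before the body is treated as audio. The following is a minimal sketch, not part of the official client: `describe_status` and `audio_or_error` are hypothetical helper names, and the messages are paraphrased from the list above.

```python
# Descriptions paraphrased from the documented response codes.
STATUS_MESSAGES = {
    200: "OK: audio generated",
    401: "Unauthorized: user authentication failed",
    404: "Not Found: the requested URL does not exist",
    405: "Method Not Allowed: the requested HTTP method is not allowed",
    406: "Not Acceptable: not enough credits",
    500: "Server Error: the server had an issue processing the request",
}


def describe_status(status_code: int) -> str:
    """Return a readable description for a documented status code."""
    return STATUS_MESSAGES.get(status_code, f"Unexpected status code: {status_code}")


def audio_or_error(status_code: int, body: bytes) -> bytes:
    """Return the audio bytes on 200; raise RuntimeError for any other code."""
    if status_code == 200:
        return body
    raise RuntimeError(describe_status(status_code))
```

With the `requests` example above, this would be called as `audio_or_error(response.status_code, response.content)`.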

Attributes


text (str, required)

Provide the text to convert into speech. Use greetings or instructions, like 'Welcome to VeenaMAX, your TTS solution.'


speaker_id (enum: str, required)

Choose a voice for your text. For a calm tone, select 'soumya_calm'; for impact, select 'agastya_impact'.

Allowed values:


normalize (bool, default: true)

Enable text normalization for better pronunciation. Use this for complex texts or mixed languages.

To keep track of your credit usage, inspect the response headers of each API call. The x-remaining-credits header reports the number of credits remaining in your account. Monitor this value to avoid disruptions in your API usage.
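
As a rough sketch of reading that header, the helper below accepts any mapping of header names to values, such as `response.headers` from `requests`. The function name `remaining_credits` is hypothetical, and the assumption that the value parses as a number is not confirmed by the documentation.

```python
from typing import Mapping, Optional


def remaining_credits(headers: Mapping[str, str]) -> Optional[float]:
    """Return the x-remaining-credits value, or None if absent or not numeric.

    Assumption: the header value is a plain number.
    """
    # Compare lowercased names so plain dicts work as well as
    # case-insensitive header containers.
    for name, value in headers.items():
        if name.lower() == "x-remaining-credits":
            try:
                return float(value)
            except (TypeError, ValueError):
                return None
    return None
```

A caller might log a warning when the returned value drops below a threshold of its own choosing.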

Resources to get you started

Everything you need to know to get the most out of VeenaMax TTS

VeenaMAX: Effective Usage Guide

VeenaMAX is Maya Research’s cutting-edge Text-to-Speech (TTS) solution, purpose-built for Indian languages and Hinglish content. With eight unique voice personalities, real-time streaming, automatic script detection, and domain-specific terminology support, VeenaMAX delivers natural, expressive, studio-quality audio for diverse applications—from IVR systems and customer support to e-learning and healthcare.

1. Quick Setup

  1. Sign up for the Maya Research TTS API.
  2. Retrieve your API key and endpoint information.
  3. Choose between streaming and non-streaming modes:
    • Streaming: Real-time audio for chatbots, voice assistants, and live translations.
    • Non-Streaming: Batch generation of full audio files for downloadable content.

2. Core Parameters

Use the following JSON payload for every request:

{
  "text": "Your content here.",
  "speaker_id": "vinaya_assist",
  "normalize": true
}

  • text (string, required): Input text to convert.
  • speaker_id (enum, required): Select one of eight voices.
  • normalize (bool, optional): Enable for better pronunciation of numbers, acronyms, and mixed languages.
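
One way to assemble this payload in Python is sketched below. `build_payload` is a hypothetical helper, not part of any SDK; it only checks that the two required fields are present and does not validate `speaker_id` against the list of voices, since the full list is not reproduced here.

```python
def build_payload(text: str, speaker_id: str, normalize: bool = True) -> dict:
    """Build the request body with the three documented parameters."""
    if not isinstance(text, str) or not text.strip():
        raise ValueError("text is required and must be a non-empty string")
    if not isinstance(speaker_id, str) or not speaker_id.strip():
        raise ValueError("speaker_id is required and must be a non-empty string")
    return {"text": text, "speaker_id": speaker_id, "normalize": bool(normalize)}
```

The resulting dict can be passed as the `json=` argument of `requests.post`.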

3. Voice Selection by Use Case

– IVR & Customer Support
• speaker_id: vinaya_assist (neutral, helpful)
• normalize: true
Ideal for troubleshooting prompts, account inquiries, and telephony menus.

– E-Learning & Educational Content
• speaker_id: soumya_calm (steady, clear)
• normalize: true
Ensures attention retention and accurate reading of technical terms.

– Marketing & Announcements
• speaker_id: agastya_impact (dynamic, engaging)
• normalize: false (retain branded stylizations)
Delivers high-energy calls to action and promotional scripts.

– Conversational Chatbots & Accessibility
• speaker_id: charu_soft or mohini_whispers (soothing, gentle)
• normalize: true
Creates warm, empathetic interactions for visually impaired users.
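
The recommendations above could be kept in a small lookup table so that callers pick a use case rather than remembering voice names. This is an illustrative sketch: the use-case keys and the `payload_for` function are hypothetical, and `charu_soft` is chosen arbitrarily where the guide offers `charu_soft` or `mohini_whispers`.

```python
# Settings copied from the use-case recommendations above.
# The dictionary keys are hypothetical labels, not API values.
VOICE_PRESETS = {
    "ivr_support": {"speaker_id": "vinaya_assist", "normalize": True},
    "e_learning": {"speaker_id": "soumya_calm", "normalize": True},
    "marketing": {"speaker_id": "agastya_impact", "normalize": False},
    "chatbot_accessibility": {"speaker_id": "charu_soft", "normalize": True},
}


def payload_for(use_case: str, text: str) -> dict:
    """Return a request payload for the given use case.

    Raises KeyError if the use case has no preset.
    """
    preset = VOICE_PRESETS[use_case]
    return {"text": text, **preset}
```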

4. Best Practices & Tips

  • Punctuation & Pauses: Use commas, periods, and ellipses to introduce natural breaks.
  • Phonetic Spellings: Spell uncommon names or jargon phonetically to improve accuracy.
  • Language Mixing: Rely on VeenaMAX’s auto script detection; no manual tagging is needed.
  • Terminology Handling: Enable normalize for domain-heavy content (finance, healthcare).
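
The phonetic-spelling tip can be applied as a preprocessing step before the text is sent. The sketch below is one possible approach, assuming a caller-maintained dictionary of respellings; `apply_phonetic_spellings` is a hypothetical helper, and the respellings in the usage note are placeholders rather than verified pronunciations.

```python
import re
from typing import Mapping


def apply_phonetic_spellings(text: str, spellings: Mapping[str, str]) -> str:
    """Replace whole-word, case-sensitive matches of each key with its respelling."""
    if not spellings:
        return text
    # Try longer keys first so a short key never pre-empts a longer one.
    keys = sorted(spellings, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(k) for k in keys) + r")\b")
    return pattern.sub(lambda match: spellings[match.group(1)], text)
```

For example, `apply_phonetic_spellings("Welcome, Ms. Xiomara.", {"Xiomara": "See-o-mara"})` rewrites only the listed name and leaves the rest of the sentence, including its punctuation, untouched.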

5. Optimizing Output Quality

  • Streaming Latency: Reduce buffer size in your audio player for faster playback.
  • Volume Normalization: Post-process with a limiter for consistent loudness in multi-segment audio.
  • Batch Generation: Use non-streaming mode for producing large libraries of pre-recorded voiceovers.
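
Batch generation could look roughly like the sketch below. It makes no assumptions about the HTTP call itself: `generate_batch` is a hypothetical helper that takes a `synthesize` callable (for instance, a wrapper around the non-streaming request) returning MP3 bytes for a given text, and writes one file per script.

```python
from pathlib import Path
from typing import Callable, Dict, List


def generate_batch(
    scripts: Dict[str, str],
    synthesize: Callable[[str], bytes],
    out_dir: str,
) -> List[Path]:
    """Write one .mp3 file per (name, text) pair and return the written paths."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for name, text in scripts.items():
        path = out / f"{name}.mp3"
        # synthesize is supplied by the caller and must return audio bytes.
        path.write_bytes(synthesize(text))
        written.append(path)
    return written
```

Passing the synthesis step in as a callable keeps the loop testable without network access and leaves retry or rate-limit handling to the caller.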

By following this guide and selecting the right speaker_id plus the normalize flag, you can harness VeenaMAX’s full potential—transforming text into rich, context-aware speech that resonates with Indian language audiences.
