Skip to content
MiniMax logo

MiniMax Speech 2.8 Turbo

Text-to-SpeechMiniMaxProxied

MiniMax Speech 2.8 Turbo turns text into natural, expressive speech with voice cloning, emotion control, and 40+ language support at faster speeds.

Model Info
Terms and Licenselink
More informationlink
PricingView pricing in the Cloudflare dashboard

Usage

TypeScript
const response = await env.AI.run(
'minimax/speech-2.8-turbo',
{
text: 'Hello! Welcome to Cloudflare AI Gateway. Let me show you what we can do.',
voice_id: 'English_expressive_narrator',
speed: 1,
volume: 1,
pitch: 0,
format: 'mp3',
},
{
gateway: { id: 'default' },
}
)
console.log(response)

Examples

Fast Narration — Speed up narration for quick playback
TypeScript
const response = await env.AI.run(
'minimax/speech-2.8-turbo',
{
text: 'This is a fast-paced summary of the key findings from the quarterly report. Revenue is up fifteen percent and user growth exceeded expectations.',
voice_id: 'English_expressive_narrator',
speed: 1.5,
volume: 1,
pitch: 0,
format: 'mp3',
},
{
gateway: { id: 'default' },
}
)
console.log(response)
Calm Tone — Calm and steady speech for meditation or relaxation
TypeScript
const response = await env.AI.run(
'minimax/speech-2.8-turbo',
{
text: 'Take a deep breath in. Hold it for a moment. Now slowly exhale. Let your shoulders relax and release any tension.',
voice_id: 'English_expressive_narrator',
speed: 0.8,
volume: 1,
pitch: 0,
emotion: 'calm',
format: 'mp3',
},
{
gateway: { id: 'default' },
}
)
console.log(response)
Adjusted Pitch — Lower the pitch for a deeper voice
TypeScript
const response = await env.AI.run(
'minimax/speech-2.8-turbo',
{
text: 'Good evening. Tonight we explore the mysteries of the deep ocean and the creatures that live in total darkness.',
voice_id: 'English_expressive_narrator',
speed: 1,
volume: 1,
pitch: -6,
format: 'mp3',
},
{
gateway: { id: 'default' },
}
)
console.log(response)

Parameters

text
stringrequiredmaxLength: 10000The text to convert to speech. Maximum 10,000 characters.
voice_id
stringrequireddefault: English_expressive_narratorThe voice ID to use for synthesis
speed
numberrequireddefault: 1minimum: 0.5maximum: 2Speech speed (0.5 to 2)
volume
numberrequireddefault: 1minimum: 0maximum: 10Speech volume (0 to 10)
pitch
integerrequireddefault: 0minimum: -12maximum: 12Pitch adjustment (-12 to 12)
emotion
stringenum: happy, sad, angry, fearful, disgusted, surprised, calm, fluentEmotion control for synthesized speech
format
stringrequireddefault: mp3enum: mp3, flac, wavOutput audio format

API Schemas (Raw)

Input
Output