OpenAI Text to Speech Configuration

Figure: OpenAI TTS Configuration Interface. The OpenAI TTS configuration screen in PipesHub, where you enter your API Key, Model Name, Voice, and Audio Format.

PipesHub allows you to integrate with OpenAI’s text-to-speech (TTS) models to enable voice output in AI workflows and agents. Once configured, PipesHub can convert AI-generated text responses into natural-sounding spoken audio using OpenAI’s TTS models.

Required Fields

API Key *

The API Key is required to authenticate your requests to OpenAI’s TTS services.

How to obtain an API Key:
  1. Log in to your OpenAI account
  2. Navigate to the API Keys section
  3. Create a new secret key
  4. Copy the key immediately (it will only be shown once)
Security Note: Your API key should be kept secure and never shared publicly. PipesHub securely stores your API key and uses it only for authenticating requests to OpenAI.

Model Name *

The Model Name field defines which OpenAI TTS model you want to use with PipesHub.

Available OpenAI TTS models:
  • tts-1 - Standard quality TTS model optimised for real-time, low-latency applications
  • tts-1-hd - Higher quality TTS model optimised for audio fidelity
  • gpt-4o-mini-tts - GPT-4o Mini powered TTS with natural, expressive speech
How to choose a model:
  • For real-time voice output where speed matters, select tts-1
  • For higher audio quality, select tts-1-hd
  • For the most natural and expressive speech, select gpt-4o-mini-tts
  • Check OpenAI’s TTS documentation for the most up-to-date options

Optional Fields

Voice

Select the default voice used for audio output. You can change this at any time by editing the configuration.

Available voices (default: Alloy):
  • Alloy — balanced, neutral tone
  • Echo — warm, conversational tone
  • Fable — expressive, storytelling tone
  • Onyx — deep, authoritative tone
  • Nova — energetic, upbeat tone
  • Shimmer — clear, friendly tone

Audio Format

Select the output audio format for generated speech.

Available formats (default: MP3):
  • MP3 — widely compatible, good compression (default)
  • Opus — optimised for internet streaming, low latency
  • AAC — good quality with broad device compatibility
  • FLAC — lossless, highest quality, larger file size
  • WAV — uncompressed, broad compatibility
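To make the role of these fields concrete, the sketch below shows how Model Name, Voice, and Audio Format would map onto a request to OpenAI’s public speech endpoint (`POST /v1/audio/speech`). It assumes the lowercase identifiers OpenAI’s API uses (`alloy`, `mp3`, and so on); the helper name `build_speech_request` and the placeholder key are hypothetical, and how PipesHub issues the call internally is not documented here.

```python
import json
import urllib.request

# OpenAI's public speech endpoint. PipesHub's internal call path may differ.
OPENAI_SPEECH_URL = "https://api.openai.com/v1/audio/speech"


def build_speech_request(api_key, text, model="tts-1", voice="alloy", audio_format="mp3"):
    """Build (but do not send) a text-to-speech request.

    Hypothetical helper: the defaults mirror the PipesHub defaults
    described above (voice Alloy, format MP3).
    """
    payload = {
        "model": model,                   # Model Name field
        "input": text,                    # text to convert to speech
        "voice": voice,                   # Voice field, lowercase in the API
        "response_format": audio_format,  # Audio Format field, lowercase in the API
    }
    return urllib.request.Request(
        OPENAI_SPEECH_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Placeholder key for illustration only; never hard-code a real key.
request = build_speech_request(
    "sk-your-key", "Hello from PipesHub", voice="nova", audio_format="opus"
)
body = json.loads(request.data)
```

Sending the request with `urllib.request.urlopen(request)` would return the audio bytes in the chosen format, which can then be written to a file.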

Configuration Steps

As shown in the image above:
  1. Click Configure on the OpenAI provider card
  2. Enter your OpenAI API Key in the designated field (marked with *)
  3. Specify your desired Model Name (marked with *)
  4. (Optional) Select your preferred Voice from the dropdown (default: Alloy)
  5. (Optional) Select your preferred Audio Format (default: MP3)
  6. (Optional) Set a Model Friendly Name — a human-readable label shown in the UI
  7. Click Add Model to save and validate your credentials
Both the API Key and Model Name are required to configure the OpenAI TTS integration.
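The required and optional fields can be summarised as a small pre-flight check. This is a minimal sketch, not PipesHub’s actual validation: the `TTSConfig` class, its field names, and the `validate` function are hypothetical, and the set of known models simply copies the list on this page, which may change over time.

```python
from dataclasses import dataclass

# Model names listed on this page; OpenAI may add or retire models.
KNOWN_MODELS = {"tts-1", "tts-1-hd", "gpt-4o-mini-tts"}


@dataclass
class TTSConfig:
    """Hypothetical container mirroring the form fields."""
    api_key: str               # required
    model_name: str            # required
    voice: str = "alloy"       # optional, default Alloy
    audio_format: str = "mp3"  # optional, default MP3
    friendly_name: str = ""    # optional label shown in the UI


def validate(config):
    """Return a list of problems; an empty list means the config looks usable."""
    problems = []
    if not config.api_key.strip():
        problems.append("API Key is required")
    if not config.model_name.strip():
        problems.append("Model Name is required")
    elif config.model_name not in KNOWN_MODELS:
        # Model names are case-sensitive: "tts-1" is valid, "TTS-1" is not.
        problems.append(f"Unrecognised model name: {config.model_name!r}")
    return problems


ok = validate(TTSConfig(api_key="sk-your-key", model_name="tts-1"))
bad = validate(TTSConfig(api_key="", model_name="TTS-1"))
```

Here `ok` is an empty list, while `bad` reports both the missing key and the miscased model name.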

Usage Considerations

  • Each TTS request will count against your OpenAI account’s quota and billing
  • Different models have different pricing — check OpenAI’s pricing page for details
  • The higher-quality model (tts-1-hd) costs more per character than the standard model (tts-1)
  • Longer texts cost proportionally more, because usage is billed by the amount of input (characters or tokens, depending on the model)
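Because cost scales with input length, a rough estimate is simple arithmetic. The sketch below assumes per-character billing and uses a deliberately made-up rate; `estimate_cost` and `PLACEHOLDER_RATE` are hypothetical names, and the real rates must be taken from OpenAI’s pricing page.

```python
def estimate_cost(text, rate_per_million_chars):
    """Estimate the cost of one TTS request under per-character billing."""
    return len(text) * rate_per_million_chars / 1_000_000


# Placeholder rate for illustration only, NOT a real OpenAI price.
PLACEHOLDER_RATE = 10.0

# 50,000 characters at the placeholder rate.
cost = estimate_cost("a" * 50_000, PLACEHOLDER_RATE)
```

Doubling the text length doubles the estimate, which is the proportionality described above.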

Troubleshooting

  • If you encounter authentication errors, verify that your API key was copied correctly and has not been revoked or deleted
  • Ensure your OpenAI account has billing set up
  • Check that the model name is spelled correctly (e.g. tts-1, not TTS-1)
  • Verify your account has access to the TTS API
For additional support, refer to the OpenAI TTS documentation or contact PipesHub support.
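The checks above correspond roughly to the HTTP status codes that OpenAI’s API returns. The mapping below is a hedged sketch based on OpenAI’s general error-code conventions; the `diagnose` helper is hypothetical, and how PipesHub surfaces these errors in its UI is not covered here.

```python
# Likely causes per HTTP status, following OpenAI's general API error conventions.
LIKELY_CAUSES = {
    401: "API key is missing, incorrect, or revoked",
    404: "model name is misspelled or not available to this account",
    429: "rate limit reached, or quota/billing is exhausted",
}

DEFAULT_CAUSE = "unexpected error; read the error message returned by OpenAI"


def diagnose(status_code):
    """Map an HTTP status code to the most likely troubleshooting step."""
    return LIKELY_CAUSES.get(status_code, DEFAULT_CAUSE)


hint = diagnose(401)
```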