Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.pipeshub.com/llms.txt

Use this file to discover all available pages before exploring further.

Gemini Speech to Text Configuration

Gemini STT Configuration Interface The Gemini STT configuration screen in PipesHub where you’ll enter your API Key and Model Name PipesHub integrates with Google’s Gemini models to enable speech-to-text transcription in AI workflows and agents. Gemini’s multimodal capabilities allow it to process audio input and produce accurate transcriptions across a wide range of languages.

Required Fields

API Key *

The API Key is required to authenticate your requests to Google’s AI services. How to obtain an API Key:
  1. Log in to Google AI Studio
  2. Click “Create API key”
  3. Select or create a Google Cloud project
  4. Copy the generated API key
Security Note: Your API key should be kept secure and never shared publicly. PipesHub securely stores your API key and uses it only for authenticating requests to Google.

Model Name *

The Model Name field defines which Gemini model you want to use for transcription. Available Gemini STT models:
  • gemini-2.5-flash — fast, cost-effective transcription with strong accuracy
  • gemini-2.5-pro — highest accuracy transcription, best for complex audio
  • gemini-3-flash-preview — next-generation Flash model preview
How to choose a model:
  • For everyday transcription tasks where speed and cost matter, use gemini-2.5-flash
  • For the most accurate transcriptions, especially with complex audio, use gemini-2.5-pro
  • Check Google’s Gemini documentation for the most up-to-date model options

Optional Fields

Model Friendly Name

A human-readable label for this configuration, shown in the PipesHub UI instead of the raw model name. Useful when you have multiple configurations of the same provider.

Configuration Steps

As shown in the image above:
  1. Click Configure on the Gemini provider card in the STT tab
  2. Enter your Google AI API Key in the designated field (marked with *)
  3. Enter the Model Name (marked with *), for example gemini-2.5-flash
  4. (Optional) Set a Model Friendly Name
  5. Click Add Model to save and validate your credentials
Both the API Key and Model Name are required fields to successfully configure Gemini STT integration.

Usage Considerations

  • Each transcription request counts against your Google AI API quota and billing
  • Different models have different pricing — check Google AI’s pricing page for details
  • Audio is sent to Google’s API servers for processing
  • Gemini models support a wide range of languages natively

Troubleshooting

  • If you encounter authentication errors, verify your API key is correct and has not expired
  • Ensure your Google account has billing set up if you are on a paid tier
  • Check that the model name is spelled exactly as shown
  • Verify your API key has access to audio transcription features
For additional support, refer to the Google Gemini documentation or contact PipesHub support.