Gemini Speech to Text Configuration

The Gemini STT configuration screen in PipesHub where you’ll enter your API Key and Model Name PipesHub integrates with Google’s Gemini models to enable speech-to-text transcription in AI workflows and agents. Gemini’s multimodal capabilities allow it to process audio input and produce accurate transcriptions across a wide range of languages.

Required Fields

API Key *

The API Key is required to authenticate your requests to Google’s AI services. How to obtain an API Key:

Log in to Google AI Studio
Click “Create API key”
Select or create a Google Cloud project
Copy the generated API key

Security Note: Your API key should be kept secure and never shared publicly. PipesHub securely stores your API key and uses it only for authenticating requests to Google.

Model Name *

The Model Name field defines which Gemini model you want to use for transcription. Available Gemini STT models:

gemini-2.5-flash — fast, cost-effective transcription with strong accuracy
gemini-2.5-pro — highest accuracy transcription, best for complex audio
gemini-3-flash-preview — next-generation Flash model preview

How to choose a model:

For everyday transcription tasks where speed and cost matter, use gemini-2.5-flash
For the most accurate transcriptions, especially with complex audio, use gemini-2.5-pro
Check Google’s Gemini documentation for the most up-to-date model options

Optional Fields

Model Friendly Name

A human-readable label for this configuration, shown in the PipesHub UI instead of the raw model name. Useful when you have multiple configurations of the same provider.

Configuration Steps

As shown in the image above:

Click Configure on the Gemini provider card in the STT tab
Enter your Google AI API Key in the designated field (marked with *)
Enter the Model Name (marked with *), for example gemini-2.5-flash
(Optional) Set a Model Friendly Name
Click Add Model to save and validate your credentials

Both the API Key and Model Name are required fields to successfully configure Gemini STT integration.

Usage Considerations

Each transcription request counts against your Google AI API quota and billing
Different models have different pricing — check Google AI’s pricing page for details
Audio is sent to Google’s API servers for processing
Gemini models support a wide range of languages natively

Troubleshooting

If you encounter authentication errors, verify your API key is correct and has not expired
Ensure your Google account has billing set up if you are on a paid tier
Check that the model name is spelled exactly as shown
Verify your API key has access to audio transcription features

For additional support, refer to the Google Gemini documentation or contact PipesHub support.

Welcome To PipesHub

System Overview

Authentication

Mail Configuration

AI Providers

Connectors

Integrations

Agents

Toolsets

User Management

Deployment

Developer

Additional Resources

Gemini

Gemini Speech to Text Configuration

Required Fields

API Key *

Model Name *

Optional Fields

Model Friendly Name

Configuration Steps

Usage Considerations

Troubleshooting

Welcome To PipesHub

System Overview

Authentication

Mail Configuration

AI Providers

Connectors

Integrations

Agents

Toolsets

User Management

Deployment

Developer

Additional Resources

Documentation Index

​Gemini Speech to Text Configuration

​Required Fields

​API Key *

​Model Name *

​Optional Fields

​Model Friendly Name

​Configuration Steps

​Usage Considerations

​Troubleshooting

Gemini Speech to Text Configuration

Required Fields

API Key *

Model Name *

Optional Fields

Model Friendly Name

Configuration Steps

Usage Considerations

Troubleshooting