Google Gemini Setup Tutorial

← Back to AI Providers

This tutorial will guide you through setting up Google Gemini's Live API for use with Sokuji. Gemini offers excellent multilingual support and automatic audio processing capabilities.

Good News: Gemini API offers generous free quotas and competitive pricing compared to other providers.

Step 1: Access Google AI Studio

Visit Google AI Studio and sign in with your Google account.

Step 2: Create an API Key

In Google AI Studio:

Click on "Get API key" in the left sidebar
Click "Create API key"
Select "Create API key in new project" (recommended) or choose an existing project
Your API key will be generated and displayed

Security Note: Copy and save your API key immediately. Store it securely and never share it publicly.

Step 3: Enable Required APIs

The API key should automatically have access to Gemini models. If you encounter issues, visit the Google Cloud Console and ensure the Gemini API is enabled for your project.

Step 4: Configure in Sokuji

Open Sokuji and navigate to the Settings panel:

Select "Gemini" as your AI provider
Paste your API key in the "Gemini API Key" field
Choose your preferred model (recommended: gemini-2.0-flash-live-001)
Select a voice (e.g., Aoede, Puck, or Charon)
Configure your source and target languages
Adjust temperature and token settings if needed
Click Save Settings

Step 5: Test Your Setup

Start a translation session to verify everything is working:

Click the Start Session button in Sokuji
Speak into your microphone
You should hear the translated audio output

Gemini-Specific Features

Automatic Processing

Gemini handles turn detection automatically, so you don't need to configure threshold or silence duration settings. The model intelligently determines when you've finished speaking.

Voice Personalities

Gemini offers 30 unique voice personalities with different characteristics. Experiment with different voices to find one that suits your use case:

Aoede: Clear and professional
Puck: Friendly and casual
Charon: Deep and authoritative
Kore: Warm and engaging

Troubleshooting

Common Issues

API Key Error: Ensure your API key is valid and has not expired. Check that the Gemini API is enabled in your Google Cloud project.

Quota Exceeded: Check your usage limits in Google AI Studio and consider upgrading if needed.

Language Not Supported: Verify that both your source and target languages are supported by Gemini. Some language pairs may have better quality than others.

Need More Help? Check the Gemini API documentation or visit our GitHub repository for support.