Google Gemini Setup Tutorial

This tutorial will guide you through setting up Google Gemini's Live API for use with Sokuji. Gemini offers excellent multilingual support and automatic audio processing capabilities.

Good News: Gemini API offers generous free quotas and competitive pricing compared to other providers.

Step 1: Access Google AI Studio

Visit Google AI Studio (https://aistudio.google.com) and sign in with your Google account.

Step 2: Create an API Key

In Google AI Studio:

  1. Click on "Get API key" in the left sidebar
  2. Click "Create API key"
  3. Select "Create API key in new project" (recommended) or choose an existing project
  4. Your API key will be generated and displayed

Security Note: Copy and save your API key immediately. Store it securely and never share it publicly.
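
If you also use the key in your own scripts outside Sokuji, one common way to follow the security note is to keep the key in an environment variable rather than in source code. The sketch below assumes a variable named GEMINI_API_KEY; that name and both helper functions are illustrative and are not something Sokuji itself requires.

```python
import os


def load_gemini_api_key(env_var: str = "GEMINI_API_KEY") -> str:
    """Return the API key from the environment, or raise if it is missing."""
    # The variable name is an assumption for this sketch.
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set the {env_var} environment variable to your Gemini API key")
    return key


def mask_key(key: str, visible: int = 4) -> str:
    """Hide all but the last few characters so the key can appear in logs."""
    if len(key) <= visible:
        return "*" * len(key)
    return "*" * (len(key) - visible) + key[-visible:]
```

Printing mask_key(load_gemini_api_key()) confirms that a key is set without revealing it in terminal output or screenshots.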

Step 3: Enable Required APIs

The API key should automatically have access to Gemini models. If you encounter issues, visit the Google Cloud Console and ensure the Gemini API is enabled for your project.
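
To check outside Sokuji whether a key can reach Gemini models, one option is to ask the Gemini REST API for its model list. This is a minimal sketch using only the Python standard library; it assumes the public v1beta models endpoint and the x-goog-api-key header, and the function names are illustrative. A successful call returns model names, while a rejected key raises urllib.error.HTTPError.

```python
import json
import urllib.request

# Public model-listing endpoint of the Gemini REST API (v1beta assumed here).
MODELS_URL = "https://generativelanguage.googleapis.com/v1beta/models"


def build_models_request(api_key: str) -> urllib.request.Request:
    """Build a GET request that passes the key in a header, not in the URL."""
    return urllib.request.Request(MODELS_URL, headers={"x-goog-api-key": api_key})


def extract_model_names(payload: dict) -> list[str]:
    """Pull the 'name' field out of each entry in a models response."""
    return [model["name"] for model in payload.get("models", [])]


def list_models(api_key: str) -> list[str]:
    """Call the API and return model names; raises HTTPError if the key is rejected."""
    with urllib.request.urlopen(build_models_request(api_key), timeout=10) as resp:
        return extract_model_names(json.load(resp))
```

Calling list_models("your-key-here") requires network access. If the returned list is empty or the call fails, check the Google Cloud Console as described above.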

Step 4: Configure in Sokuji

Open Sokuji and navigate to the Settings panel:

  1. Select "Gemini" as your AI provider
  2. Paste your API key in the "Gemini API Key" field
  3. Choose your preferred model (recommended: gemini-2.0-flash-live-001)
  4. Select a voice (e.g., Aoede, Puck, or Charon)
  5. Configure your source and target languages
  6. Adjust temperature and token settings if needed
  7. Click Save Settings
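
The choices made in the Settings panel can be pictured as a small configuration record. The sketch below is purely illustrative: the field names, the default language codes, the default temperature, and the 0.0 to 2.0 temperature range are assumptions made for this example, not Sokuji's actual settings schema. Only the model name and the voice come from the steps above.

```python
from dataclasses import dataclass


@dataclass
class GeminiSettings:
    """Hypothetical mirror of the Settings panel; not Sokuji's real schema."""

    api_key: str
    model: str = "gemini-2.0-flash-live-001"  # recommended model from step 3
    voice: str = "Aoede"                      # one of the example voices
    source_language: str = "en"               # assumed language code
    target_language: str = "ja"               # assumed language code
    temperature: float = 0.8                  # assumed default

    def problems(self) -> list[str]:
        """Return readable problems; an empty list means the values look sane."""
        found = []
        if not self.api_key.strip():
            found.append("API key is empty")
        if self.source_language == self.target_language:
            found.append("source and target languages are identical")
        # Assumed valid range for this sketch: 0.0 to 2.0 inclusive.
        if not 0.0 <= self.temperature <= 2.0:
            found.append("temperature is outside the assumed 0.0-2.0 range")
        return found
```

For example, GeminiSettings(api_key="k").problems() returns an empty list, while a blank key or identical source and target languages each add one entry.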

Step 5: Test Your Setup

Start a translation session to verify everything is working:

  1. Click the Start Session button in Sokuji
  2. Speak into your microphone
  3. You should hear the translated audio output

Gemini-Specific Features

Automatic Processing

Gemini handles turn detection automatically, so you don't need to configure threshold or silence duration settings. The model intelligently determines when you've finished speaking.

Voice Personalities

Gemini offers 30 unique voice personalities with different characteristics. Experiment with different voices to find one that suits your use case.

Troubleshooting

Common Issues

API Key Error: Ensure your API key is valid and has not expired. Check that the Gemini API is enabled in your Google Cloud project.

Quota Exceeded: Check your usage limits in Google AI Studio and consider upgrading if needed.

Language Not Supported: Verify that both your source and target languages are supported by Gemini. Some language pairs may have better quality than others.
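
When testing a key with your own REST calls, the HTTP status code usually indicates which of the first two issues applies. The mapping below is a rough sketch based on the Gemini API's general error codes (400 for invalid requests including bad keys, 403 for permission problems, 429 for exhausted quota); the function name and hint wording are illustrative, and Sokuji may report errors differently.

```python
def troubleshooting_hint(http_status: int) -> str:
    """Map a Gemini REST status code to the matching troubleshooting entry."""
    hints = {
        400: "API Key Error: check that the key was copied correctly and is valid.",
        403: "API Key Error: check that the Gemini API is enabled for your project.",
        429: "Quota Exceeded: check your usage limits in Google AI Studio.",
    }
    # Anything else is outside the issues covered in this section.
    return hints.get(http_status, "Unrecognized status: see the Gemini API documentation.")
```

Language support problems generally do not surface as a distinct status code, so they are left out of this mapping.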

Need More Help? Check the Gemini API documentation or visit our GitHub repository for support.