Overview
Gemini Assistant integrates Google’s Gemini AI models into Automagik Tools, providing vision capabilities, multimodal understanding, and blazing-fast inference with generous free tier access. Think of Gemini Assistant as your specialized AI for vision, large context, and rapid prototyping tasks.
Key Features
Vision & Multimodal
Analyze images, videos, and documents
Blazing Fast
Fastest inference with Gemini 2.0 Flash
Large Context
2M token context window with 1.5 Pro
Free Tier
Generous free API access
Code Generation
Strong coding capabilities
Multimodal Input
Text, image, video, audio support
Available Models
Gemini 2.0 Flash (Experimental)
Best for: Fast tasks, rapid iteration, experimentation
Gemini 1.5 Pro
Best for: Complex reasoning, massive context
Gemini 1.5 Flash
Best for: Balanced speed and quality
Use Cases
1. Image Analysis
2. Code Review with Context
3. Video Analysis
4. Rapid Prototyping
Quick Start
Installation
Get API Key
- Visit Google AI Studio
- Create API key
- Configure Gemini Assistant
Configuration
Create ~/.automagik/gemini.json:
Environment Variables
Text Generation
Simple Generation
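As a rough illustration of what a simple text generation call involves, the sketch below talks to Google's public Gemini REST endpoint directly using only the Python standard library. It does not go through Automagik Tools, whose own interface is not shown here. The environment variable name `GEMINI_API_KEY` and the helper names `build_request` and `generate` are assumptions made for this example, and the model ID may need updating to one your key can access.

```python
import json
import os
import urllib.request

API_ROOT = "https://generativelanguage.googleapis.com/v1beta"


def build_request(prompt: str, model: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a generateContent request for a text prompt."""
    body = {"contents": [{"parts": [{"text": prompt}]}]}
    return urllib.request.Request(
        url=f"{API_ROOT}/models/{model}:generateContent",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
        method="POST",
    )


def generate(prompt: str, model: str = "gemini-1.5-flash") -> str:
    """Send the prompt and return the text of the first candidate."""
    api_key = os.environ["GEMINI_API_KEY"]  # assumed variable name
    request = build_request(prompt, model, api_key)
    with urllib.request.urlopen(request, timeout=30) as response:
        reply = json.load(response)
    return reply["candidates"][0]["content"]["parts"][0]["text"]
```

With a key exported in your shell, `generate("Explain what a context window is in one sentence.")` should return the model's reply as a string. Keeping `build_request` separate from the network call makes the payload easy to inspect without spending quota.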
Code Generation
Conversational
Vision Capabilities
Image Analysis
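For image analysis, the image travels alongside the text prompt as a base64-encoded part. The sketch below builds such a request body in the shape the public Gemini REST API accepts for inline image data. It only constructs the payload and sends nothing; `build_image_payload` is a hypothetical helper name, and how Automagik Tools wraps this step internally is not covered here.

```python
import base64
import mimetypes
from pathlib import Path


def build_image_payload(image_path: str, question: str) -> dict:
    """Return a generateContent body with one inline image and one text part."""
    mime_type, _ = mimetypes.guess_type(image_path)
    if mime_type is None or not mime_type.startswith("image/"):
        raise ValueError(f"not a recognised image type: {image_path}")

    # The API expects the raw bytes as base64 text.
    encoded = base64.b64encode(Path(image_path).read_bytes()).decode("ascii")
    return {
        "contents": [
            {
                "parts": [
                    {"inline_data": {"mime_type": mime_type, "data": encoded}},
                    {"text": question},
                ]
            }
        ]
    }
```

The returned dictionary can be serialized with `json.dumps` and posted to a vision-capable model's `generateContent` endpoint.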
OCR and Text Extraction
UI Analysis
Multimodal Features
Video Analysis
Audio Processing
Document Understanding
Large Context Processing
Entire Codebase Analysis
Document Processing
Integration Patterns
Pattern 1: Vision-Based Testing
Pattern 2: Code Review Assistant
Pattern 3: Documentation Generator
Advanced Features
Function Calling
Structured Output
Streaming Responses
Model Comparison
Speed vs Quality
Context Window vs Cost
Cost Management
Free Tier Limits
Monitor Usage
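One lightweight way to monitor usage on the client side is a sliding-window counter that refuses a request once the window is full. The sketch below is a generic pattern, not part of Automagik Tools; `UsageTracker` is a hypothetical class, and the actual limit is left as a parameter because free-tier quotas vary by model and change over time.

```python
import time
from collections import deque
from typing import Callable


class UsageTracker:
    """Count requests inside a rolling time window."""

    def __init__(
        self,
        max_requests: int,
        window_seconds: float = 60.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self._clock = clock
        self._stamps = deque()

    def _prune(self) -> None:
        # Drop timestamps that have aged out of the window.
        now = self._clock()
        while self._stamps and now - self._stamps[0] >= self.window_seconds:
            self._stamps.popleft()

    def remaining(self) -> int:
        """Requests still available in the current window."""
        self._prune()
        return self.max_requests - len(self._stamps)

    def try_acquire(self) -> bool:
        """Record a request if there is room; return False otherwise."""
        if self.remaining() <= 0:
            return False
        self._stamps.append(self._clock())
        return True
```

Call `try_acquire()` before each request and skip or delay the call when it returns `False`. Look up the current quota for your model in Google AI Studio and pass it as `max_requests`.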
Best Practices
Use Flash for Speed
Gemini 2.0 Flash is perfect for rapid iteration
Pro for Context
Use 1.5 Pro when you need massive context
Vision for UI
Leverage vision for UI/UX analysis and testing
Monitor Free Tier
Track usage to stay within free limits
Troubleshooting
API key not working
Solutions:
- Verify API key is correct
- Check key is enabled in Google AI Studio
- Ensure billing is set up (for paid tier)
- Check API quotas
Rate limit errors
Solutions:
- Stay within free tier limits
- Implement retry with backoff
- Upgrade to paid tier
- Distribute requests over time
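The "retry with backoff" suggestion above can be sketched as follows. This is a generic exponential backoff helper with a little jitter, not something Automagik Tools provides; `RateLimitError` is a hypothetical stand-in for whatever exception your client raises on an HTTP 429 response.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimitError(Exception):
    """Hypothetical stand-in for the error raised on HTTP 429."""


def retry_with_backoff(
    call: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    max_delay: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run call(), retrying on RateLimitError with exponentially growing waits."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        try:
            return call()
        except RateLimitError:
            if attempt == max_attempts - 1:
                raise  # out of attempts, surface the error
            # Wait 1s, 2s, 4s, ... capped at max_delay, plus up to 10% jitter.
            delay = min(max_delay, base_delay * 2 ** attempt)
            sleep(delay + random.uniform(0, delay * 0.1))
    raise AssertionError("unreachable")
```

Wrap the request in a zero-argument callable, for example `retry_with_backoff(lambda: send(prompt))`, where `send` is your own function that raises `RateLimitError` when the API rejects the call. The jitter spreads retries out so that several clients do not all retry at the same instant.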
Vision not working
Solutions:
- Check image format (PNG, JPEG, WebP)
- Verify image size (max 20MB)
- Test with simpler image
- Check model supports vision
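The format and size checks above are easy to run locally before an image is sent. A minimal sketch, assuming the 20MB limit means 20 × 1024 × 1024 bytes and that the file extension reflects the real format; `check_image` is a hypothetical helper name.

```python
from pathlib import Path

ALLOWED_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}
MAX_BYTES = 20 * 1024 * 1024  # assumed interpretation of "20MB"


def check_image(path: str) -> list[str]:
    """Return a list of problems found; an empty list means the file looks fine."""
    file = Path(path)
    if not file.is_file():
        return [f"{path}: file not found"]

    problems = []
    if file.suffix.lower() not in ALLOWED_SUFFIXES:
        problems.append(f"{path}: unsupported format {file.suffix or '(none)'}")
    size = file.stat().st_size
    if size > MAX_BYTES:
        problems.append(f"{path}: {size} bytes exceeds the {MAX_BYTES}-byte limit")
    return problems
```

Running this first separates local file problems from model or API problems, which narrows down the remaining two checks in the list.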
Context too large
Solutions:
- Use Gemini 1.5 Pro for larger context
- Split into smaller chunks
- Summarize before processing
- Remove unnecessary content
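Splitting into smaller chunks, as suggested above, can be sketched with a character budget standing in for a token budget. Character counts are only a rough proxy for tokens, so treat `max_chars` as a conservative assumption; `split_into_chunks` is a hypothetical helper that prefers to cut at paragraph breaks.

```python
def split_into_chunks(text: str, max_chars: int) -> list[str]:
    """Split text into pieces of at most max_chars, cutting at blank lines when possible."""
    if max_chars <= 0:
        raise ValueError("max_chars must be positive")

    chunks = []
    start = 0
    while start < len(text):
        end = min(len(text), start + max_chars)
        if end < len(text):
            # Look for the last paragraph break that fits inside this window.
            cut = text.rfind("\n\n", start + 1, end)
            if cut != -1:
                end = cut
        chunks.append(text[start:end])
        start = end
    return chunks
```

No characters are dropped, so joining the chunks reproduces the original text. Each chunk can then be processed or summarized on its own and the results combined afterwards.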

