Google Gemini Backend

The GeminiBackend provides access to Google’s Gemini models via the google-genai SDK. It supports both Google AI Studio (API Key) and Vertex AI (ADC) authentication methods.

Authentication

kanoa automatically detects your environment:

Google AI Studio: If GOOGLE_API_KEY is set, it uses the AI Studio API.
Vertex AI: If no API key is found, it falls back to Application Default Credentials (ADC) for Vertex AI.

File Handling & Context Caching

The backend handles file uploads differently depending on the environment:

AI Studio vs. Vertex AI

AI Studio: Uses the File API to upload files to a temporary staging area. The model references these files via URI.
Vertex AI: The File API is not available. kanoa automatically switches to an inline transfer strategy, reading file bytes and passing them directly to the model as inline data.

Production Best Practice

For production datasets on Vertex AI, we recommend using Google Cloud Storage (gs://) URIs. kanoa supports adding resources from GCS, which allows for more efficient handling of large files.

# Example: Adding a resource from GCS
interpreter.kb.add_resource("gs://my-bucket/my-doc.pdf")

Pricing

Token Type	Price (per 1M tokens)	Notes
Standard Input	$2.00	For context <200K tokens
Cached Input	$0.50	75% savings
Cache Storage	$0.20/hour	Per million cached tokens
Output	$12.00	All output tokens

Context Caching

Gemini supports Context Caching, which allows you to cache large prompts (like knowledge bases) to reduce costs and latency for subsequent requests.

Minimum Size: Caching is typically effective for contexts larger than ~2,048 tokens.
TTL: You can configure the Time-To-Live (TTL) for the cache.
Cost Savings: Cached input tokens are significantly cheaper than standard input tokens.

See the Gemini Context Caching Demo for a working example.

Cache Management CLI

kanoa includes a command-line tool to manage your Gemini context caches.

Listing Caches

List all active caches, their token counts, and expiration times:

python -m kanoa.tools.gemini_cache list

Deleting a Cache

Delete a specific cache by name or ID:

python -m kanoa.tools.gemini_cache delete <cache_name>

Pruning Caches

Delete ALL active caches (useful for cleanup):

python -m kanoa.tools.gemini_cache prune