Skip to content
Google

EmbeddingGemma 300M

flagship
Google · released 2024-06-01 · text
currently routing · 4.2k rpm
8.2M tokens
Context
— / 1M
Input
— / 1M
Output
— t/s
Speed
proprietary
License
/ ABOUT

Google EmbeddingGemma 300M is a compact text embedding model from Google, built on the Gemma architecture and specifically optimized for generating high-quality vector representations. Despite its small 300-million parameter size, it delivers strong performance on embedding benchmarks, making it efficient for deployment in search and retrieval systems.

The model produces dense embeddings that capture semantic meaning effectively, suitable for similarity search, clustering, classification, and RAG (Retrieval-Augmented Generation) applications. Its compact size enables fast inference and low storage requirements for embedding indices.

EmbeddingGemma 300M is ideal for developers seeking Google-quality embeddings with minimal computational overhead, particularly in resource-constrained or high-throughput environments.

Providers for EmbeddingGemma 300M

1 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider
Context
Quant
Uptime · 30d
bf16
0.00%