EmbeddingGemma 300M

flagship

Google · released 2024-06-01 · text

currently routing · 4.2k rpm

8.2M tokens

Context

— / 1M

Input

— / 1M

Output

— t/s

Speed

proprietary

License

/ ABOUT

Google EmbeddingGemma 300M is a compact text embedding model from Google, built on the Gemma architecture and specifically optimized for generating high-quality vector representations. Despite its small 300-million parameter size, it delivers strong performance on embedding benchmarks, making it efficient for deployment in search and retrieval systems.

The model produces dense embeddings that capture semantic meaning effectively, suitable for similarity search, clustering, classification, and RAG (Retrieval-Augmented Generation) applications. Its compact size enables fast inference and low storage requirements for embedding indices.

EmbeddingGemma 300M is ideal for developers seeking Google-quality embeddings with minimal computational overhead, particularly in resource-constrained or high-throughput environments.

Providers for EmbeddingGemma 300M

1 routes · sorted by uptime

ClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.

Provider

Context

Quant

Uptime · 30d

Cloudflare Workers AI

—

bf16

0.00%