Llama Nemotron Embed 1B v2
flagshipNVIDIA Llama Nemotron Embed 1B v2 is a text embedding model built on the Llama architecture, fine-tuned by NVIDIA for generating high-quality vector representations. With 1 billion parameters, it delivers strong embedding quality while remaining efficient enough for large-scale deployment.
The model produces embeddings optimized for retrieval, semantic search, and text similarity tasks. The v2 update improves upon the original with better handling of long documents, improved multilingual support, and enhanced performance on retrieval benchmarks.
Llama Nemotron Embed 1B v2 is part of NVIDIA's Nemotron model family, offering enterprise-grade embeddings for search, RAG, and recommendation systems.
Providers for Llama Nemotron Embed 1B v2
1 routes · sorted by uptimeClosedRouter routes requests to the providers best able to handle your prompt size and parameters, with automatic fallbacks to maximize uptime.