Llama Nemotron Embed VL 1B V2 (free)

NVIDIA chatvisiontool_use

API ID: nvidia/llama-nemotron-embed-vl-1b-v2-20260224

No data available

About Llama Nemotron Embed VL 1B V2 (free)

Llama Nemotron Embed VL 1B V2 (free) is a general-purpose model from NVIDIA with long context (131k), suitable for conversations, content creation, and general AI tasks.

๐Ÿ†
Price Ranking
#1 lowest price among 950 Chat models โ€” Top 20% cheapest!

Model Specifications

Context Length
131k
Max Output
โ€”
Release Date
2026-02-25
Capabilities
chat vision tool_use
Input Modalities
textimage
Output Modalities
embeddings

Best For

  • Image analysis, document understanding, visual Q&A
  • Conversations, content writing, general assistance

NVIDIA Model Lineup

Compare all models from NVIDIA to find the best fit

Model Input Output Context Capabilities
Llama Nemotron Embed VL 1B V2 (free) Current Free Free 131k chat vision tool_use
Nemotron-4 340B Instruct Free Free 4k chat
Nemotron-4 340B Instruct Free Free 4k chat
Llama 3.1 Nemotron Nano 8B v1 Free Free 131k chat
Llama 3.1 Nemotron Nano 8B v1 Free Free 131k chat
Llama 3.3 Nemotron Super 49B v1 Free Free 131k chat

Similar Models from Other Providers

Cross-brand alternatives with similar capabilities

Google Gemma 3n 4B
Input: Free
Output: Free
Context: 33k
Meta Llama 3.2 3B Instruct
Input: Free
Output: Free
Context: 80k
Alibaba Qwen Qwen2.5-VL 7B Instruct
Input: Free
Output: Free
Context: 33k
ByteDance Seedream 4.5
Input: Free
Output: Free
Context: 4k

๐Ÿš€ Quick Start

Get started with Llama Nemotron Embed VL 1B V2 (free) API

OpenAI-compatible SDK
from openai import OpenAI

client = OpenAI(
    base_url="https://api.provider.com/v1",
    api_key="YOUR_API_KEY"
)

response = client.chat.completions.create(
    model="nvidia/llama-nemotron-embed-vl-1b-v2-20260224",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)
print(response.choices[0].message.content)