Nemotron-4 340B Instruct

Name: Nemotron-4 340B Instruct
Availability: Online only
Author: NVIDIA

API ID: nvidia/nemotron-4-340b-instruct

Input Price: Free (per 1M tokens)

Output Price: Free (per 1M tokens)

About Nemotron-4 340B Instruct

Nemotron is NVIDIA's language model family, tuned for efficient inference on NVIDIA GPUs. The models target reasoning, coding, and general language tasks, and variants range from compact Nano versions to large Ultra models. The series integrates with NVIDIA's AI platform for enterprise deployment. For organizations that already run NVIDIA infrastructure, the main draw is this hardware-specific tuning.

Price Ranking

#1 lowest price among 950 chat models (top 20% cheapest).

Model Specifications

Context Length: 4k
Max Output: not listed
Release Date: 2024-06-23
Capabilities: chat
Input Modalities: text
Output Modalities: text
Best For: Conversations, content writing, general assistance
Consider Alternatives For: Image understanding (needs vision capability)

This model is completely free!

No token costs - use it without worrying about API bills.

NVIDIA Model Lineup

Compare all models from NVIDIA to find the best fit

Model	Input	Output	Context	Capabilities
Nemotron-4 340B Instruct (current)	Free	Free	4k	chat
Llama 3.1 Nemotron Nano 8B v1	Free	Free	131k	chat
Llama 3.3 Nemotron Super 49B v1	Free	Free	131k	chat
Nemotron 3 Nano 30B A3B	Free	Free	262k	chat, reasoning, tool_use

Similar Models from Other Providers

Cross-brand alternatives with similar capabilities

Gemma 3n 4B

Google

Input: Free

Output: Free

Context: 33k

Llama 3.2 3B Instruct

Quick Start

Get started with the Nemotron-4 340B Instruct API

OpenAI-compatible SDK

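from openai import OpenAI

# The base_url below appears to be a placeholder; substitute your provider's endpoint.
client = OpenAI(
    base_url="https://api.provider.com/v1",
    api_key="YOUR_API_KEY",  # replace with your own key
)

# Send a single-turn chat request, selecting the model by its API ID.
response = client.chat.completions.create(
    model="nvidia/nemotron-4-340b-instruct",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)
# Print the text of the first returned choice.
print(response.choices[0].message.content)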
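
The lineup table lists a 4k context for this model, so longer conversations may need trimming before they are sent. The following is a minimal sketch of one way to do that, not part of the provider's SDK. The helper names (estimate_tokens, trim_messages), the reading of "4k" as 4096 tokens, the 512-token reply reserve, and the four-characters-per-token heuristic are all assumptions made for illustration; the model's real tokenizer will give different counts.

```python
from typing import Dict, List

CONTEXT_TOKENS = 4096   # assumption: "4k" in the lineup table means 4096 tokens
CHARS_PER_TOKEN = 4     # assumption: rough heuristic, not the model's tokenizer


def estimate_tokens(text: str) -> int:
    """Crude estimate: about one token per CHARS_PER_TOKEN characters."""
    return max(1, (len(text) + CHARS_PER_TOKEN - 1) // CHARS_PER_TOKEN)


def trim_messages(
    messages: List[Dict[str, str]],
    budget: int = CONTEXT_TOKENS,
    reserve_for_reply: int = 512,  # hypothetical headroom for the model's answer
) -> List[Dict[str, str]]:
    """Keep the newest messages whose estimated size fits the prompt budget."""
    limit = budget - reserve_for_reply
    kept: List[Dict[str, str]] = []
    used = 0
    # Walk from newest to oldest so the most recent turns survive.
    for message in reversed(messages):
        cost = estimate_tokens(message["content"])
        if used + cost > limit:
            break
        kept.append(message)
        used += cost
    kept.reverse()  # restore chronological order
    return kept
```

The trimmed list can be passed as the messages argument in the quick start call above. If the newest message alone exceeds the budget, the sketch returns an empty list, so a caller would need to shorten that message separately.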
