About Llama 3.2 90B Vision Instruct
Llama 3.2 90B Vision Instruct is Meta's largest multimodal model, combining powerful language capability with native vision understanding. With 90 billion parameters, it processes both text and images, enabling sophisticated applications that require visual comprehension. The model excels at image description, visual question answering, document understanding, and tasks that combine visual and textual reasoning.

Llama 3.2 90B Vision features a substantial context window and performs strongly on both vision and language benchmarks. Its open weights enable self-hosting and customization for specific visual domains, making the model particularly valuable for document processing, content moderation, and enterprise applications that require image analysis. For organizations seeking capable multimodal AI with open-source benefits, Llama 3.2 90B Vision offers frontier-adjacent capability with full deployment control, reflecting Meta's commitment to advancing open multimodal AI.
Model Specifications
Best For
- Image analysis, document understanding, visual Q&A
- Conversations, content writing, general assistance
Real-World Cost Examples
Estimated monthly costs for common use cases
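This table lists the model as free, but when deploying through a paid provider (or self-hosting), monthly cost scales with request volume and token counts. The sketch below shows the standard per-million-token arithmetic; the prices and traffic figures in the example are hypothetical placeholders, not quotes for this model.

```python
def estimate_monthly_cost(
    requests_per_day: int,
    input_tokens_per_request: int,
    output_tokens_per_request: int,
    input_price_per_million: float,
    output_price_per_million: float,
    days_per_month: int = 30,
) -> float:
    """Estimate monthly API spend in dollars from per-million-token prices."""
    cost_per_request = (
        input_tokens_per_request * input_price_per_million / 1_000_000
        + output_tokens_per_request * output_price_per_million / 1_000_000
    )
    return requests_per_day * days_per_month * cost_per_request

# Hypothetical example: 1,000 vision requests/day, ~1,500 input tokens each
# (image tiles plus prompt), ~300 output tokens, at $0.90 per million tokens
# for both input and output (placeholder prices).
monthly = estimate_monthly_cost(1_000, 1_500, 300, 0.90, 0.90)
print(f"${monthly:.2f}/month")  # ≈ $48.60/month under these assumptions
```

Note that vision requests typically consume far more input tokens than text-only requests, because each image is encoded as a block of tokens; budget accordingly.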
Meta Model Lineup
Compare all models from Meta to find the best fit
| Model | Input | Output | Context | Capabilities |
|---|---|---|---|---|
| Llama 3.2 90B Vision Instruct (current) | Free | Free | 131k | chat, vision |
| Llama 3.2 3B Instruct | Free | Free | 80k | chat, tool_use |
| Llama 3 70B (Base) | Free | Free | 8k | chat |
| LlamaGuard 2 8B | Free | Free | 8k | chat |
| Llama 3 8B (Base) | Free | Free | 8k | chat |
Similar Models from Other Providers
Cross-brand alternatives with similar capabilities
Cheaper Alternatives
Same Brand (Meta)
Cross Brand
Quick Start
Get started with Llama 3.2 90B Vision Instruct API
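As a starting point, here is a sketch of how a vision request to this model is typically structured through an OpenAI-compatible chat completions API, which many hosting providers expose. The model id, endpoint URL, and exact content-part schema are assumptions that vary by provider, so check your provider's documentation; this example only builds and prints the request payload rather than sending it.

```python
import json

# Hypothetical model id — providers use their own naming schemes.
MODEL_ID = "meta-llama/llama-3.2-90b-vision-instruct"

def build_vision_request(prompt: str, image_url: str) -> dict:
    """Build an OpenAI-style chat payload mixing text and an image."""
    return {
        "model": MODEL_ID,
        "messages": [
            {
                "role": "user",
                # Multimodal messages use a list of content parts
                # instead of a plain string.
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }
        ],
        "max_tokens": 300,
    }

payload = build_vision_request(
    "Describe this chart in two sentences.",
    "https://example.com/chart.png",
)
print(json.dumps(payload, indent=2))
```

To actually send the request, POST this payload with your API key to your provider's chat completions endpoint (for example via the `openai` Python client with a custom `base_url`); the assistant's reply arrives in `choices[0].message.content`.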