Grok Vision Beta

xAI | chat, vision | Free

API ID: x-ai/grok-vision-beta

Input Price: Free (per 1M tokens)

Output Price: Free (per 1M tokens)

About Grok Vision Beta

Grok Vision Beta is xAI's multimodal model. It pairs Grok's language capability with image understanding: it accepts text and images as input and returns text in Grok's characteristic style. It suits applications that need vision-language capability with a distinct personality.

Price Ranking

#1 lowest price among 950 Chat models — Top 20% cheapest!

Model Specifications

Context Length: 8k

Max Output: not specified

Release Date: 2024-11-19

Capabilities: chat, vision

Input Modalities: text, image

Output Modalities: text

Best For

Image analysis, document understanding, visual Q&A
Conversations, content writing, general assistance

🎉

This model is completely free!

No token costs - use it without worrying about API bills.

xAI Model Lineup

Compare all models from xAI to find the best fit

Model	Input	Output	Context	Capabilities
Grok Vision Beta (current)	Free	Free	8k	chat, vision
Grok 2 Vision 1212	Free	Free	33k	chat, vision
Grok 2	Free	Free	33k	chat
Grok 2 mini	Free	Free	33k	chat

Similar Models from Other Providers

Cross-brand alternatives with similar capabilities

Gemma 3n 4B

Google

Input: Free

Output: Free

Context: 33k

Llama 3.2 3B Instruct

🚀 Quick Start

Get started with the Grok Vision Beta API

OpenAI-compatible SDK

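from openai import OpenAI

# Point the OpenAI SDK at the provider's OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://api.provider.com/v1",
    api_key="YOUR_API_KEY",  # replace with your own key
)

# Send a single text-only user message to Grok Vision Beta.
response = client.chat.completions.create(
    model="x-ai/grok-vision-beta",
    messages=[
        {"role": "user", "content": "Hello!"}
    ],
)

# Print the text of the first returned choice.
print(response.choices[0].message.content)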
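
Image input

The quick start above sends text only. Because this model lists image as an input modality, a request can also carry an image. The sketch below assumes the provider accepts the OpenAI chat-completions content-part format (`text` and `image_url` parts) and base64 data URLs, which this page does not confirm. The helper names `image_to_data_url` and `build_vision_messages` and the file `photo.png` are hypothetical, chosen for illustration.

```python
import base64


def image_to_data_url(image_bytes: bytes, mime_type: str = "image/png") -> str:
    """Encode raw image bytes as a base64 data URL."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"


def build_vision_messages(prompt: str, image_url: str) -> list[dict]:
    """Build one user message holding a text part and an image part.

    Assumption: the endpoint understands OpenAI-style content parts.
    """
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": prompt},
                {"type": "image_url", "image_url": {"url": image_url}},
            ],
        }
    ]


if __name__ == "__main__":
    from openai import OpenAI

    client = OpenAI(
        base_url="https://api.provider.com/v1",
        api_key="YOUR_API_KEY",
    )

    # Hypothetical local file; an https:// image URL could be passed instead.
    with open("photo.png", "rb") as f:
        url = image_to_data_url(f.read())

    response = client.chat.completions.create(
        model="x-ai/grok-vision-beta",
        messages=build_vision_messages("What is in this image?", url),
    )
    print(response.choices[0].message.content)
```

Keeping message construction in a separate function makes the payload easy to inspect before any request is sent. With a listed context length of 8k, large images may need to be downscaled first, depending on how the provider counts image tokens.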
