Grok Vision Beta

xAI | chat | vision | Free

API ID: x-ai/grok-vision-beta

Input Price: Free (per 1M tokens)
Output Price: Free (per 1M tokens)

About Grok Vision Beta

Grok Vision Beta is xAI's multimodal model, pairing Grok's language capability with image understanding. It accepts text and image input and returns text, written in Grok's characteristic style. It suits applications that need vision-language capability together with a distinct personality, such as image analysis, document understanding, and visual Q&A.

Price Ranking
#1 lowest price among 950 Chat models (top 20% cheapest)

Model Specifications

Context Length: 8k
Max Output: not listed
Release Date: 2024-11-19
Capabilities: chat, vision
Input Modalities: text, image
Output Modalities: text

Best For

  • Image analysis, document understanding, visual Q&A
  • Conversations, content writing, general assistance

This model is completely free!

No token costs - use it without worrying about API bills.

xAI Model Lineup

Compare all models from xAI to find the best fit

Model                        Input   Output   Context   Capabilities
Grok Vision Beta (current)   Free    Free     8k        chat, vision
Grok 2 Vision 1212           Free    Free     33k       chat, vision
Grok 2                       Free    Free     33k       chat
Grok 2 mini                  Free    Free     33k       chat

Similar Models from Other Providers

Cross-brand alternatives with similar capabilities

Google Gemma 3n 4B
  Input: Free
  Output: Free
  Context: 33k

Meta Llama 3.2 3B Instruct
  Input: Free
  Output: Free
  Context: 80k

Alibaba Qwen Qwen2.5-VL 7B Instruct
  Input: Free
  Output: Free
  Context: 33k

ByteDance Seedream 4.5
  Input: Free
  Output: Free
  Context: 4k

Quick Start

Get started with the Grok Vision Beta API

OpenAI-compatible SDK
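from openai import OpenAI

# base_url and api_key are placeholders: substitute your provider's
# endpoint and your own key before running.
client = OpenAI(
    base_url="https://api.provider.com/v1",
    api_key="YOUR_API_KEY"
)

# Send a single text-only user message to the model.
response = client.chat.completions.create(
    model="x-ai/grok-vision-beta",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

# Print the text of the first returned choice.
print(response.choices[0].message.content)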
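
Since the model accepts image input, a request that attaches an image may be more representative than the text-only call above. The following is a minimal sketch, assuming the provider accepts the OpenAI Chat Completions content-part format (a `text` part plus an `image_url` part); that assumption is not confirmed by this page. The helper names `image_part`, `vision_messages`, and `describe_image` are hypothetical, and the base URL and API key are the same placeholders used in the Quick Start.

```python
import base64
import mimetypes
from pathlib import Path


def image_part(source: str) -> dict:
    """Build an image content part from a URL or a local file path (hypothetical helper)."""
    if source.startswith(("http://", "https://", "data:")):
        url = source
    else:
        # Local file: embed the bytes as a base64 data URL.
        mime = mimetypes.guess_type(source)[0] or "application/octet-stream"
        encoded = base64.b64encode(Path(source).read_bytes()).decode("ascii")
        url = f"data:{mime};base64,{encoded}"
    return {"type": "image_url", "image_url": {"url": url}}


def vision_messages(prompt: str, image_source: str) -> list:
    """Return one user message holding a text part and an image part."""
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": prompt},
                image_part(image_source),
            ],
        }
    ]


def describe_image(prompt: str, image_source: str) -> str:
    """Send the prompt and image to the model and return the reply text."""
    # Imported here so the helpers above work without the SDK installed.
    from openai import OpenAI

    client = OpenAI(
        base_url="https://api.provider.com/v1",  # placeholder from the Quick Start
        api_key="YOUR_API_KEY",                  # placeholder
    )
    response = client.chat.completions.create(
        model="x-ai/grok-vision-beta",
        messages=vision_messages(prompt, image_source),
    )
    return response.choices[0].message.content
```

Calling `describe_image("What is in this image?", "photo.jpg")` would send a local file as a data URL, while passing an `https://` address forwards the link unchanged. Whether a given provider accepts data URLs, and how images count against the 8k context, should be checked in that provider's documentation.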