GLM 4 32B

API ID: thudm/glm-4-32b-0414

Input Price
$0.10
/1M tokens
Output Price
$0.10
/1M tokens

About GLM 4 32B

GLM 4 32B is a budget-friendly general-purpose model from Zhipu AI with standard context (33k), suitable for conversations, content creation, and general AI tasks.

๐Ÿ’ฐ
Price Ranking
#610 lowest price among 950 Chat models

Model Specifications

Context Length
33k
Max Output
โ€”
Release Date
2025-04-17
Capabilities
chat
Input Modalities
text
Output Modalities
text

Best For

  • Conversations, content writing, general assistance

Consider Alternatives For

  • Image understanding (needs vision capability)

๐Ÿ’ฐ Real-World Cost Examples

Estimated monthly costs for common use cases

Personal AI Assistant
$0.07
/month
50 conversations/day, ~500 tokens each
Customer Service Bot
$2.40
/month
1000 tickets/day, ~800 tokens each

Zhipu AI Model Lineup

Compare all models from Zhipu AI to find the best fit

Model Input Output Context Capabilities
GLM 4 32B Current Free Free 33k chat
GLM 4 9B Free Free 32k chat
GLM 4 9B Free Free 32k chat
GLM 4.1V 9B Thinking Free Free 66k chat vision reasoning
GLM Z1 Rumination 32B Free Free 32k chat reasoning
GLM Z1 Rumination 32B Free Free 32k chat reasoning

Similar Models from Other Providers

Cross-brand alternatives with similar capabilities

Mistral Pixtral 12B
Input: $0.10
Output: $0.10
Context: 4k
Mistral Ministral 3 3B 2512
Input: $0.10
Output: $0.10
Context: 131k
Mistral Ministral 8B
Input: $0.10
Output: $0.10
Context: 128k
Microsoft Phi 4
Input: $0.06
Output: $0.14
Context: 16k

๐Ÿš€ Quick Start

Get started with GLM 4 32B API

OpenAI-compatible SDK
from openai import OpenAI

client = OpenAI(
    base_url="https://api.provider.com/v1",
    api_key="YOUR_API_KEY"
)

response = client.chat.completions.create(
    model="thudm/glm-4-32b-0414",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)
print(response.choices[0].message.content)