Granite 4.0 Micro

IBM Granite chatreasoningtool_use

API ID: ibm-granite/granite-4.0-h-micro

Input Price
$0.02
/1M tokens
Output Price
$0.11
/1M tokens

About Granite 4.0 Micro

Granite is IBM's enterprise language model series, designed for business applications. The models deliver strong performance on enterprise tasks with focus on reliability and safety. Granite variants include specialized versions for different business needs. The series integrates with IBM's enterprise AI platform. For organizations seeking enterprise-grade AI from IBM, Granite offers business-focused capability.

๐Ÿ“Š
Price Ranking
#299 lowest price among 599 Chat models

Model Specifications

Context Length
131k
Max Output
โ€”
Release Date
2025-10-20
Capabilities
chat reasoning tool_use
Input Modalities
text
Output Modalities
text

Best For

  • Complex reasoning, math problems, multi-step logic
  • Conversations, content writing, general assistance

Consider Alternatives For

  • Image understanding (needs vision capability)
  • Simple Q&A (cheaper models available)

๐Ÿ’ฐ Real-World Cost Examples

Estimated monthly costs for common use cases

Personal AI Assistant
$0.04
/month
50 conversations/day, ~500 tokens each
Customer Service Bot
$1.25
/month
1000 tickets/day, ~800 tokens each
Data Analysis Pipeline
$1.70
/month
500 analyses/day, ~2k tokens each

Similar Models from Other Providers

Cross-brand alternatives with similar capabilities

Google Gemma 3 12B
Input: $0.03
Output: $0.10
Context: 131k
OpenAI Text Embedding 3 Large
Input: $0.13
Output: Free
Context: 8k
Alibaba Qwen Qwen2.5 Coder 7B Instruct
Input: $0.03
Output: $0.09
Context: 33k
Google Gemma 2 9B
Input: $0.03
Output: $0.09
Context: 8k

๐Ÿš€ Quick Start

Get started with Granite 4.0 Micro API

OpenAI-compatible SDK
from openai import OpenAI

client = OpenAI(
    base_url="https://api.provider.com/v1",
    api_key="YOUR_API_KEY"
)

response = client.chat.completions.create(
    model="ibm-granite/granite-4.0-h-micro",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)
print(response.choices[0].message.content)