Phi 4 Multimodal Instruct

Microsoft (chat, vision)

API ID: microsoft/phi-4-multimodal-instruct

Input Price: $0.05 / 1M tokens
Output Price: $0.10 / 1M tokens

About Phi 4 Multimodal Instruct

Phi-4 is Microsoft's latest small language model in the Phi series. It improves on its predecessors in reasoning, coding, and general capability while keeping a practical parameter count, and it often matches or exceeds much larger models on key benchmarks. This multimodal variant accepts text and image input and produces text output. It has a 131k-token context window and is strongest at mathematical reasoning, code generation, and complex analysis. Its efficient design allows deployment on consumer hardware and cost-effective cloud instances, which makes it well suited to local deployment, edge computing, and applications that need strong capability without large compute budgets. The model reflects Microsoft's continued research into efficient AI, in which careful training substitutes for raw scale.

Price Ranking
Ranked #596 of 950 chat models by price, lowest first

Model Specifications

Context Length: 131k
Max Output: not specified
Release Date: 2025-03-08
Capabilities: chat, vision
Input Modalities: text, image
Output Modalities: text

Best For

  • Image analysis, document understanding, visual Q&A
  • Conversations, content writing, general assistance

Real-World Cost Examples

Estimated monthly costs for common use cases

Personal AI Assistant: $0.05/month (50 conversations/day, ~500 tokens each)
Customer Service Bot: $1.65/month (1000 tickets/day, ~800 tokens each)
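
The page does not say how these monthly figures are derived. The sketch below is one way to reproduce them from the listed prices ($0.05 input, $0.10 output per 1M tokens). The 30-day month and the 62.5% / 37.5% input-to-output token split are assumptions chosen because they match both listed figures; the function name and parameters are hypothetical.

```python
def estimate_monthly_cost(
    requests_per_day,
    tokens_per_request,
    input_share=0.625,        # assumption: 5:3 input/output split, not stated on the page
    days_per_month=30,        # assumption: 30-day month
    input_price_per_m=0.05,   # listed input price, USD per 1M tokens
    output_price_per_m=0.10,  # listed output price, USD per 1M tokens
):
    """Estimate monthly spend in USD for a steady daily workload."""
    total_tokens = requests_per_day * tokens_per_request * days_per_month
    input_tokens = total_tokens * input_share
    output_tokens = total_tokens - input_tokens
    return (
        input_tokens / 1_000_000 * input_price_per_m
        + output_tokens / 1_000_000 * output_price_per_m
    )


# Personal AI Assistant: 50 conversations/day, ~500 tokens each
print(f"${estimate_monthly_cost(50, 500):.2f}")
# Customer Service Bot: 1000 tickets/day, ~800 tokens each
print(f"${estimate_monthly_cost(1000, 800):.2f}")
```

With a different input/output mix the totals shift accordingly, since output tokens cost twice as much as input tokens at the listed prices.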

Microsoft Model Lineup

Compare all models from Microsoft to find the best fit

Model                                Input  Output  Context  Capabilities
Phi 4 Multimodal Instruct (current)  Free   Free    131k     chat, vision
Phi-3 Medium 4K Instruct             Free   Free    4k       chat
WizardLM-2 7B                        Free   Free    32k      chat
Phi-3.5 Mini 128K Instruct           Free   Free    128k     chat

Similar Models from Other Providers

Cross-brand alternatives with similar capabilities

Google Gemini Embedding 001
Input: $0.15
Output: Free
Context: 20k
Mistral Codestral Embed 2505
Input: $0.15
Output: Free
Context: 8k
DeepSeek DeepSeek R1 0528 Qwen3 8B
Input: $0.06
Output: $0.09
Context: 131k
Mistral Mistral Small 3
Input: $0.03
Output: $0.11
Context: 33k

Quick Start

Get started with Phi 4 Multimodal Instruct API

OpenAI-compatible SDK
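from openai import OpenAI

# Point the OpenAI SDK at the provider's OpenAI-compatible endpoint.
# The base URL below is a placeholder; substitute your provider's URL and key.
client = OpenAI(
    base_url="https://api.provider.com/v1",
    api_key="YOUR_API_KEY"
)

# Send a single-turn chat request to the model by its API ID.
response = client.chat.completions.create(
    model="microsoft/phi-4-multimodal-instruct",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)

# Print the text of the first returned choice.
print(response.choices[0].message.content)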
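
Since the model lists image as an input modality, a vision request might look like the sketch below. It uses the OpenAI chat format for image input (a content list with a `text` part and an `image_url` part carrying a base64 data URL). Whether this provider accepts that format for this model is an assumption, and the helper names `build_image_question` and `ask_about_image` are hypothetical.

```python
import base64

MODEL_ID = "microsoft/phi-4-multimodal-instruct"


def build_image_question(image_bytes, question, mime_type="image/png"):
    """Build a chat `messages` list pairing a text question with one image.

    The image is embedded as a base64 data URL, following the OpenAI
    chat-completions convention for image input (assumed to be supported here).
    """
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": question},
                {
                    "type": "image_url",
                    "image_url": {"url": f"data:{mime_type};base64,{encoded}"},
                },
            ],
        }
    ]


def ask_about_image(client, image_bytes, question):
    """Send the image question using an already-configured OpenAI client."""
    response = client.chat.completions.create(
        model=MODEL_ID,
        messages=build_image_question(image_bytes, question),
    )
    return response.choices[0].message.content
```

With the `client` from the quick start, usage would be along the lines of `ask_about_image(client, open("invoice.png", "rb").read(), "What is the total on this invoice?")`, where `invoice.png` is a stand-in file name.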