Llemma 7b

Other chattool_use

API ID: eleutherai/llemma_7b

Input Price
$0.80
/1M tokens
Output Price
$1.20
/1M tokens

About Llemma 7b

Llemma is EleutherAI's mathematics-focused model, designed for mathematical reasoning and problem-solving. The model excels at mathematical tasks, proofs, and technical analysis. Llemma demonstrates specialized capability for STEM applications. For developers building mathematical AI tools, Llemma offers focused mathematical capability.

๐Ÿ’ฐ
Price Ranking
#796 lowest price among 950 Chat models

Model Specifications

Context Length
4k
Max Output
4k
Release Date
2025-04-14
Capabilities
chat tool_use
Input Modalities
text
Output Modalities
text

Best For

  • Conversations, content writing, general assistance

Consider Alternatives For

  • Image understanding (needs vision capability)

๐Ÿ’ฐ Real-World Cost Examples

Estimated monthly costs for common use cases

Personal AI Assistant
$0.72
/month
50 conversations/day, ~500 tokens each
Customer Service Bot
$22.80
/month
1000 tickets/day, ~800 tokens each

Other Model Lineup

Compare all models from Other to find the best fit

Model Input Output Context Capabilities
Llemma 7b Current Free Free 4k chat tool_use
Riverflow V2 Max Preview Free Free 8k chat vision image_gen
Riverflow V2 Standard Preview Free Free 8k chat vision image_gen
Riverflow V2 Fast Preview Free Free 8k chat vision image_gen
AFM 4.5B Free Free 66k chat
AFM 4.5B Free Free 66k chat

Similar Models from Other Providers

Cross-brand alternatives with similar capabilities

OpenAI GPT-3.5 Turbo
Input: $0.50
Output: $1.50
Context: 16k
Mistral Mistral Large 3 2512
Input: $0.50
Output: $1.50
Context: 262k
Alibaba Qwen Qwen3 VL 32B Instruct
Input: $0.50
Output: $1.50
Context: 131k
Perplexity Sonar
Input: $1.00
Output: $1.00
Context: 127k

๐Ÿš€ Quick Start

Get started with Llemma 7b API

OpenAI-compatible SDK
from openai import OpenAI

client = OpenAI(
    base_url="https://api.provider.com/v1",
    api_key="YOUR_API_KEY"
)

response = client.chat.completions.create(
    model="eleutherai/llemma_7b",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)
print(response.choices[0].message.content)