GTE-Base

Other chattool_use

API ID: thenlper/gte-base-20251117

Input Price
$0.0050
/1M tokens
Output Price
Free
/1M tokens

About GTE-Base

GTE Base is Alibaba's efficient embedding model, delivering good semantic search capability. The model generates embeddings that capture semantic meaning effectively. GTE Base achieves competitive performance while maintaining reasonable resource requirements. It's popular in the open-source community for building search systems. For developers seeking efficient open-source embeddings, GTE Base offers practical capability.

๐Ÿ’ฐ
Price Ranking
#547 lowest price among 950 Chat models

Model Specifications

Context Length
512
Max Output
โ€”
Release Date
2025-11-18
Capabilities
chat tool_use
Input Modalities
text
Output Modalities
embeddings

Best For

  • Conversations, content writing, general assistance

Consider Alternatives For

  • Image understanding (needs vision capability)

๐Ÿ’ฐ Real-World Cost Examples

Estimated monthly costs for common use cases

Personal AI Assistant
$0.00
/month
50 conversations/day, ~500 tokens each
Customer Service Bot
$0.07
/month
1000 tickets/day, ~800 tokens each

Other Model Lineup

Compare all models from Other to find the best fit

Model Input Output Context Capabilities
GTE-Base Current Free Free 512 chat tool_use
Riverflow V2 Max Preview Free Free 8k chat vision image_gen
Riverflow V2 Standard Preview Free Free 8k chat vision image_gen
Riverflow V2 Fast Preview Free Free 8k chat vision image_gen
AFM 4.5B Free Free 66k chat
AFM 4.5B Free Free 66k chat

Similar Models from Other Providers

Cross-brand alternatives with similar capabilities

Alibaba Qwen Qwen3 Embedding 8B
Input: $0.01
Output: Free
Context: 32k
Alibaba Qwen Qwen3 Embedding 8B
Input: $0.01
Output: Free
Context: 32k

๐Ÿš€ Quick Start

Get started with GTE-Base API

OpenAI-compatible SDK
from openai import OpenAI

client = OpenAI(
    base_url="https://api.provider.com/v1",
    api_key="YOUR_API_KEY"
)

response = client.chat.completions.create(
    model="thenlper/gte-base-20251117",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)
print(response.choices[0].message.content)