海量在线大模型 兼容OpenAI API

全部大模型

219个模型 · 2025-01-10 更新
OpenAI: GPT-3.5 Turbo
$0.0020/1k
$0.0060/1k
openai/gpt-3.5-turbo
GPT-3.5 Turbo is OpenAI’s fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.
2023-05-28 16,385 text->text GPT
OpenAI: GPT-4
$0.12/1k
$0.24/1k
openai/gpt-4
OpenAI’s flagship model, GPT-4 is a large-scale multimodal language model capable of solving difficult problems with greater accuracy than previous models due to its broader general knowledge and advanced reasoning capabilities. Training data: up to Sep 2021.
2023-05-28 8,191 text->text GPT
OpenAI: o1
$0.060/1k
$0.24/1k
openai/o1
The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 model series is trained with large-scale reinforcement learning to reason using chain of thought. The o1 models are optimized for math, science, programming, and other STEM-related tasks. They consistently exhibit PhD-level accuracy on benchmarks in physics, chemistry, and biology. Learn more in the launch announcement.
2024-12-18 200,000 text+image->text GPT
Anthropic: Claude v2
$0.032/1k
$0.096/1k
anthropic/claude-2
Claude 2 delivers advancements in key capabilities for enterprises—including an industry-leading 200K token context window, significant reductions in rates of model hallucination, system prompts and a new beta feature: tool use.
2023-11-22 200,000 text->text Claude
Anthropic: Claude 3.5 Sonnet
$0.012/1k
$0.060/1k
anthropic/claude-3.5-sonnet
New Claude 3.5 Sonnet delivers better-than-Opus capabilities, faster-than-Sonnet speeds, at the same Sonnet prices. Sonnet is particularly good at: Coding: Scores ~49% on SWE-Bench Verified, higher than the last best score, and without any fancy prompt scaffolding Data science: Augments human data science expertise; navigates unstructured data while using multiple tools for insights Visual processing: excelling at interpreting charts, graphs, and images, accurately transcribing text to derive insights beyond just the text alone Agentic tasks: exceptional tool use, making it great at agentic tasks (i.e. complex, multi-step problem solving tasks that require engaging with other systems) multimodal
2024-10-22 200,000 text+image->text Claude
DeepSeek V3
$0.0006/1k
$0.0011/1k
deepseek/deepseek-chat
DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported evaluations reveal that the model outperforms other open-source models and rivals leading closed-source models. For model details, please visit the DeepSeek-V3 repo for more information.
2024-12-27 64,000 text->text Other
DeepSeek V2.5
$0.0080/1k
$0.0080/1k
deepseek/deepseek-chat-v2.5
DeepSeek-V2.5 is an upgraded version that combines DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct. The new model integrates the general and coding abilities of the two previous versions. For model details, please visit DeepSeek-V2 page for more information.
2024-05-14 8,192 text->text Other
Google: Gemini Flash 1.5
$0.0003/1k
$0.0012/1k
google/gemini-flash-1.5
Gemini 1.5 Flash is a foundation model that performs well at a variety of multimodal tasks such as visual understanding, classification, summarization, and creating content from image, audio and video. It’s adept at processing visual and text inputs such as photographs, documents, infographics, and screenshots. Gemini 1.5 Flash is designed for high-volume, high-frequency tasks where cost and latency matter. On most common tasks, Flash achieves comparable quality to other Gemini Pro models at a significantly reduced cost. Flash is well-suited for applications like chat assistants and on-demand content generation where speed and scale matter. Usage of Gemini is subject to Google’s Gemini Terms of Use. multimodal
2024-05-14 1,000,000 text+image->text Gemini
google/gemini-2.0-flash-exp:free
Gemini Flash 2.0 offers a significantly faster time to first token (TTFT) compared to Gemini Flash 1.5, while maintaining quality on par with larger models like Gemini Pro 1.5. It introduces notable enhancements in multimodal understanding, coding capabilities, complex instruction following, and function calling. These advancements come together to deliver more seamless and robust agentic experiences.
2024-12-12 1,048,576 text+image->text Gemini
Meta: Llama 2 13B Chat
$0.0008/1k
$0.0008/1k
meta-llama/llama-2-13b-chat
A 13 billion parameter language model from Meta, fine tuned for chat completions
2023-06-20 4,096 text->text Llama2
Meta: Llama 3.1 405B (base)
$0.0080/1k
$0.0080/1k
meta-llama/llama-3.1-405b
Meta’s latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This is the base 405B pre-trained version. It has demonstrated strong performance compared to leading closed-source models in human evaluations. To read more about the model release, click here. Usage of this model is subject to Meta’s Acceptable Use Policy.
2024-08-02 32,768 text->text Llama3
openai/o1-preview-2024-09-12
The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 models are optimized for math, science, programming, and other STEM-related tasks. They consistently exhibit PhD-level accuracy on benchmarks in physics, chemistry, and biology. Learn more in the launch announcement. Note: This model is currently experimental and not suitable for production use-cases, and may be heavily rate-limited.
2024-09-12 128,000 text->text GPT