Qwen 2 72B Instruct

$0.0014/1k

$0.0016/1k

qwen/qwen-2-72b-instruct

上下文长度: 32,768 text->text Qwen 2024-06-07 更新

Qwen2 72B is a transformer-based model that excels in language understanding, multilingual capabilities, coding, mathematics, and reasoning. It features SwiGLU activation, attention QKV bias, and group query attention. It is pretrained on extensive data with supervised finetuning and direct preference optimization. For more details, see this blog post and GitHub repo. Usage of this model is subject to Tongyi Qianwen LICENSE AGREEMENT.

模型参数

架构信息

模态: text->text

Tokenizer: Qwen

指令类型: chatml

限制信息

上下文长度: 32,768

Qwen 2 72B Instruct

模型参数

架构信息

限制信息

相关模型

Rocinante 12B

Qwen: QwQ 32B Preview

Qwen: QvQ 72B Preview

Qwen2.5 Coder 32B Instruct

Qwen2.5 7B Instruct

Qwen2.5 72B Instruct