Google: Gemini Pro Vision 1.0
google/gemini-pro-vision
上下文长度: 16,384
text+image->text
Gemini
2023-12-13 更新
Google’s flagship multimodal model, supporting image and video in text or chat prompts for a text or code response. See the benchmarks and prompting guidelines from Deepmind. Usage of Gemini is subject to Google’s Gemini Terms of Use. multimodal