Access state-of-the-art open-source AI models through a simple, unified API. From chat to embeddings, find the right model for your use case.
6 models
vision
Efficient vision-language model with automatic multi-tier failover.
vision
Compact vision-language model with automatic multi-tier failover.
vision
Multilingual multimodal model with 128 experts and 17B active parameters. Processes images and text for detailed visual analysis, OCR, and multimodal Q&A.
vision
Premium multimodal model accepting image and audio inputs. Best for complex image reasoning, document analysis, and audio-visual tasks.
vision
Advanced multimodal vision model with image understanding, OCR, visual Q&A, and document analysis capabilities. 128K context window.
We're constantly adding new models. Let us know what you need and we'll work on adding it.
Request a Model