Model Inference
Choose the right model, get to the perfect prompt.
16 models across 4 inference providers. New models are added every week.
o1-mini
Oct 2024
o1-mini is a fast, cost-efficient reasoning model tailored to coding, math, and science use cases. The model has 128K context and an October 2023 knowledge cutoff.
Input: $3.00 / Output: $12.00
ministral-8b-latest
Oct 2024
Input: $0.10 / Output: $0.10
codestral-2405
Oct 2024
Input: $0.20 / Output: $0.60
pixtral-12b
Oct 2024
Input: $0.15 / Output: $0.15
open-mistral-7b
Oct 2024
Input: $0.25 / Output: $0.25
open-mixtral-8x22b
Oct 2024
Mixtral 8x22B is currently the most performant open model. It is a 22B sparse Mixture-of-Experts (SMoE) that uses only 39B active parameters out of 141B total.
Input: $2.00 / Output: $6.00
mistral-large-2407
Oct 2024
Top-tier reasoning for high-complexity tasks, for your most sophisticated needs.
Input: $2.00 / Output: $6.00
meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo
Oct 2024
Input: $0.18 / Output: $0.18
gpt-4o-mini
Oct 2024
GPT-4o mini is our most cost-efficient small model that’s smarter and cheaper than GPT-3.5 Turbo, and has vision capabilities. The model has 128K context and an October 2023 knowledge cutoff.
Input: $0.15 / Output: $0.60
gpt-4o
Oct 2024
GPT-4o is our most advanced multimodal model that’s faster and cheaper than GPT-4 Turbo with stronger vision capabilities. The model has 128K context and an October 2023 knowledge cutoff.
Input: $2.50 / Output: $10.00
o1-preview
Oct 2024
o1-preview is our new reasoning model for complex tasks. The model has 128K context and an October 2023 knowledge cutoff.
Input: $15.00 / Output: $60.00
ministral-3b-latest
Oct 2024
Input: $0.04 / Output: $0.04
mistral-small-2409
Oct 2024
Cost-efficient, fast, and reliable option for use cases such as translation, summarization, and sentiment analysis.
Input: $0.20 / Output: $0.60
mistral-nemo
Oct 2024
Input: $0.15 / Output: $0.15
open-mixtral-8x7b
Oct 2024
A 7B sparse Mixture-of-Experts (SMoE) that uses 12.9B active parameters out of 45B total.
Input: $0.70 / Output: $0.70
Gemini 1.5 Flash
Nov 2024
Input: $0.02 / Output: $0.02
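To compare models by price, the Input / Output figures in the listing can be turned into a rough per-request cost estimate. The sketch below is illustrative only: it assumes the prices are in USD per 1 million tokens (the listing does not state the unit), and the names `PRICES`, `estimate_cost`, and `cheapest` are hypothetical helpers, not part of any provider's API. Only a few of the listed models are included.

```python
# Input/output prices copied from the listing, in USD.
# ASSUMPTION: prices are per 1 million tokens; the listing does not state the unit.
PRICES = {
    "o1-mini": (3.00, 12.00),
    "gpt-4o-mini": (0.15, 0.60),
    "gpt-4o": (2.50, 10.00),
    "mistral-large-2407": (2.00, 6.00),
}

TOKENS_PER_UNIT = 1_000_000  # assumed pricing unit


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request for the given model."""
    input_price, output_price = PRICES[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_UNIT


def cheapest(models, input_tokens: int, output_tokens: int) -> str:
    """Return the model with the lowest estimated cost for this token mix."""
    return min(models, key=lambda m: estimate_cost(m, input_tokens, output_tokens))


if __name__ == "__main__":
    # Example workload: 2,000 prompt tokens and 500 completion tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 2_000, 500):.6f}")
    print("cheapest:", cheapest(PRICES, 2_000, 500))
```

Because output tokens are often priced several times higher than input tokens, the cheapest model for a workload can depend on the ratio of prompt length to completion length, not just on the headline input price.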