Platform

Choosing a language model

Open vs API models, what to weigh, and how we pick the right one for your use case.


We're model-agnostic — the right model depends on the task, your budget, and your data requirements. Here's how the options compare and how we decide.

Open vs API models

  • Open models (Llama, Mistral, Qwen, DeepSeek, Gemma…) can be self-hosted: your data stays on your servers, costs are predictable at scale, and you avoid per-vendor lock-in.
  • API models (GPT, Claude, Gemini…) are fastest to start with and often have an edge on the hardest reasoning and most natural language — but data is processed by the provider and you pay per use.

What to weigh

  • Quality on your task — reasoning, writing, or structured output
  • Cost — per-token API spend vs hosting your own
  • Latency — how fast replies need to be (voice and live chat are sensitive)
  • Languages — some open models are notably stronger multilingually
  • Context window — how much it can read at once
  • Tool / function calling — for agents that take actions
  • Data residency — does data need to stay in-house?

Rules of thumb

  • Structured, tool-driven tasks (scheduling, order lookups) run fine on mid-size open models
  • Grounded Q&A (RAG) depends more on retrieval quality than raw model size
  • Creative or high-stakes reasoning is where frontier API models earn their cost
  • Sensitive data leans toward self-hosted open models

How we choose

For each use case we shortlist a couple of options, test them on your real examples, and recommend the best balance — then revisit as your volume and needs grow. Every agent's page lists a recommended starting point.