qwen3-coder-flash

qwen3-coder-flash (qwen3-coder-flash) — 128K context, Coding tier. Preço entrada $0.5/M · Preço saída $4/M · Latência 24ms. Routed via Routara OpenAI-compatible endpoint with multi-region failover and metered billing.

Entrada / 1M: $0.5/M · Saída / 1M: $4/M · TTFT: 24ms

Specifications

  • Desenvolvedor: qwen3-coder-flash
  • Categoria: Coding
  • Janela de contexto: 128K
  • Preço entrada: $0.5 / 1M
  • Preço saída: $4 / 1M
  • Latência: 24 ms
  • SLA: A

Typical use cases

  • Geração de código
  • RAG e ferramentas
  • Raciocínio multi-etapas

FAQ

  • Preço do qwen3-coder-flash na Routara? — Entrada $0.5/M, saída $4/M — cobrança por uso.
  • qwen3-coder-flash é compatível com OpenAI? — Sim. base_url: https://api.routara.ai/v1
  • qwen3-coder-flash suporta streaming? — Sim quando a rota está live (stream: true).

Related models

  • qwen3-coder-480b-a35b-instruct (/detail/qwen3-coder-480b-a35b-instruct)
  • gpt-5.1-codex (/detail/gpt-5-1-codex)
  • gpt-5.1-codex-max (/detail/gpt-5-1-codex-max)

Quick integration

  • curl https://api.routara.ai/v1/chat/completions \ -H "Authorization: Bearer YOUR_API_KEY" \ -H "Content-Type: application/json" \ -d '{"model":"qwen3-coder-flash","messages":[{"role":"user","content":"Hello"}],"stream":true}'