qwen
qwen/qwen3-vl-4b-instruct/Released Aug 2025

Qwen3 VL 4B - Instruct

Multimodal LLM for text and images, excelling in visual QA, document/UI understanding, spatial reasoning, image captioning, and multimodal coding.

Fine-tunable
Provider
qwen
Modality
multi-to-text
Released
Aug 2025
Pricing
$0.0010 / sec

Ready to build with Qwen3 VL 4B - Instruct?

Sign in to grab your API key and start making requests.
bash
curl -sSf -X POST https://staging.hub.oxen.ai/api/chat/completions \
    -H "Content-Type: application/json" \
    -H "Authorization: Bearer $OXEN_API_KEY" \
    -d '{
  "model": "qwen3-vl-4b-instruct",
  "messages": [
    {
      "role": "user",
      "content": "Try sending a message."
    }
  ]
}'
Qwen3 VL 4B - Instruct on Oxen | Oxen.ai