Kling O1 - Reference to Video

Multimodal video model for reference-guided generation, preserving characters and styles from reference images.

Provider: kling
Modality: image-to-video
Released: Dec 2025
Pricing: $0.11 / sec
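
As a rough illustration of the per-second rate, the sketch below estimates what a clip might cost from its length. It assumes billing scales linearly with the requested `duration` in whole seconds, which this page does not state explicitly. `estimate_cost` is a hypothetical helper name, not part of any Oxen tooling.

```bash
#!/usr/bin/env bash
# Listed rate of $0.11 per second, held in cents to avoid floating point.
RATE_CENTS_PER_SEC=11

# Hypothetical helper: print the estimated cost in dollars for a clip
# whose length is given in whole seconds.
# Assumes linear per-second billing (an assumption, not confirmed by the page).
estimate_cost() {
  local seconds="$1"
  local cents=$(( seconds * RATE_CENTS_PER_SEC ))
  printf '$%d.%02d\n' $(( cents / 100 )) $(( cents % 100 ))
}

estimate_cost 5   # prints $0.55
```

Under that assumption, a 5-second clip like the one in the example request would come to about $0.55.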

Ready to build with Kling O1 - Reference to Video?

Sign in to grab your API key and start making requests.
bash
curl -sSf -X POST https://staging.hub.oxen.ai/api/videos/generate \
    -H "Content-Type: application/json" \
    -H "Authorization: Bearer $OXEN_API_KEY" \
    -d '{
  "model": "kling-video-o1-reference-to-video",
  "prompt": "@Element1 flies into the scene and lands on the table next to the ox in @Image1",
  "input_image": [
    "https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/daily_moo.png"
  ],
  "elements": [
    {
      "frontal_image_url": "https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/red_bird.png",
      "reference_image_urls": [
        "https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images/red_bird_side.png"
      ]
    }
  ],
  "aspect_ratio": "16:9",
  "duration": 5
}'
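
The request body above can also be assembled from shell variables, which makes it easier to swap in your own images. The sketch below only builds and prints the JSON and sends nothing. `build_payload` and its argument order are hypothetical. Reading `@Image1` as the first `input_image` entry and `@Element1` as the first `elements` entry is inferred from the example, not documented on this page.

```bash
#!/usr/bin/env bash
# Hypothetical helper: print a request body for one scene image and one element.
# Arguments: prompt, scene image URL, element frontal image URL,
#            element reference image URL, duration in seconds.
# Values are inserted verbatim, so they must not contain double quotes or backslashes.
build_payload() {
  local prompt="$1" image_url="$2" frontal_url="$3" reference_url="$4" duration="$5"
  cat <<EOF
{
  "model": "kling-video-o1-reference-to-video",
  "prompt": "${prompt}",
  "input_image": ["${image_url}"],
  "elements": [
    {
      "frontal_image_url": "${frontal_url}",
      "reference_image_urls": ["${reference_url}"]
    }
  ],
  "aspect_ratio": "16:9",
  "duration": ${duration}
}
EOF
}

# Asset URLs reused from the example request above.
ASSETS="https://hub.oxen.ai/api/repos/ox/Oxen-AI-Assets/file/main/images"
build_payload \
  "@Element1 lands on the table next to the ox in @Image1" \
  "${ASSETS}/daily_moo.png" \
  "${ASSETS}/red_bird.png" \
  "${ASSETS}/red_bird_side.png" \
  5
```

To send the result, the output can be piped into curl with `-d @-`, which reads the body from standard input, using the same endpoint and headers as the request above.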