Skip to main content

Image to Video

L
Written by LX

Image to Video animates a still image into a moving clip. You provide a first frame (and optionally a last frame), describe the motion you want, and generate. It's ideal when you already have an image and want to bring it to life.

What does Image to Video do?

It takes a source image as the starting frame of a video and animates it according to your description. A hint suggests "We recommend using images generated by this model, but feel free to choose any you like." You can also add a last frame so the clip ends on a specific image.

How do I add my source (first frame) image?

In the reference area, you have two buttons:

  • Select content — pick an existing image from your library or the inspiration feed.

  • Upload image — upload your own file.

Until you add one, the area shows "Your selected reference image will appear here once you select content or upload an image." The first frame "sets how your video starts." If you're not logged in, choosing Select content opens the sign-in dialog first.

What is the last frame and how do I add one?

The last frame "sets how your video ends." Click Add last frame to attach an image as the ending frame. A hint warns: "Last frame sets how your video ends. If it doesn't connect naturally to the first frame, the result might feel random." To remove it, click Remove last frame.

Can I generate a last frame instead of uploading one?

Yes. The last-frame area has a Generate last frame from first frame option that opens a dialog where you create an ending image from your first frame. Once generated and selected, it's attached as the last frame and the view scrolls back to the top.

Why is "Add last frame" greyed out?

Last frame is only supported on certain quality modes and only for single-shot videos. Hovering the disabled button explains which quality you need. You'll also see related notes like "Last frame is only supported in Ultra mode" and "Last frame is only supported for single-shot video generation." Clicking the Generate last frame from first frame option will switch you to a supporting quality automatically.

Can I use a last frame and multiple shots together?

No. Adding a last frame disables multi-shot, and vice versa. The hint reads "Multi-shot generation is unavailable when a last frame is added. Remove the last frame to use multi-shot generation."

What is the description box for?

The description tells the model how to animate the image — for example a pose or movement. The placeholder reads "Add description like pose / movement." On a quality tier that doesn't support descriptions, the box is replaced by a prompt to switch: it shows "Description" then "only available under" and a button with the supporting quality names you can tap to switch.

What is "Native audio" / the audio section?

The audio card pairs a Native audio toggle with the caption "Synchronized audio-video generation." Turning it on generates sound synced to the video. It's required on the highest tier (you'll see "Native audio is required for Ultra S quality" if you try to turn it off there).

What is a voice model and how do I add one?

When native audio is on, you can attach a Voice model so spoken lines use a specific voice. Use the "+ Voice model" button in the description footer (or Add voice model) to open the voice-model section and pick one. If audio is off, clicking the button turns native audio on first. With a voice model selected, you can drop it into a shot by typing "@" in the description and choosing it. You can't turn audio off while a voice model is still referenced — you'll see "A voice model is in use in the description. Remove the voice model mentions before turning off native audio."

What are Elements here?

Elements are reusable characters, products, or backgrounds you can mention in your description. The "+ Element" button opens the element column. Elements are available on the Ultra and Ultra S quality tiers — clicking "+ Element" on an unsupported tier auto-switches you to Ultra (the first element-supporting quality). In multi-shot mode, click into the shot you want before adding an element; if no shot is focused you'll see "Please choose a shot to add element." See Elements.

What happens if I switch quality after adding elements?

If the new quality doesn't support what you've added, a "Switch Quality Mode?" dialog appears: "Elements are only available for {qualities} video generation. Switching will clear all elements, descriptions and voice scripts. Do you want to continue?" Choose Confirm Switch to proceed or Cancel to keep your current quality.

How do multi-shot videos work?

When supported, click Add shot to add shots, each with its own description and camera movement. The maximum number of shots depends on duration: "{duration}-second videos support up to {count} shots. For more shots, increase duration to {nextDuration} seconds." If multi-shot isn't supported on your quality, the Add Shot button's tooltip points you to a supporting quality.

How much does it cost?

Video is priced per second of output, so the cost scales with your duration. The per-second rate is shown next to Generate and depends on your plan and the quality you pick (it also reflects resolution and options like native audio). See Credits & billing.

On mobile

Image to Video works on mobile with the same first-frame / last-frame, description, audio, and elements controls arranged for a narrow screen. The element and voice-model pills appear above the description editor, and quality/resolution/duration live in the bottom bar.

Did this answer your question?