Image2Video
BETA
See example prompts in the gallery.
Pricing for video generation.
Overview
Astria Image2Video uses the open-source WAN 2.1 image2video model to generate videos from images. WAN 2.1 is a diffusion model trained on a large dataset of images and videos. It can generate videos at 720p or 480p resolution.
Syntax
The parameters below are written as text inside the prompt.
--video
Enables video generation after generating an image. If an input image is provided with denoising_strength=0, the input image is used as the first frame of the video.
Example: <lora:1533312:1.0> ohwx woman hiking in the alps --video
--video_prompt
A prompt for the video model which describes camera movement, scene, and object interactions. If not provided, the image prompt will be used for the video. The video_prompt must be surrounded by double quotes, and should not include the token or Flux LoRA that were used for the image prompt.
Example: <lora:1533312:1.0> ohwx woman hiking in the alps --video_prompt "Woman looking at the camera, smiling, puts hands on her hips, confident"
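The two rules for --video_prompt (double quotes around the value, no token or LoRA tag inside it) can be checked before a prompt is submitted. The following is a minimal sketch, not part of Astria's tooling: the helper name check_video_prompt and the default token ohwx are assumptions made for illustration.

```python
import re

# Captures the quoted text that follows --video_prompt.
VIDEO_PROMPT_RE = re.compile(r'--video_prompt\s+"([^"]*)"')
# Matches a LoRA tag such as <lora:1533312:1.0>.
LORA_TAG_RE = re.compile(r"<lora:[^>]+>")


def check_video_prompt(prompt_text, token="ohwx"):
    """Return a list of problems found in the --video_prompt part of a prompt.

    Hypothetical helper: the token default is an assumption taken from the
    examples on this page.
    """
    if "--video_prompt" not in prompt_text:
        # Nothing to check: the image prompt is reused for the video.
        return []
    match = VIDEO_PROMPT_RE.search(prompt_text)
    if match is None:
        return ["--video_prompt value must be surrounded by double quotes"]
    video_prompt = match.group(1)
    problems = []
    if LORA_TAG_RE.search(video_prompt):
        problems.append("video prompt should not include a LoRA tag")
    if token and re.search(rf"\b{re.escape(token)}\b", video_prompt):
        problems.append("video prompt should not include the token")
    return problems


if __name__ == "__main__":
    text = ('<lora:1533312:1.0> ohwx woman hiking in the alps '
            '--video_prompt "Woman looking at the camera, smiling"')
    print(check_video_prompt(text))
```

An empty list means the video prompt follows both rules; each string in a non-empty list names one violation.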
--video_model
enum: 720p, 480p
720p is used by default.
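The flags above can be assembled into a prompt string programmatically. This is a rough sketch under stated assumptions: build_prompt is a hypothetical helper, and writing the --video_model value after a space (for example --video_model 480p) is an assumption, since this page lists the allowed values but does not show the flag in use.

```python
VIDEO_MODELS = ("720p", "480p")  # allowed values listed for --video_model


def build_prompt(image_prompt, video_prompt=None, video_model=None):
    """Append video flags to an image prompt (hypothetical helper)."""
    parts = [image_prompt.strip(), "--video"]
    if video_prompt is not None:
        if '"' in video_prompt:
            # The value is wrapped in double quotes, so it cannot contain one.
            raise ValueError("video_prompt must not contain double quotes")
        parts.append(f'--video_prompt "{video_prompt}"')
    if video_model is not None:
        if video_model not in VIDEO_MODELS:
            raise ValueError(f"video_model must be one of {VIDEO_MODELS}")
        # Assumption: the value follows the flag, separated by a space.
        parts.append(f"--video_model {video_model}")
    return " ".join(parts)


if __name__ == "__main__":
    print(build_prompt(
        "<lora:1533312:1.0> ohwx woman hiking in the alps",
        video_prompt="Woman looking at the camera, smiling",
    ))
```

Leaving video_model unset keeps the default of 720p described above.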
Example
Here is an example prompt, followed by the parameters it was generated with:
<lora:1533312:1.0> A highly detailed image of thoughtful ohwx woman exploring a hidden urban garden, her J. Crew khaki midi skirt and linen blouse complementing the peaceful surroundings --video --video_prompt "Woman looking at the camera, smiling, puts hands on her hips, confident"
num_images=1
negative_prompt=
seed=
steps=
cfg_scale=
controlnet=
input_image_url=
mask_image_url=
denoising_strength=
controlnet_conditioning_scale=
controlnet_txt2img=false
super_resolution=true
inpaint_faces=true
face_correct=false
film_grain=false
face_swap=false
hires_fix=false
backend_version=0
ar=1:1
scheduler=euler_a
color_grading=
use_lpw=false
w=1280
h=720
style=
tune_id=1504944
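A key=value listing like the one above can be read into a dictionary, dropping the fields that were left empty. This is an illustrative sketch only: parse_params is a hypothetical helper, and how these keys map onto API request fields is not covered on this page.

```python
def parse_params(listing):
    """Parse key=value lines into a dict, skipping empty values."""
    params = {}
    for line in listing.splitlines():
        line = line.strip()
        if not line or "=" not in line:
            continue
        key, value = line.split("=", 1)
        if value == "":
            continue  # field left unset in the listing
        params[key] = value
    return params


# A shortened copy of the listing above.
listing = """num_images=1
negative_prompt=
ar=1:1
w=1280
h=720
tune_id=1504944
"""
params = parse_params(listing)

if __name__ == "__main__":
    print(params)
```

All values stay strings here; converting w, h, or num_images to integers is left to the caller.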