goenhance logo

Wan 2.1 AI Video Generator

Alibaba Cloud's Top-Ranking Open-Source AI Video Generation Model – Achieving an impressive 84.7% overall score on VBench. Leverage cutting-edge AI technology to turn your concepts into high-quality videos. Try Wan 2.1 on GoEnhance AI below!
Try Wan 2.1 Now

Key Features of Wan 2.1

High-Fidelity Video Generation

Fast video generation within 2 minutes, allowing you to quickly create and iterate on your video content

Precise Movement Control

Transform your text prompts into high-definition 5-second videos with crystal clear 1280x720 resolution

Multi-Object Interaction Handling

Transform static images into dynamic videos with our advanced AI technology, bringing still pictures to life through smooth and natural animations

Leading Performance

Wan 2.1 has leading performance on the VBench benchmark (84.7% overall score)

How To Use Wan 2.1?

01

Enter Your Prompt

Provide a description or upload an image to begin transforming it into a video.

02

Customize Settings

Adjust video settings(prompt, ratio) before Wan 2.1 starts to process your inputs.

03

Download Your Video

Wan 2.1 will generate a video that you can save if satisfied with the result.

Frequently Asked Questions

What is Wan 2.1?

Wan 2.1, also referred to as Wan2.1 or Tongyi Wanxiang 2.1, is an advanced AI video generation model developed by Alibaba Cloud. Launched in July 2023 and recently updated, it currently holds the top spot on the VBench leaderboard with an impressive 84.7% overall score. The model excels in key areas such as dynamic range (91.7%), spatial relationships (87.5%), and multi-object interactions (85.4%). Leveraging cutting-edge VAE (Variational Autoencoder) and DiT (Denoising Diffusion Transformer) technologies, WanX 2.1 can generate high-quality videos at up to 1080p resolution.

How does Wan 2.1 work?

Wan 2.1 (Wan2.1) utilizes a multimodal large model to convert text inputs into high-quality videos. By incorporating its proprietary VAE and DiT frameworks, it enhances both temporal and spatial relationships, resulting in more realistic visuals, especially in scenes involving complex motion and physical interactions. The model employs a comprehensive space-time attention mechanism to accurately replicate real-world dynamics and leverages ultra-long context for smooth and precise integration of text instructions into the video creation process.

What are the standout features of Wan 2.1?

Wan 2.1 offers several standout features, including high-quality video generation up to 1080p resolution, precise control over movement, and the ability to handle multi-object interactions. It supports both Chinese and English text inputs, ensuring versatility. The model delivers exceptional visual quality and temporal consistency, achieving a top performance with an 84.7% overall score on the VBench benchmark. WanX 2.1 excels at generating videos with complex bodily movements, intricate rotations, and precise body coordination, all while maintaining realistic motion trajectories.

Is it free to use Wan 2.1?

Wan 2.1 operates on a freemium model. You can generate videos for free with limited credits, but there are subscription options available for additional features through WanX 2.1.

Wan 2.1 Support Models?

Wan 2.1 offers several models: T2V-14B (480P/720P), T2V-1.3B (480P), I2V-14B-720P, I2V-14B-480P, and Text-to-Image functionality with any model. All are available on Hugging Face and ModelScope.

Hardware Requirements?

T2V-1.3B needs only 8.19GB VRAM (works on RTX 4090). 14B models require high-end GPUs with 24GB+ VRAM or multi-GPU setups. Use --offload_model True and --t5_cpu to reduce memory usage on limited hardware.

How to Improve Video Quality?

Enable prompt extension with --use_prompt_extend. Choose higher resolution (720P) for better quality. For T2V-1.3B, set --sample_guide_scale 6 and adjust --sample_shift (8-12). Use multi-GPU processing for best results.

Wan 2.1 Advantages?

Superior performance over other models. Versatile for multiple tasks (T2V, I2V, editing). Generates Chinese/English text in videos. Advanced WanX-VAE for efficient processing. Consumer-grade GPU compatibility with T2V-1.3B model.

What Our Users Say?

Try Wan 2.1 on GoEnhance AI

Explore the powerful Wan 2.1 video generator

Try Wan 2.1