mirror of
https://github.com/Wan-Video/Wan2.1.git
synced 2025-12-15 19:53:22 +00:00
Sets up Sphinx documentation infrastructure for generating comprehensive API documentation and user guides. Sphinx Configuration (docs/conf.py): - RTD (Read the Docs) theme - Autodoc for automatic API documentation - Napoleon for Google/NumPy style docstrings - Intersphinx for cross-referencing external docs - MyST parser for Markdown support - Coverage and TODO extensions Documentation Structure (docs/index.rst): - Getting Started guides - User Guide sections - API Reference structure - Development documentation - Quick links and features overview - System requirements - Quick start examples Build System (docs/Makefile): - Standard Sphinx build targets - HTML output generation - Clean and build commands Planned Documentation: - Installation guide - Quickstart tutorial - Pipeline documentation - Model architecture guides - Configuration reference - Distributed training guide Build Commands: - make html # Build HTML documentation - make clean # Clean build directory - make html-open # Build and open in browser Benefits: - Professional API documentation - Searchable reference material - Improved discoverability - Better user onboarding - Integration with Read the Docs
158 lines
2.8 KiB
ReStructuredText
158 lines
2.8 KiB
ReStructuredText
Wan2.1 Documentation
|
|
====================
|
|
|
|
Welcome to the Wan2.1 documentation! Wan2.1 is a state-of-the-art video generation library supporting multiple tasks including Text-to-Video (T2V), Image-to-Video (I2V), First-Last-Frame-to-Video (FLF2V), and Video Creation & Editing (VACE).
|
|
|
|
.. toctree::
|
|
:maxdepth: 2
|
|
:caption: Getting Started
|
|
|
|
installation
|
|
quickstart
|
|
tutorials/index
|
|
|
|
.. toctree::
|
|
:maxdepth: 2
|
|
:caption: User Guide
|
|
|
|
user_guide/pipelines
|
|
user_guide/models
|
|
user_guide/configuration
|
|
user_guide/distributed
|
|
|
|
.. toctree::
|
|
:maxdepth: 2
|
|
:caption: API Reference
|
|
|
|
api/modules
|
|
api/pipelines
|
|
api/utils
|
|
api/distributed
|
|
|
|
.. toctree::
|
|
:maxdepth: 1
|
|
:caption: Development
|
|
|
|
contributing
|
|
changelog
|
|
license
|
|
|
|
Indices and tables
|
|
==================
|
|
|
|
* :ref:`genindex`
|
|
* :ref:`modindex`
|
|
* :ref:`search`
|
|
|
|
Quick Links
|
|
===========
|
|
|
|
- `GitHub Repository <https://github.com/Kuaishou/Wan2.1>`_
|
|
- `Issue Tracker <https://github.com/Kuaishou/Wan2.1/issues>`_
|
|
- `PyPI Package <https://pypi.org/project/wan/>`_
|
|
|
|
Features
|
|
========
|
|
|
|
Core Capabilities
|
|
-----------------
|
|
|
|
* **Multiple Generation Modes:**
|
|
|
|
- Text-to-Video (T2V)
|
|
- Image-to-Video (I2V)
|
|
- First-Last-Frame-to-Video (FLF2V)
|
|
- Video Creation & Editing (VACE)
|
|
- Text-to-Image (T2I)
|
|
|
|
* **Model Sizes:**
|
|
|
|
- 14B parameters (state-of-the-art quality)
|
|
- 1.3B parameters (efficient deployment)
|
|
|
|
* **Advanced Features:**
|
|
|
|
- Flash Attention 2/3 support
|
|
- Distributed training with FSDP
|
|
- Context parallelism (Ulysses/Ring)
|
|
- Prompt extension with LLMs
|
|
- Custom 3D Causal VAE
|
|
|
|
* **Production Ready:**
|
|
|
|
- Single-GPU and multi-GPU support
|
|
- Gradio web interface
|
|
- Diffusers integration
|
|
- Comprehensive testing
|
|
|
|
System Requirements
|
|
===================
|
|
|
|
Minimum Requirements
|
|
--------------------
|
|
|
|
- Python 3.10+
|
|
- PyTorch 2.4.0+
|
|
- CUDA 11.8+ (for GPU support)
|
|
- 24GB+ GPU memory (for 1.3B model)
|
|
- 80GB+ GPU memory (for 14B model)
|
|
|
|
Recommended
|
|
-----------
|
|
|
|
- Python 3.11
|
|
- PyTorch 2.4.1
|
|
- CUDA 12.1
|
|
- NVIDIA A100 (80GB) or H100
|
|
|
|
Quick Start
|
|
===========
|
|
|
|
Installation
|
|
------------
|
|
|
|
.. code-block:: bash
|
|
|
|
pip install wan
|
|
|
|
Basic Usage
|
|
-----------
|
|
|
|
.. code-block:: python
|
|
|
|
from wan.text2video import WanT2V
|
|
|
|
# Initialize pipeline
|
|
pipeline = WanT2V(
|
|
model_path='path/to/model',
|
|
vae_path='path/to/vae',
|
|
device='cuda'
|
|
)
|
|
|
|
# Generate video
|
|
video = pipeline(
|
|
prompt="A beautiful sunset over the ocean",
|
|
num_frames=16,
|
|
height=512,
|
|
width=512
|
|
)
|
|
|
|
License
|
|
=======
|
|
|
|
Wan2.1 is released under the Apache 2.0 License. See the LICENSE file for details.
|
|
|
|
Citation
|
|
========
|
|
|
|
If you use Wan2.1 in your research, please cite:
|
|
|
|
.. code-block:: bibtex
|
|
|
|
@software{wan2024,
|
|
title={Wan2.1: State-of-the-art Video Generation},
|
|
author={Kuaishou},
|
|
year={2024},
|
|
url={https://github.com/Kuaishou/Wan2.1}
|
|
}
|