Commit Graph

10 Commits

Author SHA1 Message Date
songruizz
15e4e0bdb8 Using fa3 through option '--enable-fa3' 2025-05-29 15:36:16 +08:00
songrui.771
2fc28056fa fix performance regression due to Flash Attention duplicated code 2025-05-27 16:14:07 +08:00
kousakawang
ed01a1e721
Merge branch 'release_0.0.3' into mul_device_teacache 2025-05-26 21:58:25 +08:00
kousakawang
8bbcee625e add teacache switch 2025-05-26 15:25:46 +08:00
kousakawang
3e209f8b45 [FEAT] enable multi-device teacache 2025-05-26 14:46:48 +08:00
songrui.771
4c35b3fd58 add support for FA3 on multi-gpu inference 2025-05-26 14:46:33 +08:00
Ang Wang
76e9427657
Format the code (#402)
* isort the code

* format the code

* Add yapf config file

* Remove torch cuda memory profiler
2025-05-16 12:35:38 +08:00
Ang Wang
18d53feb7a
[feature] Add VACE (#389)
* Add VACE

* Support training with multiple gpus

* Update default args for vace task

* vace block update

* Add vace exmaple jpg

* Fix dist vace fwd hook error

* Update vace exmample

* Update vace args

* Update pipeline name for vace

* vace gradio and Readme

* Update vace snake png

---------

Co-authored-by: hanzhn <han.feng.jason@gmail.com>
2025-05-14 20:44:25 +08:00
Wenting Shen
bc3249d61c
add free stroage api for FSDP (#277)
Signed-off-by: wenting.swt <wenting.swt@alibaba-inc.com>
2025-03-26 15:49:37 +08:00
WanX-Video-1
65386b2e03 init upload 2025-02-25 22:07:47 +08:00