bubbliiiing
commited on
Commit
ยท
ce5fe3b
1
Parent(s):
9a8a301
Readme
Browse files- README.md +132 -26
- README_en.md +131 -24
README.md
CHANGED
|
@@ -30,6 +30,17 @@ tags:
|
|
| 30 |
- [่ฎธๅฏ่ฏ](#่ฎธๅฏ่ฏ)
|
| 31 |
|
| 32 |
# ๆจกๅๅฐๅ
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 33 |
V1.0:
|
| 34 |
| ๅ็งฐ | ๅญๅจ็ฉบ้ด | Hugging Face | Model Scope | ๆ่ฟฐ |
|
| 35 |
|--|--|--|--|--|
|
|
@@ -40,21 +51,21 @@ V1.0:
|
|
| 40 |
|
| 41 |
# ่ง้ขไฝๅ
|
| 42 |
|
| 43 |
-
### Wan2.1-Fun-14B-InP && Wan2.1-Fun-1.3B-InP
|
| 44 |
|
| 45 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 46 |
<tr>
|
| 47 |
<td>
|
| 48 |
-
<video src="https://
|
| 49 |
</td>
|
| 50 |
<td>
|
| 51 |
-
<video src="https://
|
| 52 |
</td>
|
| 53 |
<td>
|
| 54 |
-
<video src="https://
|
| 55 |
</td>
|
| 56 |
<td>
|
| 57 |
-
<video src="https://
|
| 58 |
</td>
|
| 59 |
</tr>
|
| 60 |
</table>
|
|
@@ -62,56 +73,134 @@ V1.0:
|
|
| 62 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 63 |
<tr>
|
| 64 |
<td>
|
| 65 |
-
<video src="https://
|
| 66 |
</td>
|
| 67 |
<td>
|
| 68 |
-
<video src="https://
|
| 69 |
</td>
|
| 70 |
<td>
|
| 71 |
-
<video src="https://
|
| 72 |
</td>
|
| 73 |
<td>
|
| 74 |
-
<video src="https://
|
| 75 |
</td>
|
| 76 |
</tr>
|
| 77 |
</table>
|
| 78 |
|
| 79 |
-
### Wan2.1-Fun-14B-Control && Wan2.1-Fun-1.3B-Control
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 80 |
|
| 81 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 82 |
<tr>
|
| 83 |
<td>
|
| 84 |
-
<video src="https://
|
| 85 |
</td>
|
| 86 |
<td>
|
| 87 |
-
<video src="https://
|
| 88 |
</td>
|
| 89 |
<td>
|
| 90 |
-
<video src="https://
|
| 91 |
</td>
|
| 92 |
<tr>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 93 |
</table>
|
| 94 |
|
|
|
|
|
|
|
| 95 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 96 |
<tr>
|
| 97 |
<td>
|
| 98 |
-
|
| 99 |
</td>
|
| 100 |
<td>
|
| 101 |
-
|
| 102 |
</td>
|
| 103 |
<td>
|
| 104 |
-
|
| 105 |
</td>
|
| 106 |
<tr>
|
| 107 |
<td>
|
| 108 |
-
<video src="https://
|
| 109 |
</td>
|
| 110 |
<td>
|
| 111 |
-
<video src="https://
|
| 112 |
</td>
|
| 113 |
<td>
|
| 114 |
-
<video src="https://
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 115 |
</td>
|
| 116 |
</tr>
|
| 117 |
</table>
|
|
@@ -140,10 +229,10 @@ docker pull mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easycv/torch_cud
|
|
| 140 |
docker run -it -p 7860:7860 --network host --gpus all --security-opt seccomp:unconfined --shm-size 200g mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easycv/torch_cuda:cogvideox_fun
|
| 141 |
|
| 142 |
# clone code
|
| 143 |
-
git clone https://github.com/aigc-apps/
|
| 144 |
|
| 145 |
-
# enter
|
| 146 |
-
cd
|
| 147 |
|
| 148 |
# download weights
|
| 149 |
mkdir models/Diffusion_Transformer
|
|
@@ -155,8 +244,8 @@ mkdir models/Personalized_Model
|
|
| 155 |
# https://modelscope.cn/models/PAI/CogVideoX-Fun-V1.1-5b-InP
|
| 156 |
|
| 157 |
# Wan
|
| 158 |
-
# https://huggingface.co/alibaba-pai/Wan2.1-Fun-14B-InP
|
| 159 |
-
# https://modelscope.cn/models/PAI/Wan2.1-Fun-14B-InP
|
| 160 |
```
|
| 161 |
|
| 162 |
### 2. ๆฌๅฐๅฎ่ฃ
: ็ฏๅขๆฃๆฅ/ไธ่ฝฝ/ๅฎ่ฃ
|
|
@@ -184,13 +273,26 @@ Linux ็่ฏฆ็ปไฟกๆฏ๏ผ
|
|
| 184 |
#### b. ๆ้ๆพ็ฝฎ
|
| 185 |
ๆไปฌๆๅฅฝๅฐ[ๆ้](#model-zoo)ๆ็
งๆๅฎ่ทฏๅพ่ฟ่กๆพ็ฝฎ๏ผ
|
| 186 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 187 |
```
|
| 188 |
๐ฆ models/
|
| 189 |
โโโ ๐ Diffusion_Transformer/
|
| 190 |
โ โโโ ๐ CogVideoX-Fun-V1.1-2b-InP/
|
| 191 |
โ โโโ ๐ CogVideoX-Fun-V1.1-5b-InP/
|
| 192 |
-
โ โโโ ๐ Wan2.1-Fun-14B-InP
|
| 193 |
-
โ โโโ ๐ Wan2.1-Fun-1.3B-InP/
|
| 194 |
โโโ ๐ Personalized_Model/
|
| 195 |
โ โโโ your trained trainformer model / your trained lora model (for UI load)
|
| 196 |
```
|
|
@@ -243,6 +345,10 @@ webuiๆฏๆๆ็่ง้ขใๅพ็่ง้ขใ่ง้ข็่ง้ขๅๆฎ้ๆงๅถ็่ง้ข
|
|
| 243 |
- CogVideo: https://github.com/THUDM/CogVideo/
|
| 244 |
- EasyAnimate: https://github.com/aigc-apps/EasyAnimate
|
| 245 |
- Wan2.1: https://github.com/Wan-Video/Wan2.1/
|
|
|
|
|
|
|
|
|
|
|
|
|
| 246 |
|
| 247 |
# ่ฎธๅฏ่ฏ
|
| 248 |
ๆฌ้กน็ฎ้็จ [Apache License (Version 2.0)](https://github.com/modelscope/modelscope/blob/master/LICENSE).
|
|
|
|
| 30 |
- [่ฎธๅฏ่ฏ](#่ฎธๅฏ่ฏ)
|
| 31 |
|
| 32 |
# ๆจกๅๅฐๅ
|
| 33 |
+
|
| 34 |
+
V1.1:
|
| 35 |
+
| ๅ็งฐ | ๅญๅจ็ฉบ้ด | Hugging Face | Model Scope | ๆ่ฟฐ |
|
| 36 |
+
|--|--|--|--|--|
|
| 37 |
+
| Wan2.1-Fun-V1.1-1.3B-InP | 19.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-1.3B-InP) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-1.3B-InP) | Wan2.1-Fun-V1.1-1.3Bๆๅพ็่ง้ขๆ้๏ผไปฅๅคๅ่พจ็่ฎญ็ป๏ผๆฏๆ้ฆๅฐพๅพ้ขๆตใ |
|
| 38 |
+
| Wan2.1-Fun-V1.1-14B-InP | 47.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-14B-InP) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-14B-InP) | Wan2.1-Fun-V1.1-14Bๆๅพ็่ง้ขๆ้๏ผไปฅๅคๅ่พจ็่ฎญ็ป๏ผๆฏๆ้ฆๅฐพๅพ้ขๆตใ |
|
| 39 |
+
| Wan2.1-Fun-V1.1-1.3B-Control | 19.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-1.3B-Control) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-1.3B-Control)| Wan2.1-Fun-V1.1-1.3B่ง้ขๆงๅถๆ้ๆฏๆไธๅ็ๆงๅถๆกไปถ๏ผๅฆCannyใDepthใPoseใMLSD็ญ๏ผๆฏๆๅ่ๅพ + ๆงๅถๆกไปถ่ฟ่กๆงๅถ๏ผๆฏๆไฝฟ็จ่ฝจ่ฟนๆงๅถใๆฏๆๅคๅ่พจ็๏ผ512๏ผ768๏ผ1024๏ผ็่ง้ข้ขๆต๏ผๆฏๆๅคๅ่พจ็๏ผ512๏ผ768๏ผ1024๏ผ็่ง้ข้ขๆต๏ผไปฅ81ๅธงใๆฏ็ง16ๅธง่ฟ่ก่ฎญ็ป๏ผๆฏๆๅค่ฏญ่จ้ขๆต |
|
| 40 |
+
| Wan2.1-Fun-V1.1-14B-Control | 47.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-14B-Control) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-14B-Control)| Wan2.1-Fun-V1.1-14B่ง่ง้ขๆงๅถๆ้ๆฏๆไธๅ็ๆงๅถๆกไปถ๏ผๅฆCannyใDepthใPoseใMLSD็ญ๏ผๆฏๆๅ่ๅพ + ๆงๅถๆกไปถ่ฟ่กๆงๅถ๏ผๆฏๆไฝฟ็จ่ฝจ่ฟนๆงๅถใๆฏๆๅคๅ่พจ็๏ผ512๏ผ768๏ผ1024๏ผ็่ง้ข้ขๆต๏ผๆฏๆๅคๅ่พจ็๏ผ512๏ผ768๏ผ1024๏ผ็่ง้ข้ขๆต๏ผไปฅ81ๅธงใๆฏ็ง16ๅธง่ฟ่ก่ฎญ็ป๏ผๆฏๆๅค่ฏญ่จ้ขๆต |
|
| 41 |
+
| Wan2.1-Fun-V1.1-1.3B-Control-Camera | 19.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-1.3B-Control) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-1.3B-Control)| Wan2.1-Fun-V1.1-1.3B็ธๆบ้ๅคดๆงๅถๆ้ใๆฏๆๅคๅ่พจ็๏ผ512๏ผ768๏ผ1024๏ผ็่ง้ข้ขๆต๏ผๆฏๆๅคๅ่พจ็๏ผ512๏ผ768๏ผ1024๏ผ็่ง้ข้ขๆต๏ผไปฅ81ๅธงใๆฏ็ง16ๅธง่ฟ่ก่ฎญ็ป๏ผๆฏๆๅค่ฏญ่จ้ขๆต |
|
| 42 |
+
| Wan2.1-Fun-V1.1-14B-Control | 47.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-14B-Control) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-14B-Control)| Wan2.1-Fun-V1.1-14B็ธๆบ้ๅคดๆงๅถๆ้ใๆฏๆๅคๅ่พจ็๏ผ512๏ผ768๏ผ1024๏ผ็่ง้ข้ขๆต๏ผๆฏๆๅคๅ่พจ็๏ผ512๏ผ768๏ผ1024๏ผ็่ง้ข้ขๆต๏ผไปฅ81ๅธงใๆฏ็ง16ๅธง่ฟ่ก่ฎญ็ป๏ผๆฏๆๅค่ฏญ่จ้ขๆต |
|
| 43 |
+
|
| 44 |
V1.0:
|
| 45 |
| ๅ็งฐ | ๅญๅจ็ฉบ้ด | Hugging Face | Model Scope | ๆ่ฟฐ |
|
| 46 |
|--|--|--|--|--|
|
|
|
|
| 51 |
|
| 52 |
# ่ง้ขไฝๅ
|
| 53 |
|
| 54 |
+
### Wan2.1-Fun-V1.1-14B-InP && Wan2.1-Fun-V1.1-1.3B-InP
|
| 55 |
|
| 56 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 57 |
<tr>
|
| 58 |
<td>
|
| 59 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_1.mp4" width="100%" controls autoplay loop></video>
|
| 60 |
</td>
|
| 61 |
<td>
|
| 62 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_2.mp4" width="100%" controls autoplay loop></video>
|
| 63 |
</td>
|
| 64 |
<td>
|
| 65 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_3.mp4" width="100%" controls autoplay loop></video>
|
| 66 |
</td>
|
| 67 |
<td>
|
| 68 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_4.mp4" width="100%" controls autoplay loop></video>
|
| 69 |
</td>
|
| 70 |
</tr>
|
| 71 |
</table>
|
|
|
|
| 73 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 74 |
<tr>
|
| 75 |
<td>
|
| 76 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_5.mp4" width="100%" controls autoplay loop></video>
|
| 77 |
</td>
|
| 78 |
<td>
|
| 79 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_6.mp4" width="100%" controls autoplay loop></video>
|
| 80 |
</td>
|
| 81 |
<td>
|
| 82 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_7.mp4" width="100%" controls autoplay loop></video>
|
| 83 |
</td>
|
| 84 |
<td>
|
| 85 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_8.mp4" width="100%" controls autoplay loop></video>
|
| 86 |
</td>
|
| 87 |
</tr>
|
| 88 |
</table>
|
| 89 |
|
| 90 |
+
### Wan2.1-Fun-V1.1-14B-Control && Wan2.1-Fun-V1.1-1.3B-Control
|
| 91 |
+
|
| 92 |
+
Generic Control Video + Reference Image:
|
| 93 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 94 |
+
<tr>
|
| 95 |
+
<td>
|
| 96 |
+
Reference Image
|
| 97 |
+
</td>
|
| 98 |
+
<td>
|
| 99 |
+
Control Video
|
| 100 |
+
</td>
|
| 101 |
+
<td>
|
| 102 |
+
Wan2.1-Fun-V1.1-14B-Control
|
| 103 |
+
</td>
|
| 104 |
+
<td>
|
| 105 |
+
Wan2.1-Fun-V1.1-1.3B-Control
|
| 106 |
+
</td>
|
| 107 |
+
<tr>
|
| 108 |
+
<td>
|
| 109 |
+
<image src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/6.png" width="100%" controls autoplay loop></image>
|
| 110 |
+
</td>
|
| 111 |
+
<td>
|
| 112 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/pose.mp4" width="100%" controls autoplay loop></video>
|
| 113 |
+
</td>
|
| 114 |
+
<td>
|
| 115 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/14b_ref.mp4" width="100%" controls autoplay loop></video>
|
| 116 |
+
</td>
|
| 117 |
+
<td>
|
| 118 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/1_3b_ref.mp4" width="100%" controls autoplay loop></video>
|
| 119 |
+
</td>
|
| 120 |
+
<tr>
|
| 121 |
+
</table>
|
| 122 |
+
|
| 123 |
+
Generic Control Video (Canny, Pose, Depth, etc.) and Trajectory Control:
|
| 124 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 125 |
+
<tr>
|
| 126 |
+
<td>
|
| 127 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/guiji.mp4" width="100%" controls autoplay loop></video>
|
| 128 |
+
</td>
|
| 129 |
+
<td>
|
| 130 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/guiji_plus_out.mp4" width="100%" controls autoplay loop></video>
|
| 131 |
+
</td>
|
| 132 |
+
<td>
|
| 133 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/guiji_out.mp4" width="100%" controls autoplay loop></video>
|
| 134 |
+
</td>
|
| 135 |
+
<tr>
|
| 136 |
+
</table>
|
| 137 |
|
| 138 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 139 |
<tr>
|
| 140 |
<td>
|
| 141 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/pose.mp4" width="100%" controls autoplay loop></video>
|
| 142 |
</td>
|
| 143 |
<td>
|
| 144 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/canny.mp4" width="100%" controls autoplay loop></video>
|
| 145 |
</td>
|
| 146 |
<td>
|
| 147 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/depth.mp4" width="100%" controls autoplay loop></video>
|
| 148 |
</td>
|
| 149 |
<tr>
|
| 150 |
+
<td>
|
| 151 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/pose_out.mp4" width="100%" controls autoplay loop></video>
|
| 152 |
+
</td>
|
| 153 |
+
<td>
|
| 154 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/canny_out.mp4" width="100%" controls autoplay loop></video>
|
| 155 |
+
</td>
|
| 156 |
+
<td>
|
| 157 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/depth_out.mp4" width="100%" controls autoplay loop></video>
|
| 158 |
+
</td>
|
| 159 |
+
</tr>
|
| 160 |
</table>
|
| 161 |
|
| 162 |
+
### Wan2.1-Fun-V1.1-14B-Control-Camera && Wan2.1-Fun-V1.1-1.3B-Control-Camera
|
| 163 |
+
|
| 164 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 165 |
<tr>
|
| 166 |
<td>
|
| 167 |
+
Pan Up
|
| 168 |
</td>
|
| 169 |
<td>
|
| 170 |
+
Pan Left
|
| 171 |
</td>
|
| 172 |
<td>
|
| 173 |
+
Pan Right
|
| 174 |
</td>
|
| 175 |
<tr>
|
| 176 |
<td>
|
| 177 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/Pan_Up.mp4" width="100%" controls autoplay loop></video>
|
| 178 |
</td>
|
| 179 |
<td>
|
| 180 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/Pan_Left.mp4" width="100%" controls autoplay loop></video>
|
| 181 |
</td>
|
| 182 |
<td>
|
| 183 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/Pan_Right.mp4" width="100%" controls autoplay loop></video>
|
| 184 |
+
</td>
|
| 185 |
+
<tr>
|
| 186 |
+
<td>
|
| 187 |
+
Pan Down
|
| 188 |
+
</td>
|
| 189 |
+
<td>
|
| 190 |
+
Pan Up + Pan Left
|
| 191 |
+
</td>
|
| 192 |
+
<td>
|
| 193 |
+
Pan Up + Pan Right
|
| 194 |
+
</td>
|
| 195 |
+
<tr>
|
| 196 |
+
<td>
|
| 197 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/Pan_Down.mp4" width="100%" controls autoplay loop></video>
|
| 198 |
+
</td>
|
| 199 |
+
<td>
|
| 200 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/Pan_Left_Up.mp4" width="100%" controls autoplay loop></video>
|
| 201 |
+
</td>
|
| 202 |
+
<td>
|
| 203 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/Pan_Right_Up.mp4" width="100%" controls autoplay loop></video>
|
| 204 |
</td>
|
| 205 |
</tr>
|
| 206 |
</table>
|
|
|
|
| 229 |
docker run -it -p 7860:7860 --network host --gpus all --security-opt seccomp:unconfined --shm-size 200g mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easycv/torch_cuda:cogvideox_fun
|
| 230 |
|
| 231 |
# clone code
|
| 232 |
+
git clone https://github.com/aigc-apps/VideoX-Fun.git
|
| 233 |
|
| 234 |
+
# enter VideoX-Fun's dir
|
| 235 |
+
cd VideoX-Fun
|
| 236 |
|
| 237 |
# download weights
|
| 238 |
mkdir models/Diffusion_Transformer
|
|
|
|
| 244 |
# https://modelscope.cn/models/PAI/CogVideoX-Fun-V1.1-5b-InP
|
| 245 |
|
| 246 |
# Wan
|
| 247 |
+
# https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-14B-InP
|
| 248 |
+
# https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-14B-InP
|
| 249 |
```
|
| 250 |
|
| 251 |
### 2. ๆฌๅฐๅฎ่ฃ
: ็ฏๅขๆฃๆฅ/ไธ่ฝฝ/ๅฎ่ฃ
|
|
|
|
| 273 |
#### b. ๆ้ๆพ็ฝฎ
|
| 274 |
ๆไปฌๆๅฅฝๅฐ[ๆ้](#model-zoo)ๆ็
งๆๅฎ่ทฏๅพ่ฟ่กๆพ็ฝฎ๏ผ
|
| 275 |
|
| 276 |
+
**้่ฟcomfyui**๏ผ
|
| 277 |
+
ๅฐๆจกๅๆพๅ
ฅComfyui็ๆ้ๆไปถๅคน`ComfyUI/models/Fun_Models/`๏ผ
|
| 278 |
+
```
|
| 279 |
+
๐ฆ ComfyUI/
|
| 280 |
+
โโโ ๐ models/
|
| 281 |
+
โ โโโ ๐ Fun_Models/
|
| 282 |
+
โ โโโ ๐ CogVideoX-Fun-V1.1-2b-InP/
|
| 283 |
+
โ โโโ ๐ CogVideoX-Fun-V1.1-5b-InP/
|
| 284 |
+
โ โโโ ๐ Wan2.1-Fun-V1.1-14B-InP
|
| 285 |
+
โ โโโ ๐ Wan2.1-Fun-V1.1-1.3B-InP/
|
| 286 |
+
```
|
| 287 |
+
|
| 288 |
+
**่ฟ่ก่ช่บซ็pythonๆไปถๆui็้ข**:
|
| 289 |
```
|
| 290 |
๐ฆ models/
|
| 291 |
โโโ ๐ Diffusion_Transformer/
|
| 292 |
โ โโโ ๐ CogVideoX-Fun-V1.1-2b-InP/
|
| 293 |
โ โโโ ๐ CogVideoX-Fun-V1.1-5b-InP/
|
| 294 |
+
โ โโโ ๐ Wan2.1-Fun-V1.1-14B-InP
|
| 295 |
+
โ โโโ ๐ Wan2.1-Fun-V1.1-1.3B-InP/
|
| 296 |
โโโ ๐ Personalized_Model/
|
| 297 |
โ โโโ your trained trainformer model / your trained lora model (for UI load)
|
| 298 |
```
|
|
|
|
| 345 |
- CogVideo: https://github.com/THUDM/CogVideo/
|
| 346 |
- EasyAnimate: https://github.com/aigc-apps/EasyAnimate
|
| 347 |
- Wan2.1: https://github.com/Wan-Video/Wan2.1/
|
| 348 |
+
- ComfyUI-KJNodes: https://github.com/kijai/ComfyUI-KJNodes
|
| 349 |
+
- ComfyUI-EasyAnimateWrapper: https://github.com/kijai/ComfyUI-EasyAnimateWrapper
|
| 350 |
+
- ComfyUI-CameraCtrl-Wrapper: https://github.com/chaojie/ComfyUI-CameraCtrl-Wrapper
|
| 351 |
+
- CameraCtrl: https://github.com/hehao13/CameraCtrl
|
| 352 |
|
| 353 |
# ่ฎธๅฏ่ฏ
|
| 354 |
ๆฌ้กน็ฎ้็จ [Apache License (Version 2.0)](https://github.com/modelscope/modelscope/blob/master/LICENSE).
|
README_en.md
CHANGED
|
@@ -30,6 +30,17 @@ tags:
|
|
| 30 |
- [License](#license)
|
| 31 |
|
| 32 |
# Model zoo
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 33 |
V1.0:
|
| 34 |
| Name | Storage Space | Hugging Face | Model Scope | Description |
|
| 35 |
|--|--|--|--|--|
|
|
@@ -38,23 +49,24 @@ V1.0:
|
|
| 38 |
| Wan2.1-Fun-1.3B-Control | 19.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-1.3B-Control) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-1.3B-Control) | Wan2.1-Fun-1.3B video control weights, supporting various control conditions such as Canny, Depth, Pose, MLSD, etc., and trajectory control. Supports multi-resolution (512, 768, 1024) video prediction at 81 frames, trained at 16 frames per second, with multilingual prediction support. |
|
| 39 |
| Wan2.1-Fun-14B-Control | 47.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-14B-Control) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-14B-Control) | Wan2.1-Fun-14B video control weights, supporting various control conditions such as Canny, Depth, Pose, MLSD, etc., and trajectory control. Supports multi-resolution (512, 768, 1024) video prediction at 81 frames, trained at 16 frames per second, with multilingual prediction support. |
|
| 40 |
|
|
|
|
| 41 |
# Video Result
|
| 42 |
|
| 43 |
-
### Wan2.1-Fun-14B-InP && Wan2.1-Fun-1.3B-InP
|
| 44 |
|
| 45 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 46 |
<tr>
|
| 47 |
<td>
|
| 48 |
-
<video src="https://
|
| 49 |
</td>
|
| 50 |
<td>
|
| 51 |
-
<video src="https://
|
| 52 |
</td>
|
| 53 |
<td>
|
| 54 |
-
<video src="https://
|
| 55 |
</td>
|
| 56 |
<td>
|
| 57 |
-
<video src="https://
|
| 58 |
</td>
|
| 59 |
</tr>
|
| 60 |
</table>
|
|
@@ -62,56 +74,134 @@ V1.0:
|
|
| 62 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 63 |
<tr>
|
| 64 |
<td>
|
| 65 |
-
<video src="https://
|
| 66 |
</td>
|
| 67 |
<td>
|
| 68 |
-
<video src="https://
|
| 69 |
</td>
|
| 70 |
<td>
|
| 71 |
-
<video src="https://
|
| 72 |
</td>
|
| 73 |
<td>
|
| 74 |
-
<video src="https://
|
| 75 |
</td>
|
| 76 |
</tr>
|
| 77 |
</table>
|
| 78 |
|
| 79 |
-
### Wan2.1-Fun-14B-Control && Wan2.1-Fun-1.3B-Control
|
| 80 |
|
|
|
|
| 81 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 82 |
<tr>
|
| 83 |
<td>
|
| 84 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 85 |
</td>
|
| 86 |
<td>
|
| 87 |
-
<video src="https://
|
| 88 |
</td>
|
| 89 |
<td>
|
| 90 |
-
<video src="https://
|
|
|
|
|
|
|
|
|
|
| 91 |
</td>
|
| 92 |
<tr>
|
| 93 |
</table>
|
| 94 |
|
|
|
|
| 95 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 96 |
<tr>
|
| 97 |
<td>
|
| 98 |
-
<video src="https://
|
| 99 |
</td>
|
| 100 |
<td>
|
| 101 |
-
<video src="https://
|
| 102 |
</td>
|
| 103 |
<td>
|
| 104 |
-
<video src="https://
|
| 105 |
</td>
|
|
|
|
|
|
|
|
|
|
|
|
|
| 106 |
<tr>
|
| 107 |
<td>
|
| 108 |
-
<video src="https://
|
| 109 |
</td>
|
| 110 |
<td>
|
| 111 |
-
<video src="https://
|
| 112 |
</td>
|
| 113 |
<td>
|
| 114 |
-
<video src="https://
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 115 |
</td>
|
| 116 |
</tr>
|
| 117 |
</table>
|
|
@@ -142,10 +232,10 @@ docker pull mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easycv/torch_cud
|
|
| 142 |
docker run -it -p 7860:7860 --network host --gpus all --security-opt seccomp:unconfined --shm-size 200g mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easycv/torch_cuda:cogvideox_fun
|
| 143 |
|
| 144 |
# clone code
|
| 145 |
-
git clone https://github.com/aigc-apps/
|
| 146 |
|
| 147 |
-
# enter
|
| 148 |
-
cd
|
| 149 |
|
| 150 |
# download weights
|
| 151 |
mkdir models/Diffusion_Transformer
|
|
@@ -157,8 +247,8 @@ mkdir models/Personalized_Model
|
|
| 157 |
# https://modelscope.cn/models/PAI/CogVideoX-Fun-V1.1-5b-InP
|
| 158 |
|
| 159 |
# Wan
|
| 160 |
-
# https://huggingface.co/alibaba-pai/Wan2.1-Fun-14B-InP
|
| 161 |
-
# https://modelscope.cn/models/PAI/Wan2.1-Fun-14B-InP
|
| 162 |
```
|
| 163 |
|
| 164 |
### 2. Local install: Environment Check/Downloading/Installation
|
|
@@ -186,6 +276,19 @@ We need about 60GB available on disk (for saving weights), please check!
|
|
| 186 |
#### b. Weights
|
| 187 |
We'd better place the [weights](#model-zoo) along the specified path:
|
| 188 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 189 |
```
|
| 190 |
๐ฆ models/
|
| 191 |
โโโ ๐ Diffusion_Transformer/
|
|
@@ -244,6 +347,10 @@ The web UI supports text-to-video, image-to-video, video-to-video, and controlle
|
|
| 244 |
- CogVideo: https://github.com/THUDM/CogVideo/
|
| 245 |
- EasyAnimate: https://github.com/aigc-apps/EasyAnimate
|
| 246 |
- Wan2.1: https://github.com/Wan-Video/Wan2.1/
|
|
|
|
|
|
|
|
|
|
|
|
|
| 247 |
|
| 248 |
# License
|
| 249 |
This project is licensed under the [Apache License (Version 2.0)](https://github.com/modelscope/modelscope/blob/master/LICENSE).
|
|
|
|
| 30 |
- [License](#license)
|
| 31 |
|
| 32 |
# Model zoo
|
| 33 |
+
|
| 34 |
+
V1.1:
|
| 35 |
+
| Name | Storage Size | Hugging Face | Model Scope | Description |
|
| 36 |
+
|------|--------------|--------------|-------------|-------------|
|
| 37 |
+
| Wan2.1-Fun-V1.1-1.3B-InP | 19.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-1.3B-InP) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-1.3B-InP) | Wan2.1-Fun-V1.1-1.3B text-to-video generation weights, trained at multiple resolutions, supports start-end image prediction. |
|
| 38 |
+
| Wan2.1-Fun-V1.1-14B-InP | 47.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-14B-InP) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-14B-InP) | Wan2.1-Fun-V1.1-14B text-to-video generation weights, trained at multiple resolutions, supports start-end image prediction. |
|
| 39 |
+
| Wan2.1-Fun-V1.1-1.3B-Control | 19.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-1.3B-Control) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-1.3B-Control) | Wan2.1-Fun-V1.1-1.3B video control weights support various control conditions such as Canny, Depth, Pose, MLSD, etc., supports reference image + control condition-based control, and trajectory control. Supports multi-resolution (512, 768, 1024) video prediction, trained with 81 frames at 16 FPS, supports multilingual prediction. |
|
| 40 |
+
| Wan2.1-Fun-V1.1-14B-Control | 47.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-14B-Control) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-14B-Control) | Wan2.1-Fun-V1.1-14B video control weights support various control conditions such as Canny, Depth, Pose, MLSD, etc., supports reference image + control condition-based control, and trajectory control. Supports multi-resolution (512, 768, 1024) video prediction, trained with 81 frames at 16 FPS, supports multilingual prediction. |
|
| 41 |
+
| Wan2.1-Fun-V1.1-1.3B-Control-Camera | 19.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-1.3B-Control) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-1.3B-Control) | Wan2.1-Fun-V1.1-1.3B camera lens control weights. Supports multi-resolution (512, 768, 1024) video prediction, trained with 81 frames at 16 FPS, supports multilingual prediction. |
|
| 42 |
+
| Wan2.1-Fun-V1.1-14B-Control-Camera | 47.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-14B-Control) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-14B-Control) | Wan2.1-Fun-V1.1-14B camera lens control weights. Supports multi-resolution (512, 768, 1024) video prediction, trained with 81 frames at 16 FPS, supports multilingual prediction. |
|
| 43 |
+
|
| 44 |
V1.0:
|
| 45 |
| Name | Storage Space | Hugging Face | Model Scope | Description |
|
| 46 |
|--|--|--|--|--|
|
|
|
|
| 49 |
| Wan2.1-Fun-1.3B-Control | 19.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-1.3B-Control) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-1.3B-Control) | Wan2.1-Fun-1.3B video control weights, supporting various control conditions such as Canny, Depth, Pose, MLSD, etc., and trajectory control. Supports multi-resolution (512, 768, 1024) video prediction at 81 frames, trained at 16 frames per second, with multilingual prediction support. |
|
| 50 |
| Wan2.1-Fun-14B-Control | 47.0 GB | [๐คLink](https://huggingface.co/alibaba-pai/Wan2.1-Fun-14B-Control) | [๐Link](https://modelscope.cn/models/PAI/Wan2.1-Fun-14B-Control) | Wan2.1-Fun-14B video control weights, supporting various control conditions such as Canny, Depth, Pose, MLSD, etc., and trajectory control. Supports multi-resolution (512, 768, 1024) video prediction at 81 frames, trained at 16 frames per second, with multilingual prediction support. |
|
| 51 |
|
| 52 |
+
|
| 53 |
# Video Result
|
| 54 |
|
| 55 |
+
### Wan2.1-Fun-V1.1-14B-InP && Wan2.1-Fun-V1.1-1.3B-InP
|
| 56 |
|
| 57 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 58 |
<tr>
|
| 59 |
<td>
|
| 60 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_1.mp4" width="100%" controls autoplay loop></video>
|
| 61 |
</td>
|
| 62 |
<td>
|
| 63 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_2.mp4" width="100%" controls autoplay loop></video>
|
| 64 |
</td>
|
| 65 |
<td>
|
| 66 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_3.mp4" width="100%" controls autoplay loop></video>
|
| 67 |
</td>
|
| 68 |
<td>
|
| 69 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_4.mp4" width="100%" controls autoplay loop></video>
|
| 70 |
</td>
|
| 71 |
</tr>
|
| 72 |
</table>
|
|
|
|
| 74 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 75 |
<tr>
|
| 76 |
<td>
|
| 77 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_5.mp4" width="100%" controls autoplay loop></video>
|
| 78 |
</td>
|
| 79 |
<td>
|
| 80 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_6.mp4" width="100%" controls autoplay loop></video>
|
| 81 |
</td>
|
| 82 |
<td>
|
| 83 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_7.mp4" width="100%" controls autoplay loop></video>
|
| 84 |
</td>
|
| 85 |
<td>
|
| 86 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/inp_8.mp4" width="100%" controls autoplay loop></video>
|
| 87 |
</td>
|
| 88 |
</tr>
|
| 89 |
</table>
|
| 90 |
|
| 91 |
+
### Wan2.1-Fun-V1.1-14B-Control && Wan2.1-Fun-V1.1-1.3B-Control
|
| 92 |
|
| 93 |
+
Generic Control Video + Reference Image:
|
| 94 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 95 |
<tr>
|
| 96 |
<td>
|
| 97 |
+
Reference Image
|
| 98 |
+
</td>
|
| 99 |
+
<td>
|
| 100 |
+
Control Video
|
| 101 |
+
</td>
|
| 102 |
+
<td>
|
| 103 |
+
Wan2.1-Fun-V1.1-14B-Control
|
| 104 |
+
</td>
|
| 105 |
+
<td>
|
| 106 |
+
Wan2.1-Fun-V1.1-1.3B-Control
|
| 107 |
+
</td>
|
| 108 |
+
<tr>
|
| 109 |
+
<td>
|
| 110 |
+
<image src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/6.png" width="100%" controls autoplay loop></image>
|
| 111 |
</td>
|
| 112 |
<td>
|
| 113 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/pose.mp4" width="100%" controls autoplay loop></video>
|
| 114 |
</td>
|
| 115 |
<td>
|
| 116 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/14b_ref.mp4" width="100%" controls autoplay loop></video>
|
| 117 |
+
</td>
|
| 118 |
+
<td>
|
| 119 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/1_3b_ref.mp4" width="100%" controls autoplay loop></video>
|
| 120 |
</td>
|
| 121 |
<tr>
|
| 122 |
</table>
|
| 123 |
|
| 124 |
+
Generic Control Video (Canny, Pose, Depth, etc.) and Trajectory Control:
|
| 125 |
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 126 |
<tr>
|
| 127 |
<td>
|
| 128 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/guiji.mp4" width="100%" controls autoplay loop></video>
|
| 129 |
</td>
|
| 130 |
<td>
|
| 131 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/guiji_plus_out.mp4" width="100%" controls autoplay loop></video>
|
| 132 |
</td>
|
| 133 |
<td>
|
| 134 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/guiji_out.mp4" width="100%" controls autoplay loop></video>
|
| 135 |
</td>
|
| 136 |
+
<tr>
|
| 137 |
+
</table>
|
| 138 |
+
|
| 139 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 140 |
<tr>
|
| 141 |
<td>
|
| 142 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/pose.mp4" width="100%" controls autoplay loop></video>
|
| 143 |
</td>
|
| 144 |
<td>
|
| 145 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/canny.mp4" width="100%" controls autoplay loop></video>
|
| 146 |
</td>
|
| 147 |
<td>
|
| 148 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/depth.mp4" width="100%" controls autoplay loop></video>
|
| 149 |
+
</td>
|
| 150 |
+
<tr>
|
| 151 |
+
<td>
|
| 152 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/pose_out.mp4" width="100%" controls autoplay loop></video>
|
| 153 |
+
</td>
|
| 154 |
+
<td>
|
| 155 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/canny_out.mp4" width="100%" controls autoplay loop></video>
|
| 156 |
+
</td>
|
| 157 |
+
<td>
|
| 158 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/depth_out.mp4" width="100%" controls autoplay loop></video>
|
| 159 |
+
</td>
|
| 160 |
+
</tr>
|
| 161 |
+
</table>
|
| 162 |
+
|
| 163 |
+
### Wan2.1-Fun-V1.1-14B-Control-Camera && Wan2.1-Fun-V1.1-1.3B-Control-Camera
|
| 164 |
+
|
| 165 |
+
<table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
|
| 166 |
+
<tr>
|
| 167 |
+
<td>
|
| 168 |
+
Pan Up
|
| 169 |
+
</td>
|
| 170 |
+
<td>
|
| 171 |
+
Pan Left
|
| 172 |
+
</td>
|
| 173 |
+
<td>
|
| 174 |
+
Pan Right
|
| 175 |
+
</td>
|
| 176 |
+
<tr>
|
| 177 |
+
<td>
|
| 178 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/Pan_Up.mp4" width="100%" controls autoplay loop></video>
|
| 179 |
+
</td>
|
| 180 |
+
<td>
|
| 181 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/Pan_Left.mp4" width="100%" controls autoplay loop></video>
|
| 182 |
+
</td>
|
| 183 |
+
<td>
|
| 184 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/Pan_Right.mp4" width="100%" controls autoplay loop></video>
|
| 185 |
+
</td>
|
| 186 |
+
<tr>
|
| 187 |
+
<td>
|
| 188 |
+
Pan Down
|
| 189 |
+
</td>
|
| 190 |
+
<td>
|
| 191 |
+
Pan Up + Pan Left
|
| 192 |
+
</td>
|
| 193 |
+
<td>
|
| 194 |
+
Pan Up + Pan Right
|
| 195 |
+
</td>
|
| 196 |
+
<tr>
|
| 197 |
+
<td>
|
| 198 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/Pan_Down.mp4" width="100%" controls autoplay loop></video>
|
| 199 |
+
</td>
|
| 200 |
+
<td>
|
| 201 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/Pan_Left_Up.mp4" width="100%" controls autoplay loop></video>
|
| 202 |
+
</td>
|
| 203 |
+
<td>
|
| 204 |
+
<video src="https://pai-aigc-photog.oss-cn-hangzhou.aliyuncs.com/wan_fun/asset/v1.1/Pan_Right_Up.mp4" width="100%" controls autoplay loop></video>
|
| 205 |
</td>
|
| 206 |
</tr>
|
| 207 |
</table>
|
|
|
|
| 232 |
docker run -it -p 7860:7860 --network host --gpus all --security-opt seccomp:unconfined --shm-size 200g mybigpai-public-registry.cn-beijing.cr.aliyuncs.com/easycv/torch_cuda:cogvideox_fun
|
| 233 |
|
| 234 |
# clone code
|
| 235 |
+
git clone https://github.com/aigc-apps/VideoX-Fun.git
|
| 236 |
|
| 237 |
+
# enter VideoX-Fun's dir
|
| 238 |
+
cd VideoX-Fun
|
| 239 |
|
| 240 |
# download weights
|
| 241 |
mkdir models/Diffusion_Transformer
|
|
|
|
| 247 |
# https://modelscope.cn/models/PAI/CogVideoX-Fun-V1.1-5b-InP
|
| 248 |
|
| 249 |
# Wan
|
| 250 |
+
# https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-14B-InP
|
| 251 |
+
# https://modelscope.cn/models/PAI/Wan2.1-Fun-V1.1-14B-InP
|
| 252 |
```
|
| 253 |
|
| 254 |
### 2. Local install: Environment Check/Downloading/Installation
|
|
|
|
| 276 |
#### b. Weights
|
| 277 |
We'd better place the [weights](#model-zoo) along the specified path:
|
| 278 |
|
| 279 |
+
**Via ComfyUI**:
|
| 280 |
+
Put the models into the ComfyUI weights folder `ComfyUI/models/Fun_Models/`:
|
| 281 |
+
```
|
| 282 |
+
๐ฆ ComfyUI/
|
| 283 |
+
โโโ ๐ models/
|
| 284 |
+
โ โโโ ๐ Fun_Models/
|
| 285 |
+
โ โโโ ๐ CogVideoX-Fun-V1.1-2b-InP/
|
| 286 |
+
โ โโโ ๐ CogVideoX-Fun-V1.1-5b-InP/
|
| 287 |
+
โ โโโ ๐ Wan2.1-Fun-14B-InP
|
| 288 |
+
โ โโโ ๐ Wan2.1-Fun-1.3B-InP/
|
| 289 |
+
```
|
| 290 |
+
|
| 291 |
+
**Run its own python file or UI interface**:
|
| 292 |
```
|
| 293 |
๐ฆ models/
|
| 294 |
โโโ ๐ Diffusion_Transformer/
|
|
|
|
| 347 |
- CogVideo: https://github.com/THUDM/CogVideo/
|
| 348 |
- EasyAnimate: https://github.com/aigc-apps/EasyAnimate
|
| 349 |
- Wan2.1: https://github.com/Wan-Video/Wan2.1/
|
| 350 |
+
- ComfyUI-KJNodes: https://github.com/kijai/ComfyUI-KJNodes
|
| 351 |
+
- ComfyUI-EasyAnimateWrapper: https://github.com/kijai/ComfyUI-EasyAnimateWrapper
|
| 352 |
+
- ComfyUI-CameraCtrl-Wrapper: https://github.com/chaojie/ComfyUI-CameraCtrl-Wrapper
|
| 353 |
+
- CameraCtrl: https://github.com/hehao13/CameraCtrl
|
| 354 |
|
| 355 |
# License
|
| 356 |
This project is licensed under the [Apache License (Version 2.0)](https://github.com/modelscope/modelscope/blob/master/LICENSE).
|