Commit graph

270 commits

Author SHA1 Message Date
Shen-Chenhui
04a0e4769b debug 2024-04-08 17:33:54 +08:00
Shen-Chenhui
fb64216b45 debug 2024-04-08 17:32:30 +08:00
Shen-Chenhui
25707a9d7d debug 2024-04-08 17:28:31 +08:00
Shen-Chenhui
9c8d084ec5 debug 2024-04-08 17:26:36 +08:00
Shen-Chenhui
eae30f9f89 debug 2024-04-08 17:23:53 +08:00
Shen-Chenhui
4ef813555c debug 2024-04-08 17:22:44 +08:00
Shen-Chenhui
827a0b2c55 debug 2024-04-08 17:21:26 +08:00
Shen-Chenhui
a634f7327e debug 2024-04-08 17:17:22 +08:00
Shen-Chenhui
a401630e7a debug 2024-04-08 17:03:27 +08:00
Shen-Chenhui
3eb96a2793 debug 2024-04-08 16:58:54 +08:00
Shen-Chenhui
dfae10e32b debug 2024-04-08 16:56:28 +08:00
Shen-Chenhui
78f9ec8710 debug 2024-04-08 16:54:27 +08:00
Shen-Chenhui
47b51ba07d debug 2024-04-08 16:53:18 +08:00
Shen-Chenhui
d927acc19d debug 2024-04-08 16:51:06 +08:00
Shen-Chenhui
475762cdb2 debug 2024-04-08 16:37:26 +08:00
Shen-Chenhui
c6181ebdfd debug 2024-04-08 16:36:54 +08:00
Shen-Chenhui
90fc92ee9c debug 2024-04-08 16:14:00 +08:00
Shen-Chenhui
ab1970ca24 debug 2024-04-08 16:05:15 +08:00
Shen-Chenhui
0a5c767fc7 debug 2024-04-08 15:54:03 +08:00
Shen-Chenhui
b2148d5505 debug 2024-04-08 15:52:25 +08:00
Shen-Chenhui
7e3951300f debug 2024-04-08 15:43:06 +08:00
Shen-Chenhui
fa0ca3983e debug inference code 2024-04-08 15:40:14 +08:00
Shen-Chenhui
4b27448a49 debug 2024-04-08 14:22:21 +08:00
Shen-Chenhui
1b66919091 debug 2024-04-08 10:53:44 +08:00
Shen-Chenhui
ad02d28d9c debug inference running loss 2024-04-08 10:22:07 +08:00
Frank Lee
27a627373a updated gradio app (#260) 2024-04-06 23:34:55 +08:00
Shen-Chenhui
e3584b4e43 debug 2024-04-05 16:20:39 +08:00
Shen-Chenhui
9091419b28 save vae to model dir 2024-04-05 16:07:23 +08:00
Shen-Chenhui
bc095aa492 debug 2024-04-04 15:26:56 +08:00
Shen-Chenhui
dfdf8db491 add inference config
add readme
complete inference script
TODO: DEBUG inference
2024-04-04 14:11:25 +08:00
Shen-Chenhui
b6c39873ee finish discriminator arch
use einops.rearrange in vae model_utils
add get_latent_size in vae_3d
inference vae_3d WIP
2024-04-03 10:29:01 +08:00
Zheng Zangwei (Alex Zheng)
7612d22fc6 Exp/image mix (#20)
* [exp] image mixed training

* [exp] add batch info

* [exp] launch

* update num_step_per_epoch

* [feat] verify image mix training
2024-04-02 13:35:09 +08:00
Shen-Chenhui
ebb3dc4d59 enable wandb 2024-04-02 11:47:56 +08:00
Shen-Chenhui
562a966a77 debug 2024-04-01 16:14:11 +08:00
Shen-Chenhui
3d6d499543 add perceptual loss 2024-04-01 13:57:25 +08:00
Zheng Zangwei (Alex Zheng)
f9f539f07e format and some fix (#8) 2024-03-30 13:34:19 +08:00
Hongxin Liu
5e82d1493b [feature] refactor sampler (#4)
* [feature] refactor sampler

* [feature] support sampler resuming

* [feature] support sampler resuming
2024-03-29 23:32:12 +08:00
Shen-Chenhui
d1e57cbfd5 debug 2024-03-29 10:30:56 +08:00
Shen-Chenhui
4795fd5354 debug 2024-03-28 23:27:26 +08:00
Zangwei Zheng
a01f6da20e [feat] support for stdit2 sampling 2024-03-28 21:35:33 +08:00
Shen-Chenhui
3692723b4b debug 2024-03-28 18:30:36 +08:00
shenchenhui
41f7b9b01e debug 2024-03-28 18:17:42 +08:00
shenchenhui
7f105cbdfa debug 2024-03-28 18:02:30 +08:00
Zangwei Zheng
cba02e8a58 [feat] dynamic for video (base_size not completed) 2024-03-28 17:58:36 +08:00
shenchenhui
716b3f3774 debug 2024-03-28 17:45:50 +08:00
shenchenhui
42101a5eee debug 2024-03-28 17:38:48 +08:00
shenchenhui
5935c5b10b debug 2024-03-28 17:36:31 +08:00
shenchenhui
3d9addba17 debug 2024-03-28 16:09:43 +08:00
shenchenhui
a5917c6a3e added vae3d training code 2024-03-28 15:12:20 +08:00
Zangwei Zheng
94e686177e [wip] multi ar 2024-03-27 23:04:12 +08:00
Zangwei Zheng
7392d2e551 [feat] multiple frames with 360p 2024-03-27 00:24:46 +08:00
Zangwei Zheng
552b7e8f79 register dataset 2024-03-26 17:02:41 +08:00
Zangwei Zheng
01728dc28a refactor datasets 2024-03-26 16:50:36 +08:00
Zangwei Zheng
81da63e0dd renaming 2024-03-23 22:24:01 +08:00
Zangwei Zheng
4e6e17d800 complete masked training 2024-03-23 22:06:19 +08:00
Zangwei Zheng
98e62a7c57 update inference z 2024-03-23 20:28:34 +08:00
Zangwei Zheng
7d27f5553e merge mask-related utils 2024-03-23 16:32:51 +08:00
Frank Lee
968c52c7ab [demo] fixed a config mismatch (#193) 2024-03-22 17:09:25 +08:00
Frank Lee
6b2715e40c Feature/gradio demo (#190)
* [gradio] added demo app

* polish
2024-03-22 15:07:04 +08:00
Frank Lee
14526479fc added sp for inference (#76) 2024-03-17 11:00:23 +08:00
Zheng Zangwei (Alex Zheng)
d851a85535 format (#69) 2024-03-15 22:16:20 +08:00
Frank Lee
c4c5d64e49 migrate some new files (#64) 2024-03-15 21:49:38 +08:00
Frank Lee
2f87a9af75 removed old files (#63) 2024-03-15 21:48:36 +08:00
ver217
0106e62555 [feature] impl adaln model arch 2024-03-07 20:04:57 +08:00
Frank Lee
9648d53d4d added latte sampling (#22)
* added latte sampling

* polish
2024-03-04 10:43:22 +08:00
Frank Lee
3bea560af3 fixed training script (#19) 2024-02-29 16:07:34 +08:00
Frank Lee
0c05cd2e9d refactored code into a package (#13) 2024-02-27 11:58:22 +08:00
Frank Lee
a33c656c80 added training scripts and readme (#12)
* added training scripts and readme

* polish
2024-02-26 17:47:19 +08:00
Hongxin Liu
6f887f453b [hotfix] fix vqvae output process (#9) 2024-02-23 13:11:24 +08:00
Frank Lee
da9b00e808 added dataset processing scripts (#8)
* added dataset processing scripts

* added dataset processing scripts
2024-02-23 11:26:28 +08:00