Commit graph

269 commits

Author SHA1 Message Date
Zheng Zangwei (Alex Zheng)
1f99b9ba4b complete eval pipeline (#53) 2024-04-18 15:49:14 +08:00
Shen-Chenhui
6ecc858350 pipeline config for trial data 2024-04-18 15:06:28 +08:00
Shen-Chenhui
246eddecdd enable pipeline vae training 2024-04-18 14:13:56 +08:00
Shen-Chenhui
aba1de6eb3 add pipeline 2024-04-18 13:49:54 +08:00
Zangwei Zheng
ee2e4083e4 [feat] update eval 2024-04-17 17:35:44 +08:00
Zangwei Zheng
8cb6d2f0bd [feat] update eval 2024-04-17 17:05:08 +08:00
Shen-Chenhui
95517d7fb5 inference v2 working 2024-04-16 18:30:31 +08:00
Shen-Chenhui
becff9d86f inference v2 2024-04-16 18:15:12 +08:00
Frank Lee
79dabf8bdf added seed to dataloader args (#52) 2024-04-16 17:45:53 +08:00
Shen-Chenhui
aae08a2841 debug 2024-04-16 17:31:57 +08:00
Shen-Chenhui
235d195dac add logvar 2024-04-16 17:25:48 +08:00
Shen-Chenhui
36b754b047 debug 2024-04-16 15:06:17 +08:00
Shen-Chenhui
afd3f823d4 debug 2024-04-16 15:00:31 +08:00
Frank Lee
db94454645 Feature/timeout (#50)
* added large nccl timeout

* polish

* polish
2024-04-15 23:55:58 +08:00
Shen-Chenhui
af1b1e484d reimplement blurpool 2024-04-15 17:47:33 +08:00
Shen-Chenhui
686829969f debug 2024-04-15 15:13:29 +08:00
Zangwei Zheng
63e86f6dd6 hotfix inference 2024-04-15 13:44:30 +08:00
Shen-Chenhui
c71e04daaa add lecam and gradient penalty loss to discriminator 2024-04-15 11:13:39 +08:00
Zangwei Zheng
e88185fb9f update notebook 2024-04-14 02:02:59 +08:00
Shen-Chenhui
79bff13099 debug 2024-04-13 10:51:56 +08:00
Shen-Chenhui
fb0f59171c debug 2024-04-13 09:36:39 +08:00
Shen-Chenhui
06aa4589f2 debug 2024-04-12 18:39:31 +08:00
Shen-Chenhui
dbd3982ee8 debug 2024-04-12 18:37:28 +08:00
Shen-Chenhui
28f3f4b597 debug 2024-04-12 18:30:40 +08:00
Shen-Chenhui
c80459b72f debug 2024-04-12 18:27:51 +08:00
Shen-Chenhui
f905a1b69d debug 2024-04-12 18:16:22 +08:00
Shen-Chenhui
d53245a98c debug 2024-04-12 18:14:18 +08:00
Shen-Chenhui
de9fffacfd gan training 2024-04-12 17:57:35 +08:00
Shen-Chenhui
e004d43f69 gan optimizer 2024-04-12 15:10:19 +08:00
Shen-Chenhui
ab59033f20 gan 2024-04-12 12:42:02 +08:00
Shen-Chenhui
63f3737eb0 add simple discriminator 2024-04-11 15:14:16 +08:00
Hongxin Liu
2393c63083 [feature] add batch size search script (#47) 2024-04-11 14:23:13 +08:00
Shen-Chenhui
ee58ec16b4 add image encoding support 2024-04-11 10:50:23 +08:00
Shen-Chenhui
a4025b0ea7 magvit v2 enc dec arc 2024-04-09 17:49:01 +08:00
Shen-Chenhui
ae6bffb8c5 restore train vae script 2024-04-09 09:34:59 +08:00
Shen-Chenhui
a1808be460 debug 2024-04-08 18:18:05 +08:00
Shen-Chenhui
4861e6beaf debug 2024-04-08 18:13:15 +08:00
Shen-Chenhui
9f211646f3 debug 2024-04-08 18:11:27 +08:00
Shen-Chenhui
ab38a411a8 debug 2024-04-08 18:07:50 +08:00
Shen-Chenhui
55821937d9 debug 2024-04-08 18:04:32 +08:00
Shen-Chenhui
a485925dbd debug 2024-04-08 18:02:02 +08:00
Shen-Chenhui
45ea2bd29d debug 2024-04-08 18:00:30 +08:00
Shen-Chenhui
e151b64319 debug 2024-04-08 17:58:33 +08:00
Shen-Chenhui
dba5de3d90 debug 2024-04-08 17:54:31 +08:00
Shen-Chenhui
bbeacb491d debug 2024-04-08 17:50:41 +08:00
Shen-Chenhui
0d497994cd debug 2024-04-08 17:42:27 +08:00
Shen-Chenhui
ce0b1928e6 debug 2024-04-08 17:41:08 +08:00
Shen-Chenhui
08fac69c38 debug 2024-04-08 17:39:19 +08:00
Shen-Chenhui
616e2b4fe5 debug 2024-04-08 17:37:29 +08:00
Shen-Chenhui
04a0e4769b debug 2024-04-08 17:33:54 +08:00
Shen-Chenhui
fb64216b45 debug 2024-04-08 17:32:30 +08:00
Shen-Chenhui
25707a9d7d debug 2024-04-08 17:28:31 +08:00
Shen-Chenhui
9c8d084ec5 debug 2024-04-08 17:26:36 +08:00
Shen-Chenhui
eae30f9f89 debug 2024-04-08 17:23:53 +08:00
Shen-Chenhui
4ef813555c debug 2024-04-08 17:22:44 +08:00
Shen-Chenhui
827a0b2c55 debug 2024-04-08 17:21:26 +08:00
Shen-Chenhui
a634f7327e debug 2024-04-08 17:17:22 +08:00
Shen-Chenhui
a401630e7a debug 2024-04-08 17:03:27 +08:00
Shen-Chenhui
3eb96a2793 debug 2024-04-08 16:58:54 +08:00
Shen-Chenhui
dfae10e32b debug 2024-04-08 16:56:28 +08:00
Shen-Chenhui
78f9ec8710 debug 2024-04-08 16:54:27 +08:00
Shen-Chenhui
47b51ba07d debug 2024-04-08 16:53:18 +08:00
Shen-Chenhui
d927acc19d debug 2024-04-08 16:51:06 +08:00
Shen-Chenhui
475762cdb2 debug 2024-04-08 16:37:26 +08:00
Shen-Chenhui
c6181ebdfd debug 2024-04-08 16:36:54 +08:00
Shen-Chenhui
90fc92ee9c debug 2024-04-08 16:14:00 +08:00
Shen-Chenhui
ab1970ca24 debug 2024-04-08 16:05:15 +08:00
Shen-Chenhui
0a5c767fc7 debug 2024-04-08 15:54:03 +08:00
Shen-Chenhui
b2148d5505 debug 2024-04-08 15:52:25 +08:00
Shen-Chenhui
7e3951300f debug 2024-04-08 15:43:06 +08:00
Shen-Chenhui
fa0ca3983e debug inference code 2024-04-08 15:40:14 +08:00
Shen-Chenhui
4b27448a49 debug 2024-04-08 14:22:21 +08:00
Shen-Chenhui
1b66919091 debug 2024-04-08 10:53:44 +08:00
Shen-Chenhui
ad02d28d9c debug inference running loss 2024-04-08 10:22:07 +08:00
Frank Lee
27a627373a updated gradio app (#260) 2024-04-06 23:34:55 +08:00
Shen-Chenhui
e3584b4e43 debug 2024-04-05 16:20:39 +08:00
Shen-Chenhui
9091419b28 save vae to model dir 2024-04-05 16:07:23 +08:00
Shen-Chenhui
bc095aa492 debug 2024-04-04 15:26:56 +08:00
Shen-Chenhui
dfdf8db491 add inference config
add readme
complete inference script
TODO: DEBUG inference
2024-04-04 14:11:25 +08:00
Shen-Chenhui
b6c39873ee finish discriminator arch
use einops.rearrange in vae model_utils
add get_latent_size in vae_3d
inference vae_3d WIP
2024-04-03 10:29:01 +08:00
Zheng Zangwei (Alex Zheng)
7612d22fc6 Exp/image mix (#20)
* [exp] image mixed training

* [exp] add batch info

* [exp] launch

* update num_step_per_epoch

* [feat] verify image mix training
2024-04-02 13:35:09 +08:00
Shen-Chenhui
ebb3dc4d59 enable wandb 2024-04-02 11:47:56 +08:00
Shen-Chenhui
562a966a77 debug 2024-04-01 16:14:11 +08:00
Shen-Chenhui
3d6d499543 add perceptual loss 2024-04-01 13:57:25 +08:00
Zheng Zangwei (Alex Zheng)
f9f539f07e format and some fix (#8) 2024-03-30 13:34:19 +08:00
Hongxin Liu
5e82d1493b [feature] refactor sampler (#4)
* [feature] refactor sampler

* [feature] support sampler resuming

* [feature] support sampler resuming
2024-03-29 23:32:12 +08:00
Shen-Chenhui
d1e57cbfd5 debug 2024-03-29 10:30:56 +08:00
Shen-Chenhui
4795fd5354 debug 2024-03-28 23:27:26 +08:00
Zangwei Zheng
a01f6da20e [feat] support for stdit2 sampling 2024-03-28 21:35:33 +08:00
Shen-Chenhui
3692723b4b debug 2024-03-28 18:30:36 +08:00
shenchenhui
41f7b9b01e debug 2024-03-28 18:17:42 +08:00
shenchenhui
7f105cbdfa debug 2024-03-28 18:02:30 +08:00
Zangwei Zheng
cba02e8a58 [feat] dynamic for video (base_size not completed) 2024-03-28 17:58:36 +08:00
shenchenhui
716b3f3774 debug 2024-03-28 17:45:50 +08:00
shenchenhui
42101a5eee debug 2024-03-28 17:38:48 +08:00
shenchenhui
5935c5b10b debug 2024-03-28 17:36:31 +08:00
shenchenhui
3d9addba17 debug 2024-03-28 16:09:43 +08:00
shenchenhui
a5917c6a3e added vae3d training code 2024-03-28 15:12:20 +08:00
Zangwei Zheng
94e686177e [wip] multi ar 2024-03-27 23:04:12 +08:00
Zangwei Zheng
7392d2e551 [feat] multiple frames with 360p 2024-03-27 00:24:46 +08:00
Zangwei Zheng
552b7e8f79 register dataset 2024-03-26 17:02:41 +08:00
Zangwei Zheng
01728dc28a refactor datasets 2024-03-26 16:50:36 +08:00
Zangwei Zheng
81da63e0dd renaming 2024-03-23 22:24:01 +08:00
Zangwei Zheng
4e6e17d800 complete masked training 2024-03-23 22:06:19 +08:00
Zangwei Zheng
98e62a7c57 update inference z 2024-03-23 20:28:34 +08:00
Zangwei Zheng
7d27f5553e merge mask-related utils 2024-03-23 16:32:51 +08:00
Frank Lee
968c52c7ab [demo] fixed a config mismatch (#193) 2024-03-22 17:09:25 +08:00
Frank Lee
6b2715e40c Feature/gradio demo (#190)
* [gradio] added demo app

* polish
2024-03-22 15:07:04 +08:00
Frank Lee
14526479fc added sp for inference (#76) 2024-03-17 11:00:23 +08:00
Zheng Zangwei (Alex Zheng)
d851a85535 format (#69) 2024-03-15 22:16:20 +08:00
Frank Lee
c4c5d64e49 migrate some new files (#64) 2024-03-15 21:49:38 +08:00
Frank Lee
2f87a9af75
removed old files (#63) 2024-03-15 21:48:36 +08:00
ver217
0106e62555 [feature] impl adaln model arch 2024-03-07 20:04:57 +08:00
Frank Lee
9648d53d4d
added latte sampling (#22)
* added latte sampling

* polish
2024-03-04 10:43:22 +08:00
Frank Lee
3bea560af3
fixed training script (#19) 2024-02-29 16:07:34 +08:00
Frank Lee
0c05cd2e9d
refactored code into a package (#13) 2024-02-27 11:58:22 +08:00
Frank Lee
a33c656c80
added training scripts and readme (#12)
* added training scripts and readme

* polish
2024-02-26 17:47:19 +08:00
Hongxin Liu
6f887f453b
[hotfix] fix vqvae output process (#9) 2024-02-23 13:11:24 +08:00
Frank Lee
da9b00e808
added dataset processing scripts (#8)
* added dataset processing scripts

* added dataset processing scripts
2024-02-23 11:26:28 +08:00