Zheng Zangwei (Alex Zheng)
|
341b12f9bf
|
Dev/v1.0.1 (#58)
* update (#57)
* update
* update datautil
* add VBench prompt
* update eval
* update eval
* update interpolation
* add vbench eval
* Dev/sdedit implementation (#56)
* Update utils.py
* update
* update
* update
---------
Co-authored-by: YuKun Zhou <90625606+1zeryu@users.noreply.github.com>
|
2024-04-20 21:23:10 +08:00 |
|
Hongxin Liu
|
bdb9b7ccbf
|
[hotfix] fix attn impl (#17)
|
2024-04-01 20:37:15 +08:00 |
|
Zheng Zangwei (Alex Zheng)
|
f9f539f07e
|
format and some fix (#8)
|
2024-03-30 13:34:19 +08:00 |
|
Frank Lee
|
a0bdaced4e
|
replaced numpy ops with torch ops (#9)
* replaced numpy ops with torch ops
* polish
* polish
|
2024-03-30 13:01:58 +08:00 |
|
Hongxin Liu
|
8d5278b99a
|
[feature] impl cached pos embedding (#1)
* [feature] impl cached pos embedding
* [feature] update pos emb
* [feature] update pos emb
|
2024-03-28 17:30:47 +08:00 |
|
Frank Lee
|
14526479fc
|
added sp for inference (#76)
|
2024-03-17 11:00:23 +08:00 |
|
Zheng Zangwei (Alex Zheng)
|
d851a85535
|
format (#69)
|
2024-03-15 22:16:20 +08:00 |
|
Frank Lee
|
c4c5d64e49
|
migrate some new files (#64)
|
2024-03-15 21:49:38 +08:00 |
|
Frank Lee
|
2f87a9af75
|
removed old files (#63)
|
2024-03-15 21:48:36 +08:00 |
|
Hongxin Liu
|
91275b2b5e
|
[feature] impl fastseq-style seq parallel (#21)
* [feature] add fastseq-style sp attn
* [feature] add overlap fastseq
* [test] add test for self attn
* [feature] update dit model to fit fastseq
* [polish] refactor attn
* [feature] update train & benchmark script
* [polish] update benchmark script
* [polish] update benchmark script
|
2024-03-01 17:31:59 +08:00 |
|
Hongxin Liu
|
97c089daec
|
[feature] impl ulysses-style seq parallel (#20)
* [feature] add ulysses style sp attn
* [test] add sp attn test
* [feature] add zero sp plugin
* [hotfix] fix sp backward
* [test] add test for dit model
|
2024-03-01 14:42:06 +08:00 |
|