Commit graph

15 commits

Author SHA1 Message Date
zhengzangw
525e29abbe reformat and update docs 2024-06-17 15:37:23 +00:00
Frank Lee
a887f54711 added sp for stdit3 (#131) 2024-06-14 11:10:43 +08:00
Frank Lee
ea96cdb5e4 added warmup lr scheduler (#121) 2024-06-05 10:12:52 +08:00
Zangwei Zheng
917643d5bd [wip] 2024-05-07 14:51:11 +08:00
Zheng Zangwei (Alex Zheng)
341b12f9bf Dev/v1.0.1 (#58)
* update (#57)

* update

* update datautil

* add VBench prompt

* update eval

* update eval

* update intepolation

* add vbench eval

* Dev/sdedit implementation (#56)

* Update utils.py

* update

* update

* update

---------

Co-authored-by: YuKun Zhou <90625606+1zeryu@users.noreply.github.com>
2024-04-20 21:23:10 +08:00
Hongxin Liu
bdb9b7ccbf [hotfix] fix attn impl (#17) 2024-04-01 20:37:15 +08:00
Zheng Zangwei (Alex Zheng)
f9f539f07e format and some fix (#8) 2024-03-30 13:34:19 +08:00
Frank Lee
a0bdaced4e replaced numpy ops with torch ops (#9)
* replaced numpy ops with torch ops

* polish

* polish
2024-03-30 13:01:58 +08:00
Hongxin Liu
8d5278b99a [feature] impl cached pos embedding (#1)
* [feature] impl cached pos embedding

* [feature] update pos emb

* [feature] update pos emb
2024-03-28 17:30:47 +08:00
Frank Lee
14526479fc added sp for inference (#76) 2024-03-17 11:00:23 +08:00
Zheng Zangwei (Alex Zheng)
d851a85535 format (#69) 2024-03-15 22:16:20 +08:00
Frank Lee
c4c5d64e49 migrate some new files (#64) 2024-03-15 21:49:38 +08:00
Frank Lee
2f87a9af75
removed old files (#63) 2024-03-15 21:48:36 +08:00
Hongxin Liu
91275b2b5e
[feature] impl fastseq-style seq parallel (#21)
* [feature] add fastseq-style sp attn

* [feature] add overlap fastseq

* [test] add test for self attn

* [feature] update dit model to fit fastseq

* [polish] refactor attn

* [feature] update train & benchmark script

* [polish] update benchmark script

* [polish] update benchmark script
2024-03-01 17:31:59 +08:00
Hongxin Liu
97c089daec
[feature] impl ulysses-style seq parallel (#20)
* [feature] add ulysses style sp attn

* [test] add sp attn test

* [feature] add zero sp plugin

* [hotfix] fix sp backward

* [test] add test for dit model
2024-03-01 14:42:06 +08:00