Commit graph

67 commits

Author SHA1 Message Date
Zangwei Zheng
4f171fd800 update caption 2024-04-07 00:48:03 +08:00
xyupeng
c4bf35727c Dev/pxy (#38)
* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scene_cut

* update scene_cut

* update scene_cut[A

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* update readme

* update readme
2024-04-06 16:45:59 +08:00
xyupeng
7da8be989a Dev/pxy (#37)
* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scene_cut

* update scene_cut

* update scene_cut[A

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* update readme
2024-04-06 16:40:48 +08:00
xyupeng
0dfea90a89 Dev/pxy (#36)
update scoring & scene_cut
2024-04-06 16:33:06 +08:00
Zheng Zangwei (Alex Zheng)
c9b81d8fd6 add camera motion detection (#34) 2024-04-05 23:42:31 +08:00
Shen-Chenhui
0d7c226ce6 add remove duplicates by path function 2024-04-05 17:28:17 +08:00
Shen-Chenhui
f3c417a61e add filter by flow score function 2024-04-05 14:48:46 +08:00
Shen-Chenhui
d8203d5cc1 add intersect func 2024-04-05 11:21:00 +08:00
Zangwei Zheng
57a1e7f6c4 format tools 2024-04-05 10:00:30 +08:00
Frank Lee
34b88c5260 fixed aesthetic scoring (#33) 2024-04-04 17:24:54 +08:00
Frank Lee
3bbee00436 accelerate aesthetic scoring (#32)
* accelerate aesthetic scoring

* polish
2024-04-04 16:03:43 +08:00
Zheng Zangwei (Alex Zheng)
1bb49f75e7 update frame extraction (#31) 2024-04-04 15:34:26 +08:00
Frank Lee
04c502f859 added checks for video corruption (#30) 2024-04-04 14:51:36 +08:00
Zangwei Zheng
3fcda91877 Merge branch 'dev/v1.0.1' of https://github.com/hpcaitech/Open-Sora-dev into dev/v1.0.1 2024-04-03 15:15:11 +08:00
Zangwei Zheng
12dcbeee8e update csvutil & mask 2024-04-03 15:15:04 +08:00
Hongxin Liu
cabf1d7746 [feature] support dp for matching (#29) 2024-04-03 14:42:59 +08:00
Zangwei Zheng
550ea32fad update csvutil 2024-04-03 13:33:46 +08:00
Frank Lee
6a86f20386 fixed llava image gen performance (#28) 2024-04-03 11:32:10 +08:00
Hongxin Liu
c18ad83f01 [hotfix] fix caption processing (#27) 2024-04-03 11:32:01 +08:00
xyupeng
b66a6e39d1 Dev/pxy (#26)
* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching
2024-04-02 21:20:39 +08:00
xyupeng
35989f54d6 Dev/pxy (#24)
* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching
2024-04-02 21:01:27 +08:00
Hongxin Liu
810ebeeb6a [feature] support dp for optical flow scoring (#23) 2024-04-02 19:04:33 +08:00
Zangwei Zheng
065774a501 [feat] resize images 2024-04-02 16:55:03 +08:00
Zangwei Zheng
00c20261f0 update data pipeline 2024-04-02 15:55:58 +08:00
Hongxin Liu
c618f77bdc [feature] move t5 text processing to csvutil (#22)
* [feature] move t5 text processing to csvutil

* polish code
2024-04-02 14:56:44 +08:00
Zheng Zangwei (Alex Zheng)
b5414b36b8 Dev/datapipe (#21)
* fix #210

* fix #209

* fix #188

* [docs] add training order

* update data pipeline

---------

Co-authored-by: Sze-qq <68757353+Sze-qq@users.noreply.github.com>
2024-04-02 14:51:21 +08:00
xyupeng
72de650382 update scoring 2024-04-02 11:30:52 +08:00
xyupeng
ef2834cf36 update scoring 2024-04-02 11:14:53 +08:00
xyupeng
d6a4aeda74 update scoring 2024-04-02 11:13:41 +08:00
xyupeng
ca53b1a21b update data processing 2024-04-02 10:51:40 +08:00
xyupeng
833c70f2ae add optical flow 2024-04-01 20:00:27 +08:00
Zangwei Zheng
5ade5e5984 [fix] use decord for tools 2024-04-01 16:45:23 +08:00
Zangwei Zheng
ff15a0acfb [docs] data processing 2024-04-01 16:08:53 +08:00
Zangwei Zheng
c553ee274f better aspect ratio 2024-04-01 14:27:13 +08:00
Zangwei Zheng
4bfb90295b update csvutil 2024-04-01 11:45:38 +08:00
Zangwei Zheng
cdd3b5eb74 llava fix and config modification 2024-03-31 23:44:37 +08:00
Zheng Zangwei (Alex Zheng)
f1ee27ba2f [feat] llava support image and text (#13)
* [feat] llava support image and text

* add resize for image

* update gpt4 caption

* update prompt for llava image captioning
2024-03-31 20:59:33 +08:00
Frank Lee
0f2bb1700b refactored llava captioning (#12) 2024-03-31 01:11:03 +08:00
Zangwei Zheng
fb6bd34443 delete intepolate 2024-03-30 23:57:03 +08:00
Zangwei Zheng
987283fa1b [fix] transform may not fit enough 2024-03-30 17:05:15 +08:00
Zangwei Zheng
b44acc022c support csv for llava 2024-03-30 13:45:24 +08:00
Zangwei Zheng
52aeb3769f format and some fix 2024-03-30 12:02:48 +08:00
Frank Lee
223cc34c46 Merge pull request #7 from hpcaitech/dev/variable
Dev/variable
2024-03-30 10:57:07 +08:00
Zheng Zangwei (Alex Zheng)
2679bad9a0 update readme (#6) 2024-03-29 23:34:53 +08:00
Zheng Zangwei (Alex Zheng)
682a699aec Update image process (#5)
* [docs] update tool docs

* update aes
2024-03-29 23:34:10 +08:00
Zangwei Zheng
2d4a5df287 [wip] image wrong with flashattn 2024-03-28 22:04:43 +08:00
Zangwei Zheng
a01f6da20e [feat] support for stdit2 sampling 2024-03-28 21:35:33 +08:00
Frank Lee
b704d6c0f8 Feature/llava speedup (#2)
* [caption] accelerated llava with flash attention and parallel frame extraction

* supported dp and tp in llava

* code formatting
2024-03-27 16:55:25 +08:00
Zangwei Zheng
7392d2e551 [feat] multiple frames with 360p 2024-03-27 00:24:46 +08:00
Zangwei Zheng
4d3b68e3ad accelerate aesthetic 2024-03-25 20:54:02 +08:00