Commit graph

39 commits

Author SHA1 Message Date
xyupeng
d6a4aeda74 update scoring 2024-04-02 11:13:41 +08:00
xyupeng
ca53b1a21b update data processing 2024-04-02 10:51:40 +08:00
xyupeng
833c70f2ae add optical flow 2024-04-01 20:00:27 +08:00
Zangwei Zheng
5ade5e5984 [fix] use decord for tools 2024-04-01 16:45:23 +08:00
Zangwei Zheng
ff15a0acfb [docs] data processing 2024-04-01 16:08:53 +08:00
Zangwei Zheng
c553ee274f better aspect ratio 2024-04-01 14:27:13 +08:00
Zangwei Zheng
4bfb90295b update csvutil 2024-04-01 11:45:38 +08:00
Zangwei Zheng
cdd3b5eb74 llava fix and config modification 2024-03-31 23:44:37 +08:00
Zheng Zangwei (Alex Zheng)
f1ee27ba2f [feat] llava support image and text (#13) 2024-03-31 20:59:33 +08:00
* [feat] llava support image and text
* add resize for image
* update gpt4 caption
* update prompt for llava image captioning
Frank Lee
0f2bb1700b refactored llava captioning (#12) 2024-03-31 01:11:03 +08:00
Zangwei Zheng
fb6bd34443 delete intepolate 2024-03-30 23:57:03 +08:00
Zangwei Zheng
987283fa1b [fix] transform may not fit enough 2024-03-30 17:05:15 +08:00
Zangwei Zheng
b44acc022c support csv for llava 2024-03-30 13:45:24 +08:00
Zangwei Zheng
52aeb3769f format and some fix 2024-03-30 12:02:48 +08:00
Frank Lee
223cc34c46 Merge pull request #7 from hpcaitech/dev/variable 2024-03-30 10:57:07 +08:00
Dev/variable
Zheng Zangwei (Alex Zheng)
2679bad9a0 update readme (#6) 2024-03-29 23:34:53 +08:00
Zheng Zangwei (Alex Zheng)
682a699aec Update image process (#5) 2024-03-29 23:34:10 +08:00
* [docs] update tool docs
* update aes
Zangwei Zheng
2d4a5df287 [wip] image wrong with flashattn 2024-03-28 22:04:43 +08:00
Zangwei Zheng
a01f6da20e [feat] support for stdit2 sampling 2024-03-28 21:35:33 +08:00
Frank Lee
b704d6c0f8 Feature/llava speedup (#2) 2024-03-27 16:55:25 +08:00
* [caption] accelerated llava with flash attention and parallel frame extraction
* supported dp and tp in llava
* code formatting
Zangwei Zheng
7392d2e551 [feat] multiple frames with 360p 2024-03-27 00:24:46 +08:00
Zangwei Zheng
4d3b68e3ad accelerate aesthetic 2024-03-25 20:54:02 +08:00
Zangwei Zheng
13423c57a7 train dataset support new format 2024-03-25 18:36:56 +08:00
Zangwei Zheng
6140f1bbba update csvutil 2024-03-25 17:08:35 +08:00
Zangwei Zheng
5e3eca2d0f add parallel pandas 2024-03-25 15:36:32 +08:00
Zangwei Zheng
3c5ce7f743 complete data format 2024-03-25 15:12:18 +08:00
Zangwei Zheng
b1f6e128dc update convert_dataset and launch exp 2024-03-24 22:03:31 +08:00
Zangwei Zheng
0344b24264 [feat] add aesthetic score 2024-03-24 20:34:41 +08:00
Zangwei Zheng
bc27737f7c [feat] add more csv filter 2024-03-24 16:58:16 +08:00
Zangwei Zheng
24c707bfb9 update csvutil for vidprom 2024-03-23 16:02:26 +08:00
pxy
afc1b8ace0 update scene detect README 2024-03-17 23:23:29 +08:00
Zangwei Zheng
a68c5e0843 add todo 2024-03-17 22:54:00 +08:00
Zangwei Zheng
e2856e6397 add datasets doc 2024-03-17 20:09:58 +08:00
Zangwei Zheng
937f52f5bb add video captioning docs 2024-03-17 19:36:03 +08:00
Zangwei Zheng
921e3138b3 update docs 2024-03-17 15:47:48 +08:00
Zangwei Zheng
8265a7f086 rename 2024-03-17 11:00:50 +08:00
Zangwei Zheng
f41ac61290 update docs 2024-03-16 22:48:54 +08:00
Zheng Zangwei (Alex Zheng)
d851a85535 format (#69) 2024-03-15 22:16:20 +08:00
Zheng Zangwei (Alex Zheng)
0ba56f5309 first commit (#67) 2024-03-15 22:06:36 +08:00