14 commits

Author SHA1 Message Date
Zangwei Zheng
3b85effe5a a bunch of update for data 2024-04-13 15:44:24 +08:00
Frank Lee
aa84ad4562 updated llava captioning (#45) 2024-04-09 15:53:27 +08:00
    * updated llava captioning
    * polish
Zangwei Zheng
4f171fd800 update caption 2024-04-07 00:48:03 +08:00
Frank Lee
6a86f20386 fixed llava image gen performance (#28) 2024-04-03 11:32:10 +08:00
Zangwei Zheng
5ade5e5984 [fix] use decord for tools 2024-04-01 16:45:23 +08:00
Zangwei Zheng
ff15a0acfb [docs] data processing 2024-04-01 16:08:53 +08:00
Zangwei Zheng
c553ee274f better aspect ratio 2024-04-01 14:27:13 +08:00
Zheng Zangwei (Alex Zheng)
f1ee27ba2f [feat] llava support image and text (#13) 2024-03-31 20:59:33 +08:00
    * [feat] llava support image and text
    * add resize for image
    * update gpt4 caption
    * update prompt for llava image captioning
Zheng Zangwei (Alex Zheng)
a289bced09 Dev/zangwei (#10) 2024-03-30 13:46:23 +08:00
    * format and some fix
    * support csv for llava
Zheng Zangwei (Alex Zheng)
f9f539f07e format and some fix (#8) 2024-03-30 13:34:19 +08:00
Zheng Zangwei (Alex Zheng)
2679bad9a0 update readme (#6) 2024-03-29 23:34:53 +08:00
Frank Lee
b704d6c0f8 Feature/llava speedup (#2) 2024-03-27 16:55:25 +08:00
    * [caption] accelerated llava with flash attention and parallel frame extraction
    * supported dp and tp in llava
    * code formatting
Zangwei Zheng
937f52f5bb add video captioning docs 2024-03-17 19:36:03 +08:00
Zangwei Zheng
921e3138b3 update docs 2024-03-17 15:47:48 +08:00