Commit graph

15 commits

Author SHA1 Message Date
Zangwei Zheng
4f171fd800 update caption 2024-04-07 00:48:03 +08:00
Zheng Zangwei (Alex Zheng)
c9b81d8fd6 add camera motion detection (#34) 2024-04-05 23:42:31 +08:00
Frank Lee
6a86f20386 fixed llava image gen performance (#28) 2024-04-03 11:32:10 +08:00
Zangwei Zheng
5ade5e5984 [fix] use decord for tools 2024-04-01 16:45:23 +08:00
Zangwei Zheng
ff15a0acfb [docs] data processing 2024-04-01 16:08:53 +08:00
Zangwei Zheng
c553ee274f better aspect ratio 2024-04-01 14:27:13 +08:00
Zangwei Zheng
cdd3b5eb74 llava fix and config modification 2024-03-31 23:44:37 +08:00
Zheng Zangwei (Alex Zheng)
f1ee27ba2f [feat] llava support image and text (#13) 2024-03-31 20:59:33 +08:00
* [feat] llava support image and text
* add resize for image
* update gpt4 caption
* update prompt for llava image captioning
Frank Lee
0f2bb1700b refactored llava captioning (#12) 2024-03-31 01:11:03 +08:00
Zheng Zangwei (Alex Zheng)
a289bced09 Dev/zangwei (#10) 2024-03-30 13:46:23 +08:00
* format and some fix
* support csv for llava
Zheng Zangwei (Alex Zheng)
f9f539f07e format and some fix (#8) 2024-03-30 13:34:19 +08:00
Zheng Zangwei (Alex Zheng)
2679bad9a0 update readme (#6) 2024-03-29 23:34:53 +08:00
Frank Lee
b704d6c0f8 Feature/llava speedup (#2) 2024-03-27 16:55:25 +08:00
* [caption] accelerated llava with flash attention and parallel frame extraction
* supported dp and tp in llava
* code formatting
Zangwei Zheng
937f52f5bb add video captioning docs 2024-03-17 19:36:03 +08:00
Zangwei Zheng
921e3138b3 update docs 2024-03-17 15:47:48 +08:00