Commit graph

25 commits

Author SHA1 Message Date
Zheng Zangwei (Alex Zheng)
cff6298f6f update caption readme (#331) 2024-04-26 18:12:12 +08:00
Zangwei Zheng
8513ff8734 v1.1 training preparation 2024-04-14 00:20:00 +08:00
Zangwei Zheng
f5e347e08e merge 2024-04-13 15:47:42 +08:00
Zangwei Zheng
3b85effe5a a bunch of update for data 2024-04-13 15:44:24 +08:00
xyupeng
4e0f97f374 Dev/pxy (#48)
* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* update readme

* update readme

* extract frames using opencv everywhere
2024-04-13 15:43:19 +08:00
Frank Lee
aa84ad4562 updated llava captioning (#45)
* updated llava captioning

* polish
2024-04-09 15:53:27 +08:00
Frank Lee
65cb8eb67c added support for llava mistral (#44) 2024-04-08 16:43:32 +08:00
Frank Lee
f0d4d55ed7 fixed captioning data format check (#42) 2024-04-08 10:38:14 +08:00
Frank Lee
11978080f4 enabled uneven data sharding for captioning (#41)
* enabled uneven data sharding for captioning

* poslih
2024-04-08 10:07:43 +08:00
Frank Lee
f7324500d3 added prompt and data type check (#40) 2024-04-07 15:52:01 +08:00
Zangwei Zheng
4f171fd800 update caption 2024-04-07 00:48:03 +08:00
Zheng Zangwei (Alex Zheng)
c9b81d8fd6 add camera motion detection (#34) 2024-04-05 23:42:31 +08:00
Frank Lee
6a86f20386 fixed llava image gen performance (#28) 2024-04-03 11:32:10 +08:00
Zangwei Zheng
5ade5e5984 [fix] use decord for tools 2024-04-01 16:45:23 +08:00
Zangwei Zheng
ff15a0acfb [docs] data processing 2024-04-01 16:08:53 +08:00
Zangwei Zheng
c553ee274f better aspect ratio 2024-04-01 14:27:13 +08:00
Zangwei Zheng
cdd3b5eb74 llava fix and config modification 2024-03-31 23:44:37 +08:00
Zheng Zangwei (Alex Zheng)
f1ee27ba2f [feat] llava support image and text (#13)
* [feat] llava support image and text

* add resize for image

* update gpt4 caption

* update prompt for llava image captioning
2024-03-31 20:59:33 +08:00
Frank Lee
0f2bb1700b refactored llava captioning (#12) 2024-03-31 01:11:03 +08:00
Zheng Zangwei (Alex Zheng)
a289bced09 Dev/zangwei (#10)
* format and some fix

* support csv for llava
2024-03-30 13:46:23 +08:00
Zheng Zangwei (Alex Zheng)
f9f539f07e format and some fix (#8) 2024-03-30 13:34:19 +08:00
Zheng Zangwei (Alex Zheng)
2679bad9a0 update readme (#6) 2024-03-29 23:34:53 +08:00
Frank Lee
b704d6c0f8 Feature/llava speedup (#2)
* [caption] accelerated llava with flash attention and parallel frame extraction

* supported dp and tp in llava

* code formatting
2024-03-27 16:55:25 +08:00
Zangwei Zheng
937f52f5bb add video captioning docs 2024-03-17 19:36:03 +08:00
Zangwei Zheng
921e3138b3 update docs 2024-03-17 15:47:48 +08:00