Commit graph

36 commits

Author SHA1 Message Date
Tom Young
60c265d165 remove debug artifacts 2024-06-02 04:59:09 +00:00
Tom Young
880d380a21 faster pllava 2024-05-27 01:11:10 +00:00
Shen Chenhui
a57c6c71bb Feature/installation (#109)
* format

* format

* caption environment

* format

* update setuptools

* format

* format

* reformat

* format

* fix launch error

* relax version requirements

* change logdir name

---------

Co-authored-by: Shen-Chenhui <shen_chenhui@u.nus.edu>
2024-05-23 17:20:33 +08:00
Tom Young
c279d0f8f0 Pllava captioning 2024-05-15 07:19:12 +00:00
Zangwei Zheng
4f27cb51fc Merge branch 'main' into dev/v1.2 2024-05-09 16:49:24 +08:00
tianzedavid
abd0af8a89 chore: fix some comments (#355) 2024-05-09 16:19:42 +08:00
Fangdong Wu
9e5959c2f9 revise the package import error in tools/caption/caption_llava.py (#357) 2024-05-09 16:19:19 +08:00
FrankLeeeee
ec88d2d54e resolved conflicts 2024-05-07 03:08:41 +00:00
Tom Young
d3bc7c86a6 put back EXTENSIONS 2024-05-02 01:20:33 +00:00
Zheng Zangwei (Alex Zheng)
cff6298f6f update caption readme (#331) 2024-04-26 18:12:12 +08:00
Tom Young
335e8a3eed new camera motion detection (#65) 2024-04-26 10:45:42 +08:00
YuKun Zhou
343568985b Llama3 keyword extraction (#70)
* Create caption_llama3.py

* Update caption_llama3.py
2024-04-26 10:32:28 +08:00
Zangwei Zheng
8513ff8734 v1.1 training preparation 2024-04-14 00:20:00 +08:00
Zangwei Zheng
f5e347e08e merge 2024-04-13 15:47:42 +08:00
Zangwei Zheng
3b85effe5a a bunch of update for data 2024-04-13 15:44:24 +08:00
xyupeng
4e0f97f374 Dev/pxy (#48)
* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scoring/matching

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* update scene_cut

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* m

* update readme

* update readme

* extract frames using opencv everywhere
2024-04-13 15:43:19 +08:00
Frank Lee
aa84ad4562 updated llava captioning (#45)
* updated llava captioning

* polish
2024-04-09 15:53:27 +08:00
Frank Lee
65cb8eb67c added support for llava mistral (#44) 2024-04-08 16:43:32 +08:00
Frank Lee
f0d4d55ed7 fixed captioning data format check (#42) 2024-04-08 10:38:14 +08:00
Frank Lee
11978080f4 enabled uneven data sharding for captioning (#41)
* enabled uneven data sharding for captioning

* polish
2024-04-08 10:07:43 +08:00
Frank Lee
f7324500d3 added prompt and data type check (#40) 2024-04-07 15:52:01 +08:00
Zangwei Zheng
4f171fd800 update caption 2024-04-07 00:48:03 +08:00
Zheng Zangwei (Alex Zheng)
c9b81d8fd6 add camera motion detection (#34) 2024-04-05 23:42:31 +08:00
Frank Lee
6a86f20386 fixed llava image gen performance (#28) 2024-04-03 11:32:10 +08:00
Zangwei Zheng
5ade5e5984 [fix] use decord for tools 2024-04-01 16:45:23 +08:00
Zangwei Zheng
ff15a0acfb [docs] data processing 2024-04-01 16:08:53 +08:00
Zangwei Zheng
c553ee274f better aspect ratio 2024-04-01 14:27:13 +08:00
Zangwei Zheng
cdd3b5eb74 llava fix and config modification 2024-03-31 23:44:37 +08:00
Zheng Zangwei (Alex Zheng)
f1ee27ba2f [feat] llava support image and text (#13)
* [feat] llava support image and text

* add resize for image

* update gpt4 caption

* update prompt for llava image captioning
2024-03-31 20:59:33 +08:00
Frank Lee
0f2bb1700b refactored llava captioning (#12) 2024-03-31 01:11:03 +08:00
Zheng Zangwei (Alex Zheng)
a289bced09 Dev/zangwei (#10)
* format and some fix

* support csv for llava
2024-03-30 13:46:23 +08:00
Zheng Zangwei (Alex Zheng)
f9f539f07e format and some fix (#8) 2024-03-30 13:34:19 +08:00
Zheng Zangwei (Alex Zheng)
2679bad9a0 update readme (#6) 2024-03-29 23:34:53 +08:00
Frank Lee
b704d6c0f8 Feature/llava speedup (#2)
* [caption] accelerated llava with flash attention and parallel frame extraction

* supported dp and tp in llava

* code formatting
2024-03-27 16:55:25 +08:00
Zangwei Zheng
937f52f5bb add video captioning docs 2024-03-17 19:36:03 +08:00
Zangwei Zheng
921e3138b3 update docs 2024-03-17 15:47:48 +08:00