Zangwei Zheng
|
4f171fd800
|
update caption
|
2024-04-07 00:48:03 +08:00 |
|
Zheng Zangwei (Alex Zheng)
|
c9b81d8fd6
|
add camera motion detection (#34)
|
2024-04-05 23:42:31 +08:00 |
|
Frank Lee
|
6a86f20386
|
fixed llava image gen performance (#28)
|
2024-04-03 11:32:10 +08:00 |
|
Zangwei Zheng
|
5ade5e5984
|
[fix] use decord for tools
|
2024-04-01 16:45:23 +08:00 |
|
Zangwei Zheng
|
ff15a0acfb
|
[docs] data processing
|
2024-04-01 16:08:53 +08:00 |
|
Zangwei Zheng
|
c553ee274f
|
better aspect ratio
|
2024-04-01 14:27:13 +08:00 |
|
Zangwei Zheng
|
cdd3b5eb74
|
llava fix and config modification
|
2024-03-31 23:44:37 +08:00 |
|
Zheng Zangwei (Alex Zheng)
|
f1ee27ba2f
|
[feat] llava support image and text (#13)
* [feat] llava support image and text
* add resize for image
* update gpt4 caption
* update prompt for llava image captioning
|
2024-03-31 20:59:33 +08:00 |
|
Frank Lee
|
0f2bb1700b
|
refactored llava captioning (#12)
|
2024-03-31 01:11:03 +08:00 |
|
Zheng Zangwei (Alex Zheng)
|
a289bced09
|
Dev/zangwei (#10)
* format and some fix
* support csv for llava
|
2024-03-30 13:46:23 +08:00 |
|
Zheng Zangwei (Alex Zheng)
|
f9f539f07e
|
format and some fix (#8)
|
2024-03-30 13:34:19 +08:00 |
|
Zheng Zangwei (Alex Zheng)
|
2679bad9a0
|
update readme (#6)
|
2024-03-29 23:34:53 +08:00 |
|
Frank Lee
|
b704d6c0f8
|
Feature/llava speedup (#2)
* [caption] accelerated llava with flash attention and parallel frame extraction
* supported dp and tp in llava
* code formatting
|
2024-03-27 16:55:25 +08:00 |
|
Zangwei Zheng
|
937f52f5bb
|
add video captioning docs
|
2024-03-17 19:36:03 +08:00 |
|
Zangwei Zheng
|
921e3138b3
|
update docs
|
2024-03-17 15:47:48 +08:00 |
|