Open-Sora/eval/README.md

# Evalution

## Human evaluation

To conduct human evaluation, we need to generate various samples. We provide many prompts in `assets/texts`, and defined some test setting covering different resolution, duration and aspect ratio in `eval/sample.sh`. To facilitate the usage of multiple GPUs, we split sampling tasks into several parts.

```bash
# image (1)
bash eval/sample.sh /path/to/ckpt -1
# video (2a 2b 2c ...)
bash eval/sample.sh /path/to/ckpt -2a
# launch 8 jobs at once (you must read the script to understand the details)
bash eval/launch.sh /path/to/ckpt
```

## VBench

[VBench](https://github.com/Vchitect/VBench) is a benchmark for short text to video generation. We provide a script for easily generating samples required by VBench.

```bash
# vbench tasks (4a 4b 4c ...)
bash eval/sample.sh /path/to/ckpt -4a
# launch 8 jobs at once (you must read the script to understand the details)
bash eval/launch.sh /path/to/ckpt
```

After generation, install the VBench package according to their [instructions](https://github.com/Vchitect/VBench?tab=readme-ov-file#hammer-installation). Then, run the following commands to evaluate the generated samples.

```bash
bash eval/vbench/vbench.sh /path/to/video_folder
```

## VBench-i2v

[VBench-i2v](https://github.com/Vchitect/VBench/tree/master/vbench2_beta_i2v) is a benchmark for short image to video generation (beta version).

TBD
update docs 2024-04-23 11:26:10 +02:00			`# Evalution`

			`## Human evaluation`

			To conduct human evaluation, we need to generate various samples. We provide many prompts in `assets/texts`, and defined some test setting covering different resolution, duration and aspect ratio in `eval/sample.sh`. To facilitate the usage of multiple GPUs, we split sampling tasks into several parts.

			```bash
update config 2024-04-24 04:33:27 +02:00			`# image (1)`
update docs 2024-04-23 11:26:10 +02:00			`bash eval/sample.sh /path/to/ckpt -1`
update config 2024-04-24 04:33:27 +02:00			`# video (2a 2b 2c ...)`
update docs 2024-04-23 11:26:10 +02:00			`bash eval/sample.sh /path/to/ckpt -2a`
			`# launch 8 jobs at once (you must read the script to understand the details)`
			`bash eval/launch.sh /path/to/ckpt`
			```

			`## VBench`

			`[VBench](https://github.com/Vchitect/VBench) is a benchmark for short text to video generation. We provide a script for easily generating samples required by VBench.`

			```bash
update config 2024-04-24 04:33:27 +02:00			`# vbench tasks (4a 4b 4c ...)`
			`bash eval/sample.sh /path/to/ckpt -4a`
update docs 2024-04-23 11:26:10 +02:00			`# launch 8 jobs at once (you must read the script to understand the details)`
			`bash eval/launch.sh /path/to/ckpt`
			```

update config 2024-04-24 04:33:27 +02:00			`After generation, install the VBench package according to their [instructions](https://github.com/Vchitect/VBench?tab=readme-ov-file#hammer-installation). Then, run the following commands to evaluate the generated samples.`

			```bash
			`bash eval/vbench/vbench.sh /path/to/video_folder`
			```
update docs 2024-04-23 11:26:10 +02:00
			`## VBench-i2v`

			`[VBench-i2v](https://github.com/Vchitect/VBench/tree/master/vbench2_beta_i2v) is a benchmark for short image to video generation (beta version).`

			`TBD`