Skip to content

showlab/MovieBench

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 

Repository files navigation

MovieBench

A Hierarchical Movie Level Dataset for Long Video Generation

🎶 Updates

  • Dec. 16, 2024. Release DataSplit, Scene Split.
  • Dec. 16, 2024. Release the Scripts for Shot-Level Annotation Generation with GPT4.
  • Nov. 22, 2024. Rep initialization.

🎶 Todo

  • Release Dataset within the next three months.
  • Building Leaderboard.
  • Release Metric Scripts.

🐱 Abstract

MovieBench is a Hierarchical Movie-Level Dataset for Long Video Generation, which addresses these challenges by providing unique contributions: (1) movie-length videos featuring rich, coherent storylines and multi-scene narratives, (2) consistency of character appearance and audio across scenes, and (3) hierarchical data structure contains high-level movie information and detailed shot-level descriptions. Experiments demonstrate that MovieBench brings some new insights and challenges, such as maintaining character ID consistency across multiple scenes for various characters. The dataset will be public and continuously maintained, aiming to advance the field of long video generation.


image.


image.

⏬ Download Data

⏬ Shot-Level Annotation Generation with GPT4

We developed our Shot-Level Annotation Generation system based on MovieSeq, leveraging GPT-4 to enhance its functionality.

image description

Using a Visual Language Model (e.g., GPT-4), you can generate detailed annotations that include the following elements:

{
    "Characters":
    {
        "Character Name 1": "Description for appearance and behavior of Character 1, within 30 words",
        "Character Name 2": "Description for appearance and behavior of Character 2, within 30 words", 
    },
    "Style Elements":
    [
        "Element 1", "Element 2", "Element 3"
    ],
    "Plot":"A concise summary focusing on the main event or emotion, within 80 words",
    "Background Description":"A concise summary focusing on the main event or emotion, within 40 words",
    "Camera Motion":"A concise summary focusing camera motion, within 30 words."
}

For detailed environment setup and usage instructions, please refer to the corresponding README.

📖BibTeX

@misc{wu2024moviebenchhierarchicalmovielevel,
  title={MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation}, 
  author={Weijia Wu and Mingyu Liu and Zeyu Zhu and Xi Xia and Haoen Feng and Wen Wang and Kevin Qinghong Lin and Chunhua Shen and Mike Zheng Shou},
  year={2024},
  eprint={2411.15262},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2411.15262}, 
  }

🤗Acknowledgements

  • Thanks to Diffusers for the wonderful work.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published