Mixed-Genre Videos

Start training, fine-tuning or testing your multimedia understanding, video classification, or content analysis models with this diverse collection of videos in various languages. The dataset includes movies, talk shows, interviews, and event recordings captured at resolutions ranging from 720p to 1080p. It features a broad set of languages, including Nigerian languages (English, Yoruba, Hausa), Swahili (Tanzania and Kenya), Twi (Ghana), Amharic (Ethiopia), Lingala (Congo), and Kinyarwanda (Rwanda), among others, making it well suited for a wide range of multimedia AI applications.

Start training, fine-tuning or testing your multimedia understanding, video classification, or content analysis models with this diverse collection of videos in various languages. The dataset includes movies, talk shows, interviews, and event recordings captured at resolutions ranging from 720p to 1080p. It features a broad set of languages, including Nigerian languages (English, Yoruba, Hausa), Swahili (Tanzania and Kenya), Twi (Ghana), Amharic (Ethiopia), Lingala (Congo), and Kinyarwanda (Rwanda), among others, making it well suited for a wide range of multimedia AI applications.

Start training, fine-tuning or testing your multimedia understanding, video classification, or content analysis models with this diverse collection of videos in various languages. The dataset includes movies, talk shows, interviews, and event recordings captured at resolutions ranging from 720p to 1080p. It features a broad set of languages, including Nigerian languages (English, Yoruba, Hausa), Swahili (Tanzania and Kenya), Twi (Ghana), Amharic (Ethiopia), Lingala (Congo), and Kinyarwanda (Rwanda), among others, making it well suited for a wide range of multimedia AI applications.

Start training, fine-tuning or testing your multimedia understanding, video classification, or content analysis models with this diverse collection of videos in various languages. The dataset includes movies, talk shows, interviews, and event recordings captured at resolutions ranging from 720p to 1080p. It features a broad set of languages, including Nigerian languages (English, Yoruba, Hausa), Swahili (Tanzania and Kenya), Twi (Ghana), Amharic (Ethiopia), Lingala (Congo), and Kinyarwanda (Rwanda), among others, making it well suited for a wide range of multimedia AI applications.

Media content
Media content

Dataset specs

Type

Video

Content type

Media Content

Amount

100K hours

Dataset SubtypeMedia ContentDomainEntertainmentFile Formatmp4

Leverage

  • Perfect for training AI models in video classification, content recommendation, speech recognition and emotion detection.

Use Cases

  • Build generative models that learn cross-genre structure and multilingual narrative patterns from diverse video sources.

  • Support dialogue-heavy scene understanding across varied cultural and linguistic contexts.

Do you need a specific dataset?

We understand the uniqueness of every project. That's why we offer customizable dataset solutions to match your specific requirements.

Dataset specs

Type

Video

Content type

Media Content

Amount

100K hours

Dataset SubtypeMedia ContentDomainEntertainmentFile Formatmp4

© 2026 DefinedCrowd. All rights reserved.