Spokesperson Videos

Start training, fine-tuning, or testing your multimedia understanding, video classification, or content analysis models with this collection of spokesperson videos. The dataset includes videos in multiple languages including Russian, Hindi, Kazakh, English, and Spanish with the majority in Russian (92.3%). Each video features an individual speaking for approximately 6.5 minutes on a wide range of diverse topics.

Start training, fine-tuning, or testing your multimedia understanding, video classification, or content analysis models with this collection of spokesperson videos. The dataset includes videos in multiple languages including Russian, Hindi, Kazakh, English, and Spanish with the majority in Russian (92.3%). Each video features an individual speaking for approximately 6.5 minutes on a wide range of diverse topics.

Start training, fine-tuning, or testing your multimedia understanding, video classification, or content analysis models with this collection of spokesperson videos. The dataset includes videos in multiple languages including Russian, Hindi, Kazakh, English, and Spanish with the majority in Russian (92.3%). Each video features an individual speaking for approximately 6.5 minutes on a wide range of diverse topics.

Start training, fine-tuning, or testing your multimedia understanding, video classification, or content analysis models with this collection of spokesperson videos. The dataset includes videos in multiple languages including Russian, Hindi, Kazakh, English, and Spanish with the majority in Russian (92.3%). Each video features an individual speaking for approximately 6.5 minutes on a wide range of diverse topics.

Media content
Media content

Dataset specs

Type

Video

Content type

Media Content

Amount

122 hours

Dataset SubtypePresentationDomainVariesFile Formatmp4

Leverage

  • Perfect for training AI models in video classification, content recommendation, speech recognition and deepfake detection.

Use Cases

  • Support video generation and motion-transfer models for realistic spokesperson-style speaking sequences.

  • Evaluate identity consistency, lip-sync alignment, and visual–audio coherence in talking-head videos to detect manipulated or synthetic content.

Do you need a specific dataset?

We understand the uniqueness of every project. That's why we offer customizable dataset solutions to match your specific requirements.

Dataset specs

Type

Video

Content type

Media Content

Amount

122 hours

Dataset SubtypePresentationDomainVariesFile Formatmp4

© 2026 DefinedCrowd. All rights reserved.