Spokesperson Videos
Start training, fine-tuning, or testing your multimedia understanding, video classification, or content analysis models with this collection of spokesperson videos. The dataset includes videos in multiple languages including Russian, Hindi, Kazakh, English, and Spanish with the majority in Russian (92.3%). Each video features an individual speaking for approximately 6.5 minutes on a wide range of diverse topics.
Start training, fine-tuning, or testing your multimedia understanding, video classification, or content analysis models with this collection of spokesperson videos. The dataset includes videos in multiple languages including Russian, Hindi, Kazakh, English, and Spanish with the majority in Russian (92.3%). Each video features an individual speaking for approximately 6.5 minutes on a wide range of diverse topics.
Start training, fine-tuning, or testing your multimedia understanding, video classification, or content analysis models with this collection of spokesperson videos. The dataset includes videos in multiple languages including Russian, Hindi, Kazakh, English, and Spanish with the majority in Russian (92.3%). Each video features an individual speaking for approximately 6.5 minutes on a wide range of diverse topics.
Start training, fine-tuning, or testing your multimedia understanding, video classification, or content analysis models with this collection of spokesperson videos. The dataset includes videos in multiple languages including Russian, Hindi, Kazakh, English, and Spanish with the majority in Russian (92.3%). Each video features an individual speaking for approximately 6.5 minutes on a wide range of diverse topics.
Dataset specs
Type
Video
Content type
Media Content
Amount
122 hours
Leverage
Perfect for training AI models in video classification, content recommendation, speech recognition and deepfake detection.
Use Cases
Support video generation and motion-transfer models for realistic spokesperson-style speaking sequences.
Evaluate identity consistency, lip-sync alignment, and visual–audio coherence in talking-head videos to detect manipulated or synthetic content.
Do you need a specific dataset?
We understand the uniqueness of every project. That's why we offer customizable dataset solutions to match your specific requirements.
Dataset specs
Type
Video
Content type
Media Content
Amount
122 hours