Defined.ai Awarded ISO 42001 Certification, Strengthening Leadership in Responsible AI Data


Find the right datasets for you


Filipino accented English Podcasts

10,000 hours of Filipino-accented English live podcasts, recorded by real podcasters in our partner network.

Domain: Various, Podcast
Locale: EN
Amount: 10K hours

Japanese accented English Podcasts

10,000 hours of Japanese-accented English live, non-simulated podcasts, recorded by real podcasters in our partner network.

Domain: Various, Podcast
Locale: EN
Amount: 10K hours

Chinese accented English Podcasts

10,000 hours of Chinese-accented English live, non-simulated podcasts, recorded by real podcasters in our partner network.

Domain: Various, Podcast
Locale: EN
Amount: 10K hours

Malaysian accented English Podcasts

10,000 hours of Malaysian-accented English live, non-simulated podcasts, recorded by real podcasters in our partner network.

Domain: Various, Podcast
Locale: EN
Amount: 10K hours

Thai accented English Podcasts

10,000 hours of Thai-accented English live, non-simulated podcasts, recorded by real podcasters in our partner network.

Domain: Various, Podcast
Locale: TH, EN
Amount: 10K hours

Egyptian English Animation Video Dataset — 234 Hours for AI Training

Egyptian English animation video dataset with 234 hours across various genres.

Domain: Animation
Amount: 234 hours

Polish English Animation Video Dataset — 206 Hours for AI Training

Polish English animation video dataset with 206 hours across various genres.

Domain: Animation
Amount: 206 hours

Showing 7 of 7 datasets


New datasets

Medical Claims Data for AI Model Training

Healthcare

Longitudinal Data in Oncology for AI Model Development

Healthcare

Wearable Health Data for AI Model Training

Healthcare

Hot datasets

Live Spanish Call Center Audio Dataset

Call Center

DICOM Medical Imaging Dataset with Clinical Reports

Healthcare

Multimodal Dataset for Household Robotics

Robotics
3D and Lidar


© 2026 DefinedCrowd. All rights reserved.
