Defined.ai Awarded ISO 42001 Certification, Strengthening Leadership in Responsible AI DataRead the press release

Become a partnerGet in touch
Get in touch
  • Browse Marketplace
  • Data Annotation

    Model-in-the-loop, expert-verified labeling for text, audio, image and video

    Machine Translation

    High-quality multilingual content for global AI systems

    Data Collection

    Global, diverse datasets for AI training at scale

    Conversational AI

    Natural, bias-free voice and chat experiences worldwide

    Data & Model Evaluation

    Rigorous testing to ensure accuracy, fairness and quality

    Accelerat.ai

    Smarter multilingual AI agent support for global businesses


    Industries

Find the right datasets for you

Suggested filters

Healthcareimage

Dataset title

Domain

Type

Locale

Amount

German Podcasts

30000 hours of German live, non-simulated podcasts, recorded by real podcasters in our partner network.

Various
Podcast

DE

21.3K hours

German Podcasts

21305 hours of German live, non-simulated podcasts, recorded by real podcasters in our partner network.

Various
Podcast

DE,

de-DE

231 hours

Bavarian German Podcasts

108 hours of Bavarian German live, non-simulated podcasts, recorded by real podcasters in our partner network.

Various
Podcast

AR,

DE

108 hours

Swiss German Podcasts

93 hours of Swiss German live, non-simulated podcasts, recorded by real podcasters in our partner network.

Various
Podcast

DE

93 hours

German Podcasts

10777 hours of German live, non-simulated podcasts, recorded by real podcasters in our partner network.

Various
Podcast

DE

10.8K hours

Showing 5 of 5 datasets

Datasets per page

German Podcasts

Domain:

Various
Podcast

Amount:

21.3K hours

Locale:

DE

German Podcasts

Amount:

231 hours

Locale:

DE, de-DE

Bavarian German Podcasts

Amount:

108 hours

Locale:

AR, DE

Swiss German Podcasts

Amount:

93 hours

Locale:

DE

German Podcasts

Amount:

10.8K hours

Locale:

DE

Showing 5 of 5 datasets

1/1

New datasets

Medical Claims Data for AI Model Training

Healthcare

Longitudinal Data in Oncology for AI Model Development

Healthcare

Wearable Health Data for AI Model Training

Healthcare

Hot datasets

Live Spanish Call Center Audio Dataset

Call Center

DICOM Medical Imaging Dataset with Clinical Reports

Healthcare

Multimodal Dataset for Household Robotics

Robotics
3D and Lidar

Couldn’t find the right dataset for you?

Get in touch

© 2026 DefinedCrowd. All rights reserved.

Award logo
Award logo
Award logo
Award logo
Award logo
Award logo

Datasets

Marketplace

Dataset Types

Privacy and Cookie PolicyTerms & Conditions (T&M)Data License AgreementSupplier ProgramCCPA Privacy StatementWhistleblowing ChannelCandidate Privacy Statement

© 2026 DefinedCrowd. All rights reserved.

Award logo
Award logo
Award logo
Award logo
Award logo
Award logo