2025 in Review: 65% Revenue Growth & 1,200% Marketplace Expansion— Get the Full Story!

Become a partnerGet in touch
Get in touch
  • Browse Marketplace
  • Data Annotation

    Human-led labeling for text, audio, image and video

    Machine Translation

    High-quality multilingual content for global AI systems

    Data Collection

    Global, diverse datasets for AI training at scale

    Conversational AI

    Natural, bias-free voice and chat experiences worldwide

    Data & Model Evaluation

    Rigorous testing to ensure accuracy, fairness and quality

    Accelerat.ai

    Smarter multilingual AI agent support for global businesses


    Industries

Find the right datasets for you

Suggested filters

Healthcareimage

Dataset title

Domain

Type

Locale

Amount

Latin books

15K books, scanned and digitized, both fiction and non-fiction.

Various
General

la,

la-latn

15K

European Spanish books

40K+ books, scanned and digitized, both fiction and non-fiction.

Various
General

es-ES

40.5K

European French books

177K+ books, scanned and digitized, both fiction and non-fiction.

Various
General

FR,

fr-FR

177.7K

German books

183K+ books, scanned and digitized, both fiction and non-fiction.

Various
General

DE,

de-DE

183.4K

Greek books

15K+ books, scanned and digitized, both fiction and non-fiction.

Various
General

EL

15K

English books

211K+ books, scanned and digitized, both fiction and non-fiction.

Various
General

EN

211.8K

Italian books

42K+ books, scanned and digitized, both fiction and non-fiction.

Various
General

it-IT

42.6K

Korean Question-Answer pairs

1,250,480 Question-Answer pairs in Korean.

Academic
General

KO,

ko-KR

1.3M

English books

92K+ books, scanned and digitized, both fiction and non-fiction.

General

EN

92.7K

German Podcasts

231 hours of German simulated podcasts, recorded with studio quality.

Various
General

de-DE,

DE

30K hours

Showing 10 of 165 datasets

...

Datasets per page

Latin books

Domain:

Various
General

Amount:

15K

Locale:

la, la-latn

European Spanish books

Amount:

40.5K

Locale:

es-ES

European French books

Amount:

177.7K

Locale:

FR, fr-FR

German books

Amount:

183.4K

Locale:

DE, de-DE

Greek books

Amount:

15K

Locale:

EL

English books

Amount:

211.8K

Locale:

EN

Italian books

Amount:

42.6K

Locale:

it-IT

Korean Question-Answer pairs

Domain:

Academic
General

Amount:

1.3M

Locale:

KO, ko-KR

English books

Amount:

92.7K

Locale:

EN

German Podcasts

Amount:

30K hours

Locale:

de-DE, DE

Showing 10 of 165 datasets

1/17

New datasets

Medical Claims Data for AI Model Training

Healthcare

Longitudinal Data in Oncology for AI Model Development

Healthcare

Wearable Health Data for AI Model Training

Healthcare

Couldn’t find the right dataset for you?

Get in touch

© 2026 DefinedCrowd. All rights reserved.

Award logo
Award logo
Award logo
Award logo
Award logo
Award logo

Datasets

Marketplace

Solutions

Privacy and Cookie PolicyTerms & Conditions (T&M)Data License AgreementSupplier Program
Privacy and Cookie PolicyTerms & Conditions (T&M)Data License AgreementSupplier ProgramCCPA Privacy StatementWhistleblowing ChannelCandidate Privacy Statement

© 2026 DefinedCrowd. All rights reserved.

Award logo
Award logo
Award logo
Award logo
Award logo
Award logo