2025 in Review: 65% Revenue Growth & 1,200% Marketplace Expansion— Get the Full Story!

Become a partnerGet in touch
Get in touch
  • Browse Marketplace
  • Data Annotation

    Human-led labeling for text, audio, image and video

    Machine Translation

    High-quality multilingual content for global AI systems

    Data Collection

    Global, diverse datasets for AI training at scale

    Conversational AI

    Natural, bias-free voice and chat experiences worldwide

    Data & Model Evaluation

    Rigorous testing to ensure accuracy, fairness and quality

    Accelerat.ai

    Smarter multilingual AI agent support for global businesses


    Industries

Find the right datasets for you

Suggested filters

Healthcareimage

Dataset title

Domain

Type

Locale

Amount

Latin books

15K books, scanned and digitized, both fiction and non-fiction.

Various
General

la,

la-latn

15K

European Spanish books

40K+ books, scanned and digitized, both fiction and non-fiction.

Various
General

es-ES

40.5K

European French books

177K+ books, scanned and digitized, both fiction and non-fiction.

Various
General

FR,

fr-FR

177.7K

German books

183K+ books, scanned and digitized, both fiction and non-fiction.

Various
General

DE,

de-DE

183.4K

Greek books

15K+ books, scanned and digitized, both fiction and non-fiction.

Various
General

EL

15K

English books

211K+ books, scanned and digitized, both fiction and non-fiction.

Various
General

EN

211.8K

Italian books

42K+ books, scanned and digitized, both fiction and non-fiction.

Various
General

it-IT

42.6K

Named Entity Tagged Sentences in Modern Standard Arabic

More than 150K Named-Entity Annotated sentences, with 24 categories of Entities.

Various

MS,

AR,

ar-MSA

157.6K

Aspect-Based Sentiment Annotations in European Spanish

more than 50K Aspect-Based Sentiment Annotations of product reviews in European Spanish

Various

es-ES

59.7K

Named Entity Tagged Sentences in Russian

More than 150K Named-Entity Annotated sentences, with 24 categories of Entities.

Various

RU,

ru-RU

155K

Showing 10 of 50 datasets

Datasets per page

Latin books

Domain:

Various
General

Amount:

15K

Locale:

la, la-latn

European Spanish books

Amount:

40.5K

Locale:

es-ES

European French books

Amount:

177.7K

Locale:

FR, fr-FR

German books

Amount:

183.4K

Locale:

DE, de-DE

Greek books

Amount:

15K

Locale:

EL

English books

Amount:

211.8K

Locale:

EN

Italian books

Amount:

42.6K

Locale:

it-IT

Named Entity Tagged Sentences in Modern Standard Arabic

Amount:

157.6K

Locale:

MS, AR, ar-MSA

Aspect-Based Sentiment Annotations in European Spanish

Amount:

59.7K

Locale:

es-ES

Named Entity Tagged Sentences in Russian

Amount:

155K

Locale:

RU, ru-RU

Showing 10 of 50 datasets

1/5

New datasets

Medical Claims Data for AI Model Training

Healthcare

Longitudinal Data in Oncology for AI Model Development

Healthcare

Wearable Health Data for AI Model Training

Healthcare

Couldn’t find the right dataset for you?

Get in touch

© 2026 DefinedCrowd. All rights reserved.

Award logo
Award logo
Award logo
Award logo
Award logo
Award logo

Datasets

Marketplace

Solutions

Privacy and Cookie PolicyTerms & Conditions (T&M)Data License AgreementSupplier Program
Privacy and Cookie PolicyTerms & Conditions (T&M)Data License AgreementSupplier ProgramCCPA Privacy StatementWhistleblowing ChannelCandidate Privacy Statement

© 2026 DefinedCrowd. All rights reserved.

Award logo
Award logo
Award logo
Award logo
Award logo
Award logo