Scam Alert: We’ve detected unauthorized use of the Defined.ai name. Read the notice

  • Browse Marketplace
  • Data Annotation: Model-in-the-loop, expert-verified labeling for text, audio, image and video
  • Machine Translation: High-quality multilingual content for global AI systems
  • Data Collection: Global, diverse datasets for AI training at scale
  • Conversational AI: Natural, bias-free voice and chat experiences worldwide
  • Data & Model Evaluation: Rigorous testing to ensure accuracy, fairness and quality
  • Accelerat.ai: Smarter multilingual AI agent support for global businesses
  • Industries

Find the right datasets for you

| Dataset title | Description | Domain | Type | Locale | Amount |
|---|---|---|---|---|---|
| Latin books | 15K books, scanned and digitized, both fiction and non-fiction. | Books | | la, la-latn | 15K |
| European Spanish books | 40K+ books, scanned and digitized, both fiction and non-fiction. | Books | | es-ES | 40.5K |
| European French books | 177K+ books, scanned and digitized, both fiction and non-fiction. | Books | | FR, fr-FR | 177.7K |
| Anonymized Invoices | A collection of anonymized invoices. | | | EN | 50K |
| German books | 183K+ books, scanned and digitized, both fiction and non-fiction. | Books | | DE, de-DE | 183.4K |
| Greek books | 15K+ books, scanned and digitized, both fiction and non-fiction. | Books | | EL | 15K |
| English books | 211K+ books, scanned and digitized, both fiction and non-fiction. | Books | | EN | 211.8K |
| Italian books | 42K+ books, scanned and digitized, both fiction and non-fiction. | Books | | it-IT | 42.6K |
| Arabic Question-Answer pairs | 200,000 Question-Answer pairs in Arabic. | Academic, Question - Answer | | AR | 200K |
| English Question-Answer pairs | 100,000,000 Question-Answer pairs in English. | Academic, Question - Answer | | EN | 100M |

Showing 10 of 51 datasets

New datasets

  • Medical Claims Data for AI Model Training (Healthcare)
  • Longitudinal Data in Oncology for AI Model Development (Healthcare)
  • Wearable Health Data for AI Model Training (Healthcare)

Hot datasets

  • Live Spanish Call Center Audio Dataset (Call Center)
  • DICOM Medical Imaging Dataset with Clinical Reports (Healthcare)
  • Multimodal Dataset for Household Robotics (Robotics, 3D and Lidar)

Couldn’t find the right dataset for you?

Get in touch

© 2026 DefinedCrowd. All rights reserved.

Datasets

Marketplace

Solutions

Privacy and Cookie Policy | Terms & Conditions (T&M) | Data License Agreement | Supplier Program | CCPA Privacy Statement | Whistleblowing Channel | Candidate Privacy Statement
