Find the right datasets for you

Dataset title

Domain

Type

Locale

Amount

Arabic Question-Answer pairs

200,000 Question-Answer pairs in Arabic.

Academic
Question - Answer

AR

200K

English Question-Answer pairs

100,000,000 Question-Answer pairs in English.

Academic
Question - Answer

EN

100M

English Question-Answer pairs

4,000,000 Question-Answer pairs in English regarding medical topics between patients and doctors.

Healthcare
Question - Answer

EN

4M

Hindi Question-Answer pairs

2,500,000 Question-Answer pairs in Hindi in the STEM domain.

Academic
Question - Answer

hi-IN

2.5M

English Question-Answer pairs

4,000,000 higher education Question-Answer pairs in English.

Question - Answer

EN

4M

English Question-Answer pairs

55,000 Question-Answer pairs in English regarding medical topics between patients and doctors.

Healthcare
Question - Answer

EN

55K

Interview Video Dataset — 9,749 Hours of Diverse English AI Interview Recordings

Interview video dataset with 9,749 hours of real online human–AI job interviews with machine-generated transcriptions.

Question - Answer
General

EN

9.7K hours

Multilingual Interview Video Dataset — 255 Hours of Non-English AI Interview Recordings

A multilingual interview video dataset with 255 hours of real online human–AI job interviews.

Question - Answer
General

Various

255 hours

Showing 8 of 8 datasets

New datasets

Medical Claims Data for AI Model Training

Healthcare

Longitudinal Data in Oncology for AI Model Development

Healthcare

Wearable Health Data for AI Model Training

Healthcare

Hot datasets

Live Spanish Call Center Audio Dataset

Call Center

DICOM Medical Imaging Dataset with Clinical Reports

Healthcare

Multimodal Dataset for Household Robotics

Robotics
3D and Lidar

© 2026 DefinedCrowd. All rights reserved.
