Transcribed English Doctor-Patient conversations
Conversations between physicians and their patients is hard to get a hold of, especially if you are looking for data that is ethically collected and fully licensable for AI training purposes. This dataset ticks all the boxes, and covers a variety of medical specialities. Besides these transcriptions, both the original audio files as well SOAP-structured notes of the conversations are available as well.
Conversations between physicians and their patients is hard to get a hold of, especially if you are looking for data that is ethically collected and fully licensable for AI training purposes. This dataset ticks all the boxes, and covers a variety of medical specialities. Besides these transcriptions, both the original audio files as well SOAP-structured notes of the conversations are available as well.
Conversations between physicians and their patients is hard to get a hold of, especially if you are looking for data that is ethically collected and fully licensable for AI training purposes. This dataset ticks all the boxes, and covers a variety of medical specialities. Besides these transcriptions, both the original audio files as well SOAP-structured notes of the conversations are available as well.
Conversations between physicians and their patients is hard to get a hold of, especially if you are looking for data that is ethically collected and fully licensable for AI training purposes. This dataset ticks all the boxes, and covers a variety of medical specialities. Besides these transcriptions, both the original audio files as well SOAP-structured notes of the conversations are available as well.
Dataset specs
Type
Text
File format
wav
Region/Locale
en-US, EN
Amount
8K
Leverage
Train clinical conversational AI to understand and reason over real doctor patient interactions.
Use cases
Improve medical dialogue agents for summarising consultations and generating clinician style follow up notes.
Fine tune triage models to extract symptoms, conditions, and next steps from natural patient language.



Do you need a specific dataset?
We understand the uniqueness of every project. That's why we offer customizable dataset solutions to match your specific requirements.

Dataset specs
Type
Text
File format
wav
Region/Locale
en-US, EN
Amount
8K