Channel Separated Conversational Audio
Datasets

Improve model performance by using our off the shelf datasets

Language

All

Hindi

Indian English

Punjabi

Telugu

Malayalam

Bengali

Gujarati

Assamese

Kannada

Tamil

Marathi

Urdu

Haryanvi

Bhojpuri

Maithili

Chhattisgarhi

Nepali

Odia

Tulu

Kashmiri

Sindhi

Manipuri

Dogri

Santhali

Japanese

Korean

Arabic

Bahasa Indonesian

Brazilian Portuguese

US English

Bodo

Bulgarian

Croatian

Konkani

Malay

Romanian

Slovak

Vietnamese

Bangladeshi

Data Type

All

Conversational

Sample Rate

All

48 kHz

871 Hours - Malay - Conversational Audio Dataset

Malay

Conversational Audio

Off-The-Shelf

Our Conversational Data in Malay offers comprehensive and authentic dialogues of Malaysians conversing in Malay. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of Malaysia, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

786 Hours - Romanian - Conversational Audio Dataset

Romanian

Conversational Audio

Off-The-Shelf

Our Conversational Data in Romanian offers comprehensive and authentic dialogues of Romanians conversing in Romanian. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of Romania, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

987 Hours - Slovak - Conversational Audio Dataset

Slovak

Conversational Audio

Off-The-Shelf

Our Conversational Data in Slovak offers comprehensive and authentic dialogues of Slovaks conversing in Slovak. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of Slovakia, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

1,002 Hours - Vietnamese - Conversational Audio Dataset

Vietnamese

Conversational Audio

Off-The-Shelf

Our Conversational Data in Vietnamese offers comprehensive and authentic dialogues. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of Vietnam, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

21,885 Hours - Bangladeshi - Conversational Audio Dataset

Bangladeshi

Conversational Audio

Off-The-Shelf

Our Conversational Data in Bangladeshi offers comprehensive and authentic dialogues of Bangladeshis. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of Bangladesh, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

Channel Separated Conversational Audio Datasets

Language

Data Type

Sample Rate

871 Hours - Malay - Conversational Audio Dataset

786 Hours - Romanian - Conversational Audio Dataset

987 Hours - Slovak - Conversational Audio Dataset

1,002 Hours - Vietnamese - Conversational Audio Dataset

21,885 Hours - Bangladeshi - Conversational Audio Dataset

Channel Separated Conversational Audio
Datasets