Channel Separated Conversational Audio
Datasets

Improve model performance by using our off the shelf datasets

Language

All

41

Hindi

2

Indian English

2

Punjabi

1

Telugu

1

Malayalam

1

Bengali

1

Gujarati

1

Assamese

1

Kannada

1

Tamil

1

Marathi

1

Urdu

1

Haryanvi

1

Bhojpuri

1

Maithili

1

Chhattisgarhi

1

Nepali

1

Odia

1

Tulu

1

Kashmiri

1

Sindhi

1

Manipuri

1

Dogri

1

Santhali

1

Japanese

1

Korean

1

Arabic

1

Bahasa Indonesian

1

Brazilian Portuguese

1

US English

1

Bodo

1

Bulgarian

1

Croatian

1

Konkani

1

Malay

1

Romanian

1

Slovak

1

Vietnamese

1

Bangladeshi

1

Data Type

All

41

Conversational

41

Sample Rate

All

41

48 kHz

41

871 Hours - Malay - Conversational Audio Dataset

Malay

Conversational Audio

Off-The-Shelf

Our Conversational Data in Malay offers comprehensive and authentic dialogues of Malaysians conversing in Malay. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of Malaysia, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

786 Hours - Romanian - Conversational Audio Dataset

Romanian

Conversational Audio

Off-The-Shelf

Our Conversational Data in Romanian offers comprehensive and authentic dialogues of Romanians conversing in Romanian. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of Romania, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

987 Hours - Slovak - Conversational Audio Dataset

Slovak

Conversational Audio

Off-The-Shelf

Our Conversational Data in Slovak offers comprehensive and authentic dialogues of Slovaks conversing in Slovak. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of Slovakia, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

1,002 Hours - Vietnamese - Conversational Audio Dataset

Vietnamese

Conversational Audio

Off-The-Shelf

Our Conversational Data in Vietnamese offers comprehensive and authentic dialogues. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of Vietnam, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

21,885 Hours - Bangladeshi - Conversational Audio Dataset

Bangladeshi

Conversational Audio

Off-The-Shelf

Our Conversational Data in Bangladeshi offers comprehensive and authentic dialogues of Bangladeshis. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of Bangladesh, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

1

2

3

4