Channel-Separated Conversational Audio Datasets

Improve model performance with our off-the-shelf datasets.

Language: All (41), Hindi (2), Indian English (2), Punjabi (1), Telugu (1), Malayalam (1), Bengali (1), Gujarati (1), Assamese (1), Kannada (1), Tamil (1), Marathi (1), Urdu (1), Haryanvi (1), Bhojpuri (1), Maithili (1), Chhattisgarhi (1), Nepali (1), Odia (1), Tulu (1), Kashmiri (1), Sindhi (1), Manipuri (1), Dogri (1), Santhali (1), Japanese (1), Korean (1), Arabic (1), Bahasa Indonesian (1), Brazilian Portuguese (1), US English (1), Bodo (1), Bulgarian (1), Croatian (1), Konkani (1), Malay (1), Romanian (1), Slovak (1), Vietnamese (1), Bangladeshi (1)

Data Type: All (41), Conversational (41)

Sample Rate: All (41), 48 kHz (41)

1,100 Hours - Indian English - Child Speech Conversational Audio Dataset

Indian English

Conversational Audio

Off-The-Shelf

Child Speech

Our Child Speech Conversational Audio in Indian English offers comprehensive and authentic dialogues of children conversing in Indian English. This dataset features conversations that span a wide range of topics that are relevant to children. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

1,200 Hours - Hindi - Child Speech Conversational Audio Dataset

Hindi

Conversational Audio

Off-The-Shelf

Child Speech

Our Child Speech Conversational Audio in Hindi offers comprehensive and authentic dialogues of children conversing in Hindi. This dataset features conversations that span a wide range of topics that are relevant to children. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

985 Hours - Japanese - Conversational Audio Dataset

Japanese

Conversational Audio

Off-The-Shelf

Our Conversational Data offers comprehensive and authentic dialogues of people conversing. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

964 Hours - Korean - Conversational Audio Dataset

Korean

Conversational Audio

Off-The-Shelf

Our Conversational Data offers comprehensive and authentic dialogues of people conversing. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

985 Hours - Arabic - Conversational Audio Dataset

Arabic

Conversational Audio

Off-The-Shelf

Our Conversational Data offers comprehensive and authentic dialogues of people conversing. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

1,108 Hours - Bahasa Indonesian - Conversational Audio Dataset

Bahasa Indonesian

Conversational Audio

Off-The-Shelf

Our Conversational Data offers comprehensive and authentic dialogues of people conversing. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

1,012 Hours - Brazilian Portuguese - Conversational Audio Dataset

Brazilian Portuguese

Conversational Audio

Off-The-Shelf

Our Conversational Data offers comprehensive and authentic dialogues of people conversing. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

1,002 Hours - US English - Conversational Audio Dataset

US English

Conversational Audio

Off-The-Shelf

Our Conversational Data offers comprehensive and authentic dialogues of people conversing. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

21,885 Hours - Bodo - Conversational Audio Dataset

Bodo

Conversational Audio

Off-The-Shelf

Our Conversational Data in Bodo offers comprehensive and authentic dialogues of Indians conversing in Bodo. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

598 Hours - Bulgarian - Conversational Audio Dataset

Bulgarian

Conversational Audio

Off-The-Shelf

Our Conversational Data in Bulgarian offers comprehensive and authentic dialogues of Bulgarians conversing in Bulgarian. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of Bulgaria, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

1,076 Hours - Croatian - Conversational Audio Dataset

Croatian

Conversational Audio

Off-The-Shelf

Our Conversational Data in Croatian offers comprehensive and authentic dialogues of Croats conversing in Croatian. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of Croatia, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

21,885 Hours - Konkani - Conversational Audio Dataset

Konkani

Conversational Audio

Off-The-Shelf

Our Conversational Data in Konkani offers comprehensive and authentic dialogues of Indians conversing in Konkani. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.