Channel Separated Conversational Audio
Datasets

Improve model performance by using our off the shelf datasets

Language

All

41

Hindi

2

Indian English

2

Punjabi

1

Telugu

1

Malayalam

1

Bengali

1

Gujarati

1

Assamese

1

Kannada

1

Tamil

1

Marathi

1

Urdu

1

Haryanvi

1

Bhojpuri

1

Maithili

1

Chhattisgarhi

1

Nepali

1

Odia

1

Tulu

1

Kashmiri

1

Sindhi

1

Manipuri

1

Dogri

1

Santhali

1

Japanese

1

Korean

1

Arabic

1

Bahasa Indonesian

1

Brazilian Portuguese

1

US English

1

Bodo

1

Bulgarian

1

Croatian

1

Konkani

1

Malay

1

Romanian

1

Slovak

1

Vietnamese

1

Bangladeshi

1

Data Type

All

41

Conversational

41

Sample Rate

All

41

48 kHz

41

159,425 Hours - Hindi - Conversational Audio Dataset

Hindi

Conversational Audio

Off-The-Shelf

Our Conversational Data in Hindi offers comprehensive and authentic dialogues of Indians conversing in Hindi. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

101,960 Hours - Indian English - Conversational Audio Dataset

Indian English

Conversational Audio

Off-The-Shelf

Our Conversational Audio in Indian English offers comprehensive and authentic dialogues of individuals conversing in Indian English. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

54,780 Hours - Punjabi - Conversational Audio Dataset

Punjabi

Conversational Audio

Off-The-Shelf

Our Conversational Audio in Punjabi offers comprehensive and authentic dialogues of individuals conversing in Punjabi. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

98,602 Hours - Telugu - Conversational Audio Dataset

Telugu

Conversational Audio

Off-The-Shelf

Our Conversational Data in Telugu offers comprehensive and authentic dialogues of Indians conversing in Telugu. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

52,987 Hours - Malayalam - Conversational Audio Dataset

Malayalam

Conversational Audio

Off-The-Shelf

Our Conversational Data in Malayalam offers comprehensive and authentic dialogues of Indians conversing in Malayalam. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

100,080 Hours - Bengali - Conversational Audio Dataset

Bengali

Conversational Audio

Off-The-Shelf

Our Conversational Data in Bengali offers comprehensive and authentic dialogues of Indians conversing in Bengali. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

83,729 Hours - Gujarati - Conversational Audio Dataset

Gujarati

Conversational Audio

Off-The-Shelf

Our Conversational Data in Gujarati offers comprehensive and authentic dialogues of Indians conversing in Gujarati. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

59,231 Hours - Assamese - Conversational Audio Dataset

Assamese

Conversational Audio

Off-The-Shelf

Our Conversational Data in Assamese offers comprehensive and authentic dialogues of Indians conversing in Assamese. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

71,190 Hours - Kannada - Conversational Audio Dataset

Kannada

Conversational Audio

Off-The-Shelf

Our Conversational Data in Kannada offers comprehensive and authentic dialogues of Indians conversing in Kannada. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

73,274 Hours - Tamil - Conversational Audio Dataset

Tamil

Conversational Audio

Off-The-Shelf

Our Conversational Data in Tamil offers comprehensive and authentic dialogues of Indians conversing in Tamil. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

79,304 Hours - Marathi - Conversational Audio Dataset

Marathi

Conversational Audio

Off-The-Shelf

Our Conversational Data in Marathi offers comprehensive and authentic dialogues of Indians conversing in Marathi. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

39,145 Hours - Urdu - Conversational Audio Dataset

Urdu

Conversational Audio

Off-The-Shelf

Our Conversational Data in Urdu offers comprehensive and authentic dialogues of Indians conversing in Urdu. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

1

2

3

4