Channel Separated Conversational Audio Datasets

Improve model performance with our off-the-shelf datasets.

Language: Hindi, Indian English, Punjabi, Telugu, Malayalam, Bengali, Gujarati, Assamese, Kannada, Tamil, Marathi, Urdu, Haryanvi, Bhojpuri, Maithili, Chhattisgarhi, Nepali, Odia, Tulu, Kashmiri, Sindhi, Manipuri, Dogri, Santhali, Japanese, Korean, Arabic, Bahasa Indonesian, Brazilian Portuguese, US English, Bodo, Bulgarian, Croatian, Konkani, Malay, Romanian, Slovak, Vietnamese, Bangladeshi

Data Type: Conversational

Sample Rate: 48 kHz

10,020 Hours - Haryanvi - Conversational Audio Dataset

Haryanvi

Conversational Audio

Off-The-Shelf

Our Conversational Data in Haryanvi offers comprehensive and authentic dialogues of Indians conversing in Haryanvi. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

43,645 Hours - Bhojpuri - Conversational Audio Dataset

Bhojpuri

Conversational Audio

Off-The-Shelf

Our Conversational Data in Bhojpuri offers comprehensive and authentic dialogues of Indians conversing in Bhojpuri. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

49,120 Hours - Maithili - Conversational Audio Dataset

Maithili

Conversational Audio

Off-The-Shelf

Our Conversational Data in Maithili offers comprehensive and authentic dialogues of Indians conversing in Maithili. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

43,670 Hours - Chhattisgarhi - Conversational Audio Dataset

Chhattisgarhi

Conversational Audio

Off-The-Shelf

Our Conversational Data in Chhattisgarhi offers comprehensive and authentic dialogues of Indians conversing in Chhattisgarhi. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,005 Hours - Nepali - Conversational Audio Dataset

Nepali

Conversational Audio

Off-The-Shelf

Our Conversational Data in Nepali offers comprehensive and authentic dialogues of Indians conversing in Nepali. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

49,115 Hours - Odia - Conversational Audio Dataset

Odia

Conversational Audio

Off-The-Shelf

Our Conversational Data in Odia offers comprehensive and authentic dialogues of Indians conversing in Odia. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,019 Hours - Tulu - Conversational Audio Dataset

Tulu

Conversational Audio

Off-The-Shelf

Our Conversational Data in Tulu offers comprehensive and authentic dialogues of Indians conversing in Tulu. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

32,740 Hours - Kashmiri - Conversational Audio Dataset

Kashmiri

Conversational Audio

Off-The-Shelf

Our Conversational Data in Kashmiri offers comprehensive and authentic dialogues of Indians conversing in Kashmiri. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,010 Hours - Sindhi - Conversational Audio Dataset

Sindhi

Conversational Audio

Off-The-Shelf

Our Conversational Data in Sindhi offers comprehensive and authentic dialogues of Indians conversing in Sindhi. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,010 Hours - Manipuri - Conversational Audio Dataset

Manipuri

Conversational Audio

Off-The-Shelf

Our Conversational Data in Manipuri offers comprehensive and authentic dialogues of Indians conversing in Manipuri. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

28,274 Hours - Dogri - Conversational Audio Dataset

Dogri

Conversational Audio

Off-The-Shelf

Our Conversational Data in Dogri offers comprehensive and authentic dialogues of Indians conversing in Dogri. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

5,765 Hours - Santhali - Conversational Audio Dataset

Santhali

Conversational Audio

Off-The-Shelf

Our Conversational Data in Santhali offers comprehensive and authentic dialogues of Indians conversing in Santhali. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.
