Channel Separated Conversational Audio Datasets

Improve model performance with our off-the-shelf datasets.

Language: All (26), Hindi (2), Indian English (2), Punjabi (1), Telugu (1), Malayalam (1), Bengali (1), Gujarati (1), Assamese (1), Kannada (1), Tamil (1), Marathi (1), Urdu (1), Haryanvi (1), Bhojpuri (1), Maithili (1), Chhattisgarhi (1), Nepali (1), Odia (1), Tulu (1), Kashmiri (1), Sindhi (1), Manipuri (1), Dogri (1), Santhali (1)

Data Type: All (26), Conversational (26)

Sample Rate: All (26), 48 kHz (26)

21,885 Hours - Hindi - Conversational Audio Dataset

Hindi

Conversational Audio

Off-The-Shelf

Our Conversational Data in Hindi offers comprehensive and authentic dialogues of Indians conversing in Hindi. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

20,392 Hours - Indian English - Conversational Audio Dataset

Indian English

Conversational Audio

Off-The-Shelf

Our Conversational Audio in Indian English offers comprehensive and authentic dialogues of individuals conversing in Indian English. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,120 Hours - Punjabi - Conversational Audio Dataset

Punjabi

Conversational Audio

Off-The-Shelf

Our Conversational Audio in Punjabi offers comprehensive and authentic dialogues of individuals conversing in Punjabi. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

19,351 Hours - Telugu - Conversational Audio Dataset

Telugu

Conversational Audio

Off-The-Shelf

Our Conversational Data in Telugu offers comprehensive and authentic dialogues of Indians conversing in Telugu. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,078 Hours - Malayalam - Conversational Audio Dataset

Malayalam

Conversational Audio

Off-The-Shelf

Our Conversational Data in Malayalam offers comprehensive and authentic dialogues of Indians conversing in Malayalam. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

18,616 Hours - Bengali - Conversational Audio Dataset

Bengali

Conversational Audio

Off-The-Shelf

Our Conversational Data in Bengali offers comprehensive and authentic dialogues of Indians conversing in Bengali. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

13,503 Hours - Gujarati - Conversational Audio Dataset

Gujarati

Conversational Audio

Off-The-Shelf

Our Conversational Data in Gujarati offers comprehensive and authentic dialogues of Indians conversing in Gujarati. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

9,896 Hours - Assamese - Conversational Audio Dataset

Assamese

Conversational Audio

Off-The-Shelf

Our Conversational Data in Assamese offers comprehensive and authentic dialogues of Indians conversing in Assamese. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,050 Hours - Kannada - Conversational Audio Dataset

Kannada

Conversational Audio

Off-The-Shelf

Our Conversational Data in Kannada offers comprehensive and authentic dialogues of Indians conversing in Kannada. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,005 Hours - Tamil - Conversational Audio Dataset

Tamil

Conversational Audio

Off-The-Shelf

Our Conversational Data in Tamil offers comprehensive and authentic dialogues of Indians conversing in Tamil. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,853 Hours - Marathi - Conversational Audio Dataset

Marathi

Conversational Audio

Off-The-Shelf

Our Conversational Data in Marathi offers comprehensive and authentic dialogues of Indians conversing in Marathi. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,084 Hours - Urdu - Conversational Audio Dataset

Urdu

Conversational Audio

Off-The-Shelf

Our Conversational Data in Urdu offers comprehensive and authentic dialogues of Indians conversing in Urdu. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.
