Channel Separated Conversational Audio
Datasets

Improve model performance by using our off the shelf datasets

Language

All

Hindi

Indian English

Punjabi

Telugu

Malayalam

Bengali

Gujarati

Assamese

Kannada

Tamil

Marathi

Urdu

Haryanvi

Bhojpuri

Maithili

Chhattisgarhi

Nepali

Odia

Tulu

Kashmiri

Sindhi

Manipuri

Dogri

Santhali

Japanese

Korean

Arabic

Bahasa Indonesian

Brazilian Portuguese

US English

Data Type

All

Conversational

Sample Rate

All

48 kHz

21,885 Hours - Hindi - Conversational Audio Dataset

Hindi

Conversational Audio

Off-The-Shelf

Our Conversational Data in Hindi offers comprehensive and authentic dialogues of Indians conversing in Hindi. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

20,392 Hours - Indian English - Conversational Audio Dataset

Indian English

Conversational Audio

Off-The-Shelf

Our Conversational Audio in Indian English offers comprehensive and authentic dialogues of individuals conversing in Indian English. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,120 Hours - Punjabi - Conversational Audio Dataset

Punjabi

Conversational Audio

Off-The-Shelf

Our Conversational Audio in Punjabi offers comprehensive and authentic dialogues of individuals conversing in Punjabi. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

19,351 Hours - Telugu - Conversational Audio Dataset

Telugu

Conversational Audio

Off-The-Shelf

Our Conversational Data in Telugu offers comprehensive and authentic dialogues of Indians conversing in Telugu. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,078 Hours - Malayalam - Conversational Audio Dataset

Malayalam

Conversational Audio

Off-The-Shelf

Our Conversational Data in Malayalam offers comprehensive and authentic dialogues of Indians conversing in Malayalam. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

18,616 Hours - Bengali - Conversational Audio Dataset

Bengali

Conversational Audio

Off-The-Shelf

Our Conversational Data in Bengali offers comprehensive and authentic dialogues of Indians conversing in Bengali. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

13,503 Hours - Gujarati - Conversational Audio Dataset

Gujarati

Conversational Audio

Off-The-Shelf

Our Conversational Data in Gujarati offers comprehensive and authentic dialogues of Indians conversing in Gujarati. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

9,896 Hours - Assamese - Conversational Audio Dataset

Assamese

Conversational Audio

Off-The-Shelf

Our Conversational Data in Assamese offers comprehensive and authentic dialogues of Indians conversing in Assamese. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,050 Hours - Kannada - Conversational Audio Dataset

Kannada

Conversational Audio

Off-The-Shelf

Our Conversational Data in Kannada offers comprehensive and authentic dialogues of Indians conversing in Kannada. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,005 Hours - Tamil - Conversational Audio Dataset

Tamil

Conversational Audio

Off-The-Shelf

Our Conversational Data offers comprehensive and authentic dialogues of Indians conversing in Marathi. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,853 Hours - Marathi - Conversational Audio Dataset

Marathi

Conversational Audio

Off-The-Shelf

Our Conversational Data offers comprehensive and authentic dialogues of Indians conversing. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

10,084 Hours - Urdu - Conversational Audio Dataset

Urdu

Conversational Audio

Off-The-Shelf

Our Conversational Data in Urdu offers comprehensive and authentic dialogues of Indians conversing in Urdu. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource. The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

Channel Separated Conversational Audio Datasets

Language

Data Type

Sample Rate

21,885 Hours - Hindi - Conversational Audio Dataset

20,392 Hours - Indian English - Conversational Audio Dataset

10,120 Hours - Punjabi - Conversational Audio Dataset

19,351 Hours - Telugu - Conversational Audio Dataset

10,078 Hours - Malayalam - Conversational Audio Dataset

18,616 Hours - Bengali - Conversational Audio Dataset

13,503 Hours - Gujarati - Conversational Audio Dataset

9,896 Hours - Assamese - Conversational Audio Dataset

10,050 Hours - Kannada - Conversational Audio Dataset

10,005 Hours - Tamil - Conversational Audio Dataset

10,853 Hours - Marathi - Conversational Audio Dataset

10,084 Hours - Urdu - Conversational Audio Dataset

Channel Separated Conversational Audio
Datasets