Overview

Parameters and Specifications

Metadata

Sample Data

Sample Metadata

Sample Transcription

Request Data

Overview

Our Conversational Data in Maithili offers comprehensive and authentic dialogues of Indians conversing in Maithili. This dataset features conversations that span a wide range of topics, including daily life, business, education, and more. It includes diverse speakers from different regions of India, capturing various accents and dialects to provide a rich linguistic resource.

The data is collected from natural, spontaneous conversations to ensure authenticity, and each conversation is accurately transcribed with annotations for contextual understanding. Additionally, we offer the flexibility to tailor the topics, conversations, and scenarios according to the specific needs of your company, ensuring that the dataset aligns perfectly with your requirements.

Parameters and Specifications

Data type

Conversational, Labelled

Format

Audio - .wav (44100Hz, 16-bit)

Unique Speakers

2

Platform Hardware

Mobile Device

Audio Tracks

Individual Speaker Stems (Stereo)

Metadata

For each recording the following metadata will be available

Age of speakers

Gender

Social Background

Geographical Location

Recording Platform

Topic

Scenario

Accent

Dialect

Sample Data

Individual Speaker Stems

General Conversation

Duration: 0:00

Waveform loading... 0%

0:00
0:00
0:00
0:00
0:00
0:00
0:00
0:00
0:00
0:00
0:00

Speaker 1

Audio - .wav (44100Hz, 16-bit)

Speaker 2

Audio - .wav (44100Hz, 16-bit)

1.0

0.5

0.0

-0.5

-1.0

1.0

0.5

0.0

-0.5

-1.0

Sample Metadata

Sample Transcription

You can request below to get access to our Transcription Guidelines.

Transcription Sample

Speaker 1

25, Male, Gurugram, Haryana

[
  {
    "index": 0,
    "start_time": 2,
    "end_time": 2.4,
    "text": "Hello"
  },
  {
    "index": 1,
    "start_time": 5.1,
    "end_time": 6.4,
    "text": "हां, की समाचार?"
  },
  {
    "index": 2,
    "start_time": 9.1,
    "end_time": 10.4,
    "text": "बढ़िया! बढ़िया! [ खुश]"
  },
  {
    "index": 3,
    "start_time": 11.6,
    "end_time": 13.4,
    "text": "की, की, कए रहल छिये आए कल?"
  },
  {
    "index": 4,
    "start_time": 20,
    "end_time": 21.5,
    "text": "बस अखन काम"
  },
  {
    "index": 5,
    "start_time": 21.8,
    "end_time": 24.9,
    "text": "कर रहल छलओ त {ouu} बस dial कर दिलु फोन।"
  },
  {
    "index": 6,
    "start_time": 25.4,
    "end_time": 26.7,
    "text": "और कहियौ कि समाचार?"
  },
  {
    "index": 7,
    "start_time": 26.8,
    "end_time": 27.2,
    "text": "[coughing]"
  },
  {
    "index": 8,
    "start_time": 30.4,
    "end_time": 33,
    "text": "नए, अखन क तो नए सोचने छिए, जबई।"
  },
  {
    "index": 9,
    "start_time": 33.7,
    "end_time": 34.8,
    "text": "छठ पूजा और में"
  },
  {
    "index": 10,
    "start_time": 40.7,
    "end_time": 42.2,
    "text": "अहाँ! अखन गाऊ गेल छिए ना?"
  },
  {
    "index": 11,
    "start_time": 45.4,
    "end_time": 48.5,
    "text": "अच्छा! अखन कोनो movie वगेरा आएल छे की अपन?"
  },
  {
    "index": 12,
    "start_time": 48.7,
    "end_time": 49.8,
    "text": "अपना ओर के मैथली में?"
  },
  {
    "index": 13,
    "start_time": 57.2,
    "end_time": 57.7,
    "text": "{Hmmm}"
  },
  {
    "index": 14,
    "start_time": 58.6,
    "end_time": 58.9,
    "text": "ना"
  },
  {
    "index": 15,
    "start_time": 65.8,
    "end_time": 66.8,
    "text": "आ YouTube में आएल छे?"
  },
  {
    "index": 16,
    "start_time": 73.3,
    "end_time": 75.3,
    "text": "अच्छा! अच्छा! हम तो देखबो न केलिए अखन।"
  },
  {
    "index": 17,
    "start_time": 76,
    "end_time": 77.2,
    "text": "और,{uh} और कोना?"
  },
  {
    "index": 18,
    "start_time": 87.1,
    "end_time": 87.6,
    "text": "हां हां"
  },
  {
    "index": 19,
    "start_time": 88.2,
    "end_time": 89.2,
    "text": "हां वो तो जोश"
  },
  {
    "index": 20,
    "start_time": 89.6,
    "end_time": 90.5,
    "text": "तो आएल छले न।"
  },
  {
    "index": 21,
    "start_time": 96.9,
    "end_time": 98.3,
    "text": "अच्छा, {uhh} हम त"
  },
  {
    "index": 22,
    "start_time": 98.6,
    "end_time": 99.3,
    "text": "{aa} बहुत"
  },
  {
    "index": 23,
    "start_time": 99.4,
    "end_time": 102.6,
    "text": "पहिले हम देखने रहिए 2011-12 में ओह"
  },
  {
    "index": 24,
    "start_time": 102.8,
    "end_time": 104.4,
    "text": "सस्ता जिंदगी महग सेनूर [laughs]"
  },
  {
    "index": 25,
    "start_time": 104.8,
    "end_time": 106,
    "text": "ता ओ तो देखने हैवे ना।"
  },
  {
    "index": 26,
    "start_time": 106.2,
    "end_time": 106.5,
    "text": "[laughs]"
  },
  {
    "index": 27,
    "start_time": 115.4,
    "end_time": 116,
    "text": "ओहि में त सब"
  },
  {
    "index": 28,
    "start_time": 116.1,
    "end_time": 119.5,
    "text": "हां, गाना एक पर एक छे, गाना तो {uh} एक पर एक छे अउ"
  },
  {
    "index": 29,
    "start_time": 120.4,
    "end_time": 120.7,
    "text": "उहि"
  },
  {
    "index": 30,
    "start_time": 121,
    "end_time": 122.1,
    "text": "हा ओहि जमाना हम"
  },
  {
    "index": 31,
    "start_time": 122.2,
    "end_time": 122.8,
    "text": "{uhh}"
  },
  {
    "index": 32,
    "start_time": 122.9,
    "end_time": 125.3,
    "text": "करीब 2004-05 में"
  },
  {
    "index": 33,
    "start_time": 125.5,
    "end_time": 131.5,
    "text": "ई बढ़िया फिलीम {ss} चइल रहल छेले , पूरा Family के साथ हम सब गांव से गेल रहिए बाहर देखै लेल।"
  },
  {
    "index": 34,
    "start_time": 132,
    "end_time": 133.3,
    "text": "वो भी टैक्टर पर"
  },
  {
    "index": 35,
    "start_time": 135.3,
    "end_time": 135.7,
    "text": "[laughs]"
  },
  {
    "index": 36,
    "start_time": 140.4,
    "end_time": 140.8,
    "text": "हा"
  },
  {
    "index": 37,
    "start_time": 141.2,
    "end_time": 145.8,
    "text": "अपना ओर देश {n} त में बहुते कम भा गेले Cinema Hall, बगेरा जो भी छए सेहा सब [inaudible]"
  },
  {
    "index": 38,
    "start_time": "149",
    "end_time": "157.9",
    "text": "बात छए internetवगेराह क बहुते कम अथि आएब गेले न, Internet [inaudible] जेते आहा का बाहर बगेरह म छे तेत्ते अपना ओर देश में नए न छए अखन।"
  },
  {
    "index": 39,
    "start_time": 159.3,
    "end_time": 160.4,
    "text": "सह छए तए द्वारा"
  },
  {
    "index": 40,
    "start_time": "161",
    "end_time": "165.4",
    "text": "से अखन, री facebook बगेरा ये सब insta वगेरा देखए छे तब"
  },
  {
    "index": 41,
    "start_time": 165.8,
    "end_time": 166.8,
    "text": "वहै पर चएल रहल छए"
  },
  {
    "index": 42,
    "start_time": 172.8,
    "end_time": 174.5,
    "text": "चलु ठीक छे त, bye"
  },
  {
    "index": 43,
    "start_time": 176.6,
    "end_time": 177.4,
    "text": "ठीक छे, ठीक छे, bye"
  }
]

Speaker 2

25, Female, Madhubani, Bihar

[
  {
    "index": 0,
    "start_time": "2",
    "end_time": "7.5",
    "text": "hello,एकदम बढ़िया,अपन बताऊं?"
  },
  {
    "index": 1,
    "start_time": 10.9,
    "end_time": 12.2,
    "text": "और की भ रहल छलए ?"
  },
  {
    "index": 2,
    "start_time": 15.2,
    "end_time": 17.5,
    "text": "कुछो न कर रहल छी, आहा! बताऊं, की भ रहल अछि"
  },
  {
    "index": 3,
    "start_time": 24.9,
    "end_time": 25.2,
    "text": "पो"
  },
  {
    "index": 4,
    "start_time": 26.7,
    "end_time": 28.1,
    "text": "एकदम बढ़िया, घर नए जैब?"
  },
  {
    "index": 5,
    "start_time": 34.9,
    "end_time": 37.1,
    "text": "छैएठ में तो सब घर एबे करए छए।"
  },
  {
    "index": 6,
    "start_time": "37.7",
    "end_time": "38.5",
    "text": "कत्तो रहए।"
  },
  {
    "index": 7,
    "start_time": "42.3",
    "end_time": "43.2",
    "text": "हम घर पर छिए।"
  },
  {
    "index": 8,
    "start_time": 50.4,
    "end_time": 55,
    "text": "ऐते ऐखन कोनो movie नए आएल छे, ऐखन एकटा web series आएल छले नून रोटी।"
  },
  {
    "index": 9,
    "start_time": 56.2,
    "end_time": 56.6,
    "text": "ओ web"
  },
  {
    "index": 10,
    "start_time": 57.2,
    "end_time": 58.7,
    "text": "हां,वो {umm}"
  },
  {
    "index": 11,
    "start_time": 58.9,
    "end_time": 63.8,
    "text": "मैथिली के पहिल web series छए तो ओ अहा देख सकए छिए, YouTube म त मैथिली पर छए।"
  },
  {
    "index": 12,
    "start_time": 65.2,
    "end_time": 66,
    "text": "[inaudible]"
  },
  {
    "index": 13,
    "start_time": 66.4,
    "end_time": 71,
    "text": "हां,हां बेराजोगरी पर बनल छए नून रोटी, विकास झा और रौशनी झा के"
  },
  {
    "index": 14,
    "start_time": 74.7,
    "end_time": 76.2,
    "text": "हां त ओ अहु देख सकइ छि।"
  },
  {
    "index": 15,
    "start_time": 77.2,
    "end_time": 85.6,
    "text": "अरे,बहुत पहले एकटा जक्शन हाल्ट आएल छलए ओहो आहा देख सकए अछि ओहो बहुत नीक ओएमे ओहो छे छथिन ओ दुर्गेश नए छेथिन पंचायत वाला।"
  },
  {
    "index": 16,
    "start_time": 87,
    "end_time": 89.3,
    "text": "ओहिमें रोल केने छथिन हा, हा "
  },
  {
    "index": 17,
    "start_time": 89.9,
    "end_time": 90.7,
    "text": "हां हां हां"
  },
  {
    "index": 18,
    "start_time": 90.8,
    "end_time": 95.1,
    "text": "वैह रोल केने छथिन उही में की कहे छे जक्शन हाल्ट मे।बहुत नीक मूवी छए ओहो"
  },
  {
    "index": 19,
    "start_time": 96.2,
    "end_time": 97.8,
    "text": "ओहो आहा देख सकए छिए।"
  },
  {
    "index": 20,
    "start_time": 104.6,
    "end_time": 105.2,
    "text": "ओ वाला"
  },
  {
    "index": 21,
    "start_time": 105.4,
    "end_time": 113.1,
    "text": "ओएमे बहुत एकटा ओकर famous गीतो छे, हमरा याद नए आएब रहल अएछ, ओ त बहुत famous movie छे, सस्ता जिंदगी महग सेनूर।।"
  },
  {
    "index": 22,
    "start_time": 117.8,
    "end_time": 120,
    "text": "हा, बहुत एकदम नीक नीक।"
  },
  {
    "index": 23,
    "start_time": "120.1",
    "end_time": "120.9",
    "text": "सचे म।"
  },
  {
    "index": 24,
    "start_time": 131.4,
    "end_time": 132.1,
    "text": "[inaudible]"
  },
  {
    "index": 25,
    "start_time": 135.6,
    "end_time": 136,
    "text": "छेले या"
  },
  {
    "index": 26,
    "start_time": "136.1",
    "end_time": "138.6",
    "text": "अब तो लोग जाएते नए, आब त cinema hall नए छए।"
  },
  {
    "index": 27,
    "start_time": 143.7,
    "end_time": 147.3,
    "text": "राजनगर में Cinema Hall छे, लेकिन ओत्त लोग जेबए नए करए छए देखे लेल।"
  },
  {
    "index": 28,
    "start_time": 157.3,
    "end_time": 158.3,
    "text": "हां आता है।"
  },
  {
    "index": 29,
    "start_time": 160.1,
    "end_time": 160.6,
    "text": "सही बात"
  },
  {
    "index": 30,
    "start_time": 166.6,
    "end_time": 167.8,
    "text": "हां, हां सै सब।"
  },
  {
    "index": 31,
    "start_time": 172.9,
    "end_time": 173.5,
    "text": "फेर त"
  },
  {
    "index": 32,
    "start_time": 173.7,
    "end_time": 175,
    "text": "ठीक छे, करी छे फेर गोप।"
  }
]

Request Data

You can file a request to get access to the data.