Conversational Audio - Sindhi

Authentic dialogues between Indian speakers conversing in Sindhi.

Overview

Our Conversational Data in Sindhi offers authentic dialogues between Indian speakers of Sindhi. The conversations span a wide range of topics, including daily life, business, and education. Speakers come from different regions of India, so the dataset captures a variety of accents and dialects. The data is collected from natural, spontaneous conversations, and each conversation is transcribed with annotations that provide context. Topics, conversations, and scenarios can also be tailored to your company's specific requirements.

Parameters and Specifications

Data type: Conversational, Labelled

Format: Audio - .wav (44100Hz, 16-bit)

Unique Speakers: 2

Platform Hardware: Mobile Device

Audio Tracks: Individual Speaker Stems (Stereo)
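
The format stated above (44100 Hz, 16-bit .wav) can be checked on delivery with Python's standard `wave` module. The following is a minimal sketch, not part of the dataset tooling: the helper names `check_wav` and `make_silent_wav` are hypothetical, and the silent file is generated in memory only so the example runs without real recordings. The channel count is reported but not enforced, because the page does not say how the stereo stems are laid out.

```python
import io
import wave

EXPECTED_RATE_HZ = 44100   # sample rate stated in the specification
EXPECTED_SAMPLE_WIDTH = 2  # 16-bit audio is 2 bytes per sample


def check_wav(source):
    """Return (ok, info) for a WAV path string or binary file-like object."""
    with wave.open(source, "rb") as wav:
        info = {
            "sample_rate_hz": wav.getframerate(),
            "bit_depth": wav.getsampwidth() * 8,
            "channels": wav.getnchannels(),
            "duration_s": wav.getnframes() / wav.getframerate(),
        }
    ok = (
        info["sample_rate_hz"] == EXPECTED_RATE_HZ
        and info["bit_depth"] == EXPECTED_SAMPLE_WIDTH * 8
    )
    return ok, info


def make_silent_wav(seconds=1.0, channels=2):
    """Build an in-memory silent WAV so the sketch runs without real data."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(EXPECTED_SAMPLE_WIDTH)
        wav.setframerate(EXPECTED_RATE_HZ)
        n_frames = int(seconds * EXPECTED_RATE_HZ)
        wav.writeframes(b"\x00" * (n_frames * channels * EXPECTED_SAMPLE_WIDTH))
    buffer.seek(0)
    return buffer


ok, info = check_wav(make_silent_wav())
print(ok, info)
```

With real data, `check_wav` would be called with the path of a delivered stem instead of the generated buffer.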

Metadata

For each recording, the following metadata is available:

Age of speakers

Gender

Social Background

Geographical Location

Recording Platform

Topic

Scenario

Accent

Dialect
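
One way to hold these metadata items in code is a small record type. This is only a sketch: the class name `RecordingMetadata` and all field names are hypothetical, since the page lists the items but does not show the delivered schema. The example values are limited to what this page itself states for Speaker 1 and for the recording platform; everything else is left unset.

```python
from dataclasses import dataclass, asdict
from typing import Optional


@dataclass
class RecordingMetadata:
    # Hypothetical field names that mirror the metadata items listed above.
    age: Optional[int] = None
    gender: Optional[str] = None
    social_background: Optional[str] = None
    geographical_location: Optional[str] = None
    recording_platform: Optional[str] = None
    topic: Optional[str] = None
    scenario: Optional[str] = None
    accent: Optional[str] = None
    dialect: Optional[str] = None

    def missing_fields(self):
        """Names of fields that have not been filled in."""
        return [name for name, value in asdict(self).items() if value is None]


# Only values that this page states explicitly are filled in.
speaker_1 = RecordingMetadata(
    age=21,
    gender="Female",
    geographical_location="North West Delhi, Delhi",
    recording_platform="Mobile Device",
)
print(speaker_1.missing_fields())
```

A `missing_fields` check of this kind can flag incomplete records before they are used for filtering by accent, dialect, or topic.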

Sample Data

Individual Speaker Stems

General Conversation

Speaker 1

Audio - .wav (44100Hz, 16-bit)

Speaker 2

Audio - .wav (44100Hz, 16-bit)

Sample Metadata

Sample Transcription

Our Transcription Guidelines are available on request.

Transcription Sample

Speaker 1

21, Female, North West Delhi, Delhi

[
  {
    "index": 0,
    "start_time": 1.2,
    "end_time": 15.39,
    "text": "hello जय झूलेलाल [happy] {आँ} ताँ काथे था ख़ां गालायो?"
  },
  {
    "index": 1,
    "start_time": 17.79,
    "end_time": 26.88,
    "text": "अच्छा? ग्वालियर! हुते त डाढ़ो मस्त सिंधी food मिलदो आ। [background noise]"
  },
  {
    "index": 2,
    "start_time": 27.39,
    "end_time": 40.17,
    "text": "अच्छा! [breathes] मां बि कुछ महिणा पहले {आं} घर सां shift कित्तो हुओ [breathes] for college purpose लाई। त मां त घर जो खाणों डाढ़ो ही miss तो कयां especially सिंधी food"
  },
  {
    "index": 3,
    "start_time": 45.36,
    "end_time": 60.21,
    "text": "[breathes] मुखे हर छे ही डाढ़ी सुट्ठी लगदी आ but specifically अगर असां गाल कयूँ त सिंधी कढ़ी मुखे डाढ़ी सुट्ठी लगदी आ। [excited]"
  },
  {
    "index": 4,
    "start_time": 64.41,
    "end_time": 77.46,
    "text": "[background noise] हा हूअ ही त [happy] हिन करे त एकदम best लगदी आए [background noise] हा [breathes]"
  },
  {
    "index": 5,
    "start_time": 77.46,
    "end_time": 89.34,
    "text": "बिल्कुल, चावर सां गड्ड अऊं हुनजी best गाल त इय आ कि [breathes] हुणमें protein, nutrient सब हिक बराबर हूँदो आ"
  },
  {
    "index": 6,
    "start_time": 89.37,
    "end_time": 104.16,
    "text": "ओह दाल पकवान! [excited] हो त डाढ़ो ही सुठो हूँदो आ [background noise] अऊँ हो अचार ख़ां गड्ड त डाढ़ो ही सुट्ठो लगदो आ [happy]"
  },
  {
    "index": 7,
    "start_time": 108,
    "end_time": 121.29,
    "text": "कोकी! हा बिल्कुल, हो त मुहिंजो go to breakfast आहे [breathes] अऊँ कोकी [background noise] को"
  },
  {
    "index": 8,
    "start_time": 121.56,
    "end_time": 133.86,
    "text": "हा बिल्कुल कोकी [breathes] डही जे सांण डाढ़ी सुठी लगदी आ [background noise] हा हा बिल्कुल"
  },
  {
    "index": 9,
    "start_time": 138.54,
    "end_time": 153.33,
    "text": "न हाणे नथी अचे ठीका जय झूलेलाल"
  }
]
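
Each segment's `text` mixes spoken words with square-bracket tags such as `[happy]` or `[background noise]` and curly-brace items such as `{आँ}`. The sketch below separates the two from the spoken text. It assumes that square brackets mark events or emotions and that curly braces mark fillers; the Transcription Guidelines are the authority on what each marker means, and the function name `split_annotations` is hypothetical.

```python
import re

EVENT_TAG = re.compile(r"\[([^\]]+)\]")  # e.g. [happy], [background noise]
FILLER = re.compile(r"\{([^}]+)\}")      # e.g. {आँ}


def split_annotations(text):
    """Return (clean_text, event_tags, fillers) for one segment's text."""
    events = EVENT_TAG.findall(text)
    fillers = FILLER.findall(text)
    clean = FILLER.sub(" ", EVENT_TAG.sub(" ", text))
    clean = " ".join(clean.split())  # collapse the gaps left by removed tags
    return clean, events, fillers


# Segment 1 of the Speaker 1 sample, copied as-is.
segment = {
    "index": 1,
    "start_time": 17.79,
    "end_time": 26.88,
    "text": "अच्छा? ग्वालियर! हुते त डाढ़ो मस्त सिंधी food मिलदो आ। [background noise]",
}
clean, events, fillers = split_annotations(segment["text"])
duration = round(segment["end_time"] - segment["start_time"], 2)
print(clean)
print(events, fillers, duration)
```

Keeping the tags as separate lists, rather than discarding them, preserves the annotation for tasks such as emotion or noise labelling while giving a clean string for language modelling.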

Speaker 2

24, Male, Gwalior, Madhya Pradesh

[
  {
    "index": 0,
    "start_time": 2.58,
    "end_time": 16.98,
    "text": "hello हाँजी [happy] जय झूलेलाल मां ग्वालियर रहंदो आयां"
  },
  {
    "index": 1,
    "start_time": 19.35,
    "end_time": 26.4,
    "text": "हाँजी अरे डाढ़ो सुठो खूब variety आ हित्ते त"
  },
  {
    "index": 2,
    "start_time": 35.82,
    "end_time": 50.61,
    "text": "अच्छा? अहा! [happy] अच्छा {अ} छा सुठो लगदो आ तवां खे सिंधी food में? [excited] {हम्म}"
  },
  {
    "index": 3,
    "start_time": 55.08,
    "end_time": 69.63,
    "text": "ओह हो हो सिंधी कढ़ी! वाह भई वाह [excited] सिन्धी कढ़ी is world's best. ऐमें तवां खे मालूम आ डाढ़ी भाजियूँ पहिंदी हिन त खट्टी कित्ती थिंदी आ [laughs]"
  },
  {
    "index": 4,
    "start_time": 70.2,
    "end_time": 76.35,
    "text": "और {अम अ} चांवरन सां गड्ड {अ} serve कयी वेंदी आ। तवां खादी आ?"
  },
  {
    "index": 5,
    "start_time": 83.97,
    "end_time": 98.25,
    "text": "{हम्म हम्म हम्म} अच्छा बाक़ी मुहिंजो favourite त दाल पकवान आ [happy] डाढ़ो सुट्ठो! crispy and spicy combination शा त थींदो आ। पापड़ी थिंदी आ, चणन जी दाल थिंदी आ त बस"
  },
  {
    "index": 6,
    "start_time": 98.28,
    "end_time": 112.56,
    "text": "अर मिर्चियूँ अहा! [happy] अच्छा अच्छा तवां {अ} कोकी खादी आ कोकी? [laughs] अरे वाह"
  },
  {
    "index": 7,
    "start_time": 114.66,
    "end_time": 127.53,
    "text": "{हम्म हम्म हम्म} कोकी मतलब मोटो पराठो थिंदो आ न तेमे बसर ऐं सब पहिंदा आहिन। धनो धनियो पेंदो आ हम्म"
  },
  {
    "index": 8,
    "start_time": 127.56,
    "end_time": 141.54,
    "text": "ठीक आ त बढ़िया असां असां न मिलंदा से त गड्डजी खायिंदा से। अच्छा हाणे त तवां खे याद नथी अचे ना? सिंधी food जी [laughs]"
  },
  {
    "index": 9,
    "start_time": 141.54,
    "end_time": 147.69,
    "text": "ठीक आ त असां मिलंदा से ठीक आ? जय झूलेलाल"
  }
]
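
Because each speaker is transcribed separately, reconstructing the conversation means merging the two segment lists on their timestamps. The following is a minimal sketch under stated assumptions: `merge_timeline` and `overlaps` are hypothetical helper names, the timestamps are taken to be seconds on a clock shared by both stems, and only the first two segments of each speaker are reproduced, with `text` omitted for brevity.

```python
import json

speaker_1_json = """[
  {"index": 0, "start_time": 1.2, "end_time": 15.39},
  {"index": 1, "start_time": 17.79, "end_time": 26.88}
]"""
speaker_2_json = """[
  {"index": 0, "start_time": 2.58, "end_time": 16.98},
  {"index": 1, "start_time": 19.35, "end_time": 26.4}
]"""


def merge_timeline(tracks):
    """Merge {speaker: segments} into one list sorted by start time."""
    merged = [
        {"speaker": speaker, **segment}
        for speaker, segments in tracks.items()
        for segment in segments
    ]
    return sorted(merged, key=lambda s: (s["start_time"], s["end_time"]))


def overlaps(timeline):
    """Yield (earlier, later, seconds) for overlapping cross-speaker segments."""
    for i, first in enumerate(timeline):
        for second in timeline[i + 1:]:
            if second["start_time"] >= first["end_time"]:
                break  # sorted by start, so nothing later can overlap `first`
            if second["speaker"] != first["speaker"]:
                shared = min(first["end_time"], second["end_time"]) - second["start_time"]
                yield first, second, round(shared, 2)


timeline = merge_timeline({
    "Speaker 1": json.loads(speaker_1_json),
    "Speaker 2": json.loads(speaker_2_json),
})
for first, second, seconds in overlaps(timeline):
    print(first["speaker"], first["index"], second["speaker"], second["index"], seconds)
```

Overlapping timestamps do not necessarily mean simultaneous speech; a segment window may include pauses in which the other speaker talks, so the per-speaker stems remain the reference for who is audible when.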