Files in this item

 Download all files in item (2.27 GB)
This item is Publicly Available and licensed under:
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Name: README.txt
Size: 7.53 KB
Format: Text file
Description: readme
MD5: 6c1e5534546498b51c95079edb7fd04e
 File Preview  
Spjallromur - Icelandic Conversational Speech

About the Spjallrómur corpus
----------------------------
Spjallrómur is an open-source conversational speech corpus for speech
technology development. The corpus is 21 hrs and 20 mins long, with 54
conversations and 102 speakers in total. The data was collected over one year
(September 2020 - September 2021) by Reykjavík University. The corpus has two
parts: the first contains full conversations, and the second contains half
conversations.
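
The totals above can be cross-checked with a little arithmetic. The sketch
below assumes that a full conversation contributes two speakers and a half
conversation contributes one; this is not stated in the README but matches the
file listing on this page. The function name is illustrative only.

```python
def split_conversations(total_conversations, total_speakers):
    """Solve full + half = conversations and 2*full + half = speakers.

    Assumption (not from the README): a full conversation has two
    speakers and a half conversation has one.
    """
    full = total_speakers - total_conversations
    half = total_conversations - full
    if full < 0 or half < 0:
        raise ValueError("totals are inconsistent with the assumption")
    return full, half


# Totals quoted in the README: 54 conversations, 102 speakers.
full_count, half_count = split_conversations(54, 102)
print(full_count, half_count)  # prints "48 6"
```

Under that assumption the corpus would hold 48 full and 6 half conversations.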

The dataset was created primarily for automatic speech recognition, but due to
its nature it can also be used for other speech technology fields, such as
speaker identification, speaker diarization, and conversational language
modeling.

Spjallrómur was collected using a custom-made online chat platform called
spjall, which is Icelandic for chat. Each speaker used their own microphone
and device (some microphones picked up background noise, such as neighboring
or other speakers). The audio . . .
                                            
Name: spjallromur_2603.zip
Size: 2.27 GB
Format: application/zip
Description: corpus
MD5: 80e7058c27d519c18050d2d63d084241
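
The MD5 values listed on this page can be used to check that a download
arrived intact. The following is a minimal sketch using Python's standard
`hashlib` module; the function names and the `EXPECTED_MD5` table are
illustrative, and the file is assumed to keep the name shown on this page.

```python
import hashlib
from pathlib import Path

# MD5 checksums as listed on this page.
EXPECTED_MD5 = {
    "README.txt": "6c1e5534546498b51c95079edb7fd04e",
    "spjallromur_2603.zip": "80e7058c27d519c18050d2d63d084241",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Chunked reads avoid loading the 2.27 GB archive into memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path):
    """Compare a downloaded file against the checksum listed for its name."""
    path = Path(path)
    return md5_of_file(path) == EXPECTED_MD5[path.name]
```

For example, `verify_download("spjallromur_2603.zip")` would return `True`
only if the local file matches the listed checksum.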
 File Preview  
  • spjallromur
    • data
      • full_conversations
        • dc0967ee
          • speaker_b_convo_dc0967ee_transcript.json (94 kB)
          • speaker_a_convo_dc0967ee.wav (29 MB)
          • speaker_a_convo_dc0967ee_demographics.json (113 B)
          • speaker_a_convo_dc0967ee_transcript.json (253 kB)
          • speaker_b_convo_dc0967ee.wav (29 MB)
          • speaker_b_convo_dc0967ee_demographics.json (113 B)
        • 879325a8
          • speaker_b_convo_879325a8_demographics.json (113 B)
          • speaker_b_convo_879325a8_transcript.json (148 kB)
          • speaker_b_convo_879325a8.wav (42 MB)
          • speaker_a_convo_879325a8_demographics.json (113 B)
          • speaker_a_convo_879325a8_transcript.json (257 kB)
          • speaker_a_convo_879325a8.wav (42 MB)
        • 9a67bc98
          • speaker_b_convo_9a67bc98_demographics.json (114 B)
          • speaker_a_convo_9a67bc98.wav (23 MB)
          • speaker_b_convo_9a67bc98_transcript.json (51 kB)
          • speaker_a_convo_9a67bc98_demographics.json (113 B)
          • speaker_a_convo_9a67bc98_transcript.json (130 kB)
          • speaker_b_convo_9a67bc98.wav (23 MB)
        • aad7caab
          • speaker_b_convo_aad7caab.wav (56 MB)
          • speaker_a_convo_aad7caab_transcript.json (183 kB)
          • speaker_a_convo_aad7caab.wav (56 MB)
          • speaker_b_convo_aad7caab_demographics.json (113 B)
          • speaker_b_convo_aad7caab_transcript.json (404 kB)
          • speaker_a_convo_aad7caab_demographics.json (111 B)
        • de3b604f
          • speaker_b_convo_de3b604f_transcript.json (339 kB)
          • speaker_b_convo_de3b604f.wav (66 MB)
          • speaker_a_convo_de3b604f_demographics.json (110 B)
          • speaker_a_convo_de3b604f_transcript.json (331 kB)
          • speaker_a_convo_de3b604f.wav (66 MB)
          • speaker_b_convo_de3b604f_demographics.json (113 B)
        • a56ed5af
          • speaker_b_convo_a56ed5af.wav (64 MB)
          • speaker_b_convo_a56ed5af_demographics.json (114 B)
          • speaker_b_convo_a56ed5af_transcript.json (355 kB)
          • speaker_a_convo_a56ed5af_demographics.json (112 B)
          • speaker_a_convo_a56ed5af.wav (64 MB)
          • speaker_a_convo_a56ed5af_transcript.json (224 kB)
        • 7faf84e8
          • speaker_b_convo_7faf84e8.wav (20 MB)
          • speaker_a_convo_7faf84e8_transcript.json (58 kB)
          • speaker_b_convo_7faf84e8_demographics.json (113 B)
          • speaker_a_convo_7faf84e8.wav (20 MB)
          • speaker_a_convo_7faf84e8_demographics.json (113 B)
          • speaker_b_convo_7faf84e8_transcript.json (61 kB)
        • 50d1de3c
          • speaker_a_convo_50d1de3c.wav (31 MB)
          • speaker_a_convo_50d1de3c_transcript.json (234 kB)
          • speaker_a_convo_50d1de3c_demographics.json (110 B)
          • speaker_b_convo_50d1de3c.wav (31 MB)
          • speaker_b_convo_50d1de3c_transcript.json (157 kB)
          • speaker_b_convo_50d1de3c_demographics.json (109 B)
        • 188092d3
          • speaker_a_convo_188092d3_demographics.json (114 B)
          • speaker_b_convo_188092d3.wav (54 MB)
          • speaker_b_convo_188092d3_transcript.json (99 kB)
          • speaker_a_convo_188092d3_transcript.json (561 kB)
          • speaker_a_convo_188092d3.wav (54 MB)
          • speaker_b_convo_188092d3_demographics.json (113 B)
        • 5331448b
          • speaker_b_convo_5331448b_transcript.json (164 kB)
          • speaker_b_convo_5331448b.wav (28 MB)
          • speaker_a_convo_5331448b_demographics.json (112 B)
          • speaker_a_convo_5331448b_transcript.json (169 kB)
          • speaker_a_convo_5331448b.wav (27 MB)
          • speaker_b_convo_5331448b_demographics.json (113 B)
        • 45eebf55
          • speaker_a_convo_45eebf55.wav (59 MB)
          • speaker_b_convo_45eebf55_transcript.json (152 kB)
          • speaker_b_convo_45eebf55_demographics.json (112 B)
          • speaker_a_convo_45eebf55_transcript.json (528 kB)
          • speaker_b_convo_45eebf55.wav (59 MB)
          • speaker_a_convo_45eebf55_demographics.json (114 B)
        • deb42548
          • speaker_a_convo_deb42548_demographics.json (113 B)
          • speaker_b_convo_deb42548_transcript.json (251 kB)
          • speaker_a_convo_deb42548.wav (60 MB)
          • speaker_a_convo_deb42548_transcript.json (409 kB)
          • speaker_b_convo_deb42548_demographics.json (110 B)
          • speaker_b_convo_deb42548.wav (60 MB)
        • b107d272
          • speaker_a_convo_b107d272_transcript.json (219 kB)
          • speaker_b_convo_b107d272_demographics.json (113 B)
          • speaker_b_convo_b107d272.wav (36 MB)
          • speaker_a_convo_b107d272_demographics.json (114 B)
          • speaker_b_convo_b107d272_transcript.json (162 kB)
          • speaker_a_convo_b107d272.wav (36 MB)
        • 92a95e84
          • speaker_b_convo_92a95e84_transcript.json (116 kB)
          • speaker_b_convo_92a95e84_demographics.json (114 B)
          • speaker_b_convo_92a95e84.wav (64 MB)
          • speaker_a_convo_92a95e84_transcript.json (204 kB)
          • speaker_a_convo_92a95e84_demographics.json (114 B)
          • speaker_a_convo_92a95e84.wav (64 MB)
        • 66ccf3bc
          • speaker_a_convo_66ccf3bc.wav (52 MB)
          • speaker_b_convo_66ccf3bc_transcript.json (385 kB)
          • speaker_b_convo_66ccf3bc.wav (52 MB)
          • speaker_b_convo_66ccf3bc_demographics.json (114 B)
          • speaker_a_convo_66ccf3bc_demographics.json (113 B)
          • speaker_a_convo_66ccf3bc_transcript.json (135 kB)
        • 69079ee1
          • speaker_a_convo_69079ee1_transcript.json (236 kB)
          • speaker_b_convo_69079ee1.wav (33 MB)
          • speaker_b_convo_69079ee1_demographics.json (114 B)
          • speaker_b_convo_69079ee1_transcript.json (211 kB)
          • speaker_a_convo_69079ee1.wav (33 MB)
          • speaker_a_convo_69079ee1_demographics.json (114 B)
        • 01119679
          • speaker_a_convo_01119679_demographics.json (109 B)
          • speaker_b_convo_01119679.wav (20 MB)
          • speaker_b_convo_01119679_transcript.json (111 kB)
          • speaker_a_convo_01119679.wav (20 MB)
          • speaker_a_convo_01119679_transcript.json (77 kB)
          • speaker_b_convo_01119679_demographics.json (110 B)
        • 2ecf1db5
          • speaker_a_convo_2ecf1db5_transcript.json (161 kB)
          • speaker_b_convo_2ecf1db5_demographics.json (113 B)
          • speaker_b_convo_2ecf1db5.wav (25 MB)
          • speaker_b_convo_2ecf1db5_transcript.json (100 kB)
          • speaker_a_convo_2ecf1db5_demographics.json (112 B)
          • speaker_a_convo_2ecf1db5.wav (25 MB)
        • 81dd3246
          • speaker_a_convo_81dd3246_demographics.json (111 B)
          • speaker_a_convo_81dd3246.wav (30 MB)
          • speaker_b_convo_81dd3246_transcript.json (171 kB)
          • speaker_b_convo_81dd3246_demographics.json (111 B)
          • speaker_b_convo_81dd3246.wav (30 MB)
          • speaker_a_convo_81dd3246_transcript.json (164 kB)
        • 81b2b35e
          • speaker_b_convo_81b2b35e_demographics.json (114 B)
          • speaker_b_convo_81b2b35e_transcript.json (216 kB)
          • speaker_a_convo_81b2b35e.wav (48 MB)
          • speaker_a_convo_81b2b35e_demographics.json (114 B)
          • speaker_a_convo_81b2b35e_transcript.json (257 kB)
          • speaker_b_convo_81b2b35e.wav (48 MB)
        • 3ce6563e
          • speaker_a_convo_3ce6563e_transcript.json (345 kB)
          • speaker_a_convo_3ce6563e.wav (56 MB)
          • speaker_b_convo_3ce6563e_demographics.json (113 B)
          • speaker_b_convo_3ce6563e_transcript.json (166 kB)
          • speaker_b_convo_3ce6563e.wav (56 MB)
          • speaker_a_convo_3ce6563e_demographics.json (114 B)
        • 2c1b4416
          • speaker_a_convo_2c1b4416.wav (53 MB)
          • speaker_b_convo_2c1b4416_demographics.json (111 B)
          • speaker_b_convo_2c1b4416_transcript.json (301 kB)
          • speaker_a_convo_2c1b4416_demographics.json (110 B)
          • speaker_b_convo_2c1b4416.wav (51 MB)
          • speaker_a_convo_2c1b4416_transcript.json (100 kB)
        • 05b30647
          • speaker_b_convo_05b30647_demographics.json (112 B)
          • speaker_b_convo_05b30647_transcript.json (370 kB)
          • speaker_a_convo_05b30647_demographics.json (114 B)
          • speaker_b_convo_05b30647.wav (64 MB)
          • speaker_a_convo_05b30647_transcript.json (355 kB)
          • speaker_a_convo_05b30647.wav (64 MB)
        • ccd0f1a6
          • speaker_a_convo_ccd0f1a6_demographics.json (114 B)
          • speaker_b_convo_ccd0f1a6.wav (64 MB)
          • speaker_b_convo_ccd0f1a6_transcript.json (199 kB)
          • speaker_b_convo_ccd0f1a6_demographics.json (112 B)
          • speaker_a_convo_ccd0f1a6.wav (64 MB)
          • speaker_a_convo_ccd0f1a6_transcript.json (324 kB)
        • 2a139f9b
          • speaker_b_convo_2a139f9b_transcript.json (102 kB)
          • speaker_b_convo_2a139f9b.wav (42 MB)
          • speaker_a_convo_2a139f9b_transcript.json (148 kB)
          • speaker_a_convo_2a139f9b.wav (42 MB)
          • speaker_b_convo_2a139f9b_demographics.json (115 B)
          • speaker_a_convo_2a139f9b_demographics.json (112 B)
        • ebbc5293
          • speaker_a_convo_ebbc5293_transcript.json (222 kB)
          • speaker_b_convo_ebbc5293.wav (63 MB)
          • speaker_b_convo_ebbc5293_demographics.json (113 B)
          • speaker_a_convo_ebbc5293.wav (63 MB)
          • speaker_b_convo_ebbc5293_transcript.json (600 kB)
          • speaker_a_convo_ebbc5293_demographics.json (112 B)
        • 389f0bb5
          • speaker_a_convo_389f0bb5_demographics.json (113 B)
          • speaker_a_convo_389f0bb5.wav (39 MB)
          • speaker_a_convo_389f0bb5_transcript.json (160 kB)
          • speaker_b_convo_389f0bb5.wav (39 MB)
          • speaker_b_convo_389f0bb5_demographics.json (113 B)
          • speaker_b_convo_389f0bb5_transcript.json (254 kB)
        • 24c0c1b3
          • speaker_a_convo_24c0c1b3_demographics.json (111 B)
          • speaker_b_convo_24c0c1b3.wav (22 MB)
          • speaker_b_convo_24c0c1b3_transcript.json (77 kB)
          • speaker_a_convo_24c0c1b3.wav (22 MB)
          • speaker_a_convo_24c0c1b3_transcript.json (75 kB)
          • speaker_b_convo_24c0c1b3_demographics.json (105 B)
        • 8c25247b
          • speaker_a_convo_8c25247b_transcript.json (274 kB)
          • speaker_a_convo_8c25247b_demographics.json (113 B)
          • speaker_b_convo_8c25247b.wav (55 MB)
          • speaker_b_convo_8c25247b_transcript.json (284 kB)
          • speaker_a_convo_8c25247b.wav (55 MB)
          • speaker_b_convo_8c25247b_demographics.json (114 B)
        • 8af8f246
          • speaker_b_convo_8af8f246_demographics.json (106 B)
          • speaker_a_convo_8af8f246_transcript.json (401 kB)
          • speaker_a_convo_8af8f246_demographics.json (113 B)
          • speaker_b_convo_8af8f246.wav (61 MB)
          • speaker_b_convo_8af8f246_transcript.json (305 kB)
          • speaker_a_convo_8af8f246.wav (61 MB)
        • 2d219d50
          • speaker_a_convo_2d219d50_transcript.json (172 kB)
          • speaker_a_convo_2d219d50_demographics.json (111 B)
          • speaker_a_convo_2d219d50.wav (39 MB)
          • speaker_b_convo_2d219d50.wav (39 MB)
          • speaker_b_convo_2d219d50_transcript.json (199 kB)
          • speaker_b_convo_2d219d50_demographics.json (111 B)
        • bbc3f248
          • speaker_a_convo_bbc3f248_transcript.json (390 kB)
          • speaker_a_convo_bbc3f248.wav (63 MB)
          • speaker_b_convo_bbc3f248_demographics.json (113 B)
          • speaker_a_convo_bbc3f248_demographics.json (113 B)
          • speaker_b_convo_bbc3f248_transcript.json (320 kB)
          • speaker_b_convo_bbc3f248.wav (63 MB)
        • 826b4d3d
          • speaker_a_convo_826b4d3d.wav (59 MB)
          • speaker_b_convo_826b4d3d_transcript.json (474 kB)
          • speaker_b_convo_826b4d3d_demographics.json (113 B)
          • speaker_a_convo_826b4d3d_transcript.json (219 kB)
          • speaker_b_convo_826b4d3d.wav (59 MB)
          • speaker_a_convo_826b4d3d_demographics.json (114 B)
        • 0f2c315c
          • speaker_b_convo_0f2c315c_transcript.json (110 kB)
          • speaker_b_convo_0f2c315c_demographics.json (113 B)
          • speaker_b_convo_0f2c315c.wav (27 MB)
          • speaker_a_convo_0f2c315c_transcript.json (34 kB)
          • speaker_a_convo_0f2c315c_demographics.json (113 B)
          • speaker_a_convo_0f2c315c.wav (27 MB)
        • bcb44230
          • speaker_b_convo_bcb44230_transcript.json (245 kB)
          • speaker_b_convo_bcb44230.wav (56 MB)
          • speaker_a_convo_bcb44230_demographics.json (110 B)
          • speaker_a_convo_bcb44230_transcript.json (267 kB)
          • speaker_a_convo_bcb44230.wav (56 MB)
          • speaker_b_convo_bcb44230_demographics.json (113 B)
        • 44d73360
          • speaker_a_convo_44d73360_transcript.json (171 kB)
          • speaker_a_convo_44d73360.wav (27 MB)
          • speaker_b_convo_44d73360_demographics.json (113 B)
          • speaker_b_convo_44d73360_transcript.json (152 kB)
          • speaker_a_convo_44d73360_demographics.json (113 B)
          • speaker_b_convo_44d73360.wav (27 MB)
        • 997d4fe0
          • speaker_b_convo_997d4fe0.wav (50 MB)
          • speaker_a_convo_997d4fe0_demographics.json (113 B)
          • speaker_b_convo_997d4fe0_transcript.json (227 kB)
          • speaker_a_convo_997d4fe0.wav (50 MB)
          • speaker_a_convo_997d4fe0_transcript.json (274 kB)
          • speaker_b_convo_997d4fe0_demographics.json (114 B)
        • eda6925c
          • speaker_a_convo_eda6925c_transcript.json (355 kB)
          • speaker_a_convo_eda6925c.wav (55 MB)
          • speaker_b_convo_eda6925c_demographics.json (110 B)
          • speaker_a_convo_eda6925c_demographics.json (111 B)
          • speaker_b_convo_eda6925c_transcript.json (105 kB)
          • speaker_b_convo_eda6925c.wav (55 MB)
        • 8edc23bf
          • speaker_b_convo_8edc23bf.wav (55 MB)
          • speaker_a_convo_8edc23bf_demographics.json (113 B)
          • speaker_a_convo_8edc23bf_transcript.json (259 kB)
          • speaker_a_convo_8edc23bf.wav (55 MB)
          • speaker_b_convo_8edc23bf_transcript.json (215 kB)
          • speaker_b_convo_8edc23bf_demographics.json (114 B)
        • 5f55950e
          • speaker_a_convo_5f55950e_transcript.json (118 kB)
          • speaker_a_convo_5f55950e.wav (18 MB)
          • speaker_b_convo_5f55950e_demographics.json (112 B)
          • speaker_a_convo_5f55950e_demographics.json (112 B)
          • speaker_b_convo_5f55950e_transcript.json (22 kB)
          • speaker_b_convo_5f55950e.wav (18 MB)
        • c3a7fbe9
          • speaker_a_convo_c3a7fbe9_demographics.json (113 B)
          • speaker_b_convo_c3a7fbe9_transcript.json (203 kB)
          • speaker_a_convo_c3a7fbe9.wav (38 MB)
          • speaker_a_convo_c3a7fbe9_transcript.json (143 kB)
          • speaker_b_convo_c3a7fbe9_demographics.json (113 B)
          • speaker_b_convo_c3a7fbe9.wav (38 MB)
        • 2284fd64
          • speaker_b_convo_2284fd64_demographics.json (109 B)
          • speaker_a_convo_2284fd64_transcript.json (67 kB)
          • speaker_a_convo_2284fd64_demographics.json (109 B)
          • speaker_b_convo_2284fd64.wav (17 MB)
          • speaker_b_convo_2284fd64_transcript.json (168 kB)
          • speaker_a_convo_2284fd64.wav (18 MB)
        • 1939f519
          • speaker_a_convo_1939f519_transcript.json (122 kB)
          • speaker_a_convo_1939f519.wav (38 MB)
          • speaker_b_convo_1939f519_demographics.json (113 B)
          • speaker_b_convo_1939f519_transcript.json (208 kB)
          • speaker_b_convo_1939f519.wav (38 MB)
          • speaker_a_convo_1939f519_demographics.json (113 B)
        • 198f2863
          • speaker_a_convo_198f2863_transcript.json (201 kB)
          • speaker_a_convo_198f2863_demographics.json (113 B)
          • speaker_a_convo_198f2863.wav (42 MB)
          • speaker_b_convo_198f2863_transcript.json (368 kB)
          • speaker_b_convo_198f2863_demographics.json (110 B)
          • speaker_b_convo_198f2863.wav (42 MB)
        • 54ddefa8
          • speaker_a_convo_54ddefa8_demographics.json (113 B)
          • speaker_b_convo_54ddefa8_transcript.json (107 kB)
          • speaker_b_convo_54ddefa8.wav (27 MB)
          • speaker_a_convo_54ddefa8_transcript.json (187 kB)
          • speaker_b_convo_54ddefa8_demographics.json (113 B)
          • speaker_a_convo_54ddefa8.wav (27 MB)
        • ad46e29b
          • speaker_a_convo_ad46e29b.wav (43 MB)
          • speaker_b_convo_ad46e29b_transcript.json (199 kB)
          • speaker_b_convo_ad46e29b_demographics.json (111 B)
          • speaker_a_convo_ad46e29b_demographics.json (113 B)
          • speaker_a_convo_ad46e29b_transcript.json (296 kB)
          • speaker_b_convo_ad46e29b.wav (43 MB)
        • 2f1655ff
          • speaker_a_convo_2f1655ff_demographics.json (109 B)
          • speaker_b_convo_2f1655ff_transcript.json (62 kB)
          • speaker_b_convo_2f1655ff.wav (17 MB)
          • speaker_a_convo_2f1655ff_transcript.json (43 kB)
          • speaker_a_convo_2f1655ff.wav (17 MB)
          • speaker_b_convo_2f1655ff_demographics.json (112 B)
        • 2a07b3a7
          • speaker_a_convo_2a07b3a7_transcript.json (84 kB)
          • speaker_a_convo_2a07b3a7_demographics.json (112 B)
          • speaker_b_convo_2a07b3a7.wav (30 MB)
          • speaker_a_convo_2a07b3a7.wav (30 MB)
          • speaker_b_convo_2a07b3a7_transcript.json (61 kB)
          • speaker_b_convo_2a07b3a7_demographics.json (112 B)
      • half_conversations
        • e1e7765a
          • speaker_a_convo_e1e7765a.wav (23 MB)
          • speaker_a_convo_e1e7765a_transcript.json (139 kB)
          • speaker_a_convo_e1e7765a_demographics.json (112 B)
        • b6ad9f96
          • speaker_a_convo_b6ad9f96.wav (34 MB)
          • speaker_a_convo_b6ad9f96_transcript.json (83 kB)
          • speaker_a_convo_b6ad9f96_demographics.json (113 B)
        • f123a375
          • speaker_a_convo_f123a375_demographics.json (109 B)
          • speaker_a_convo_f123a375_transcript.json (66 kB)
          • speaker_a_convo_f123a375.wav (18 MB)
        • e25bc38d
          • speaker_b_convo_e25bc38d.wav (63 MB)
          • speaker_b_convo_e25bc38d_demographics.json (110 B)
          • speaker_b_convo_e25bc38d_transcript.json (330 kB)
        • caa6301e
          • speaker_a_convo_caa6301e_transcript.json (230 kB)
          • speaker_a_convo_caa6301e_demographics.json (110 B)
          • speaker_a_convo_caa6301e.wav (58 MB)
        • 3ac74ae1
          • speaker_a_convo_3ac74ae1.wav (59 MB)
          • speaker_a_convo_3ac74ae1_demographics.json (110 B)
          • speaker_a_convo_3ac74ae1_transcript.json (269 kB)
    • docs
      • manual_transcripts.json (1 MB)
      • spjallromur_README.txt (7 kB)
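
The listing above follows a regular naming scheme: each conversation directory
is named by an eight-character hexadecimal ID, and each speaker (a or b) has a
.wav file, a transcript JSON file, and a demographics JSON file. A minimal
sketch of indexing an extracted copy of the archive by that scheme follows. The
function names, the "audio" label, and the shape of the returned dictionary are
illustrative choices; the contents of the JSON files are not shown on this
page, so the sketch only collects paths and does not parse them.

```python
import re
from pathlib import Path

# File name pattern inferred from the listing on this page, e.g.
# speaker_a_convo_dc0967ee.wav or speaker_b_convo_dc0967ee_transcript.json
FILE_PATTERN = re.compile(
    r"^speaker_(?P<speaker>[ab])_convo_(?P<convo>[0-9a-f]{8})"
    r"(?:_(?P<kind>transcript|demographics)\.json|\.wav)$"
)


def index_corpus(root):
    """Map part -> conversation ID -> speaker -> file kind -> path.

    `root` is assumed to be the extracted top-level `spjallromur` directory.
    """
    index = {}
    data_dir = Path(root) / "data"
    for part in ("full_conversations", "half_conversations"):
        part_index = index.setdefault(part, {})
        part_dir = data_dir / part
        if not part_dir.is_dir():
            continue
        for path in sorted(part_dir.glob("*/*")):
            match = FILE_PATTERN.match(path.name)
            if match is None:
                continue  # skip anything that does not follow the scheme
            kind = match["kind"] or "audio"  # the .wav file has no suffix
            speakers = part_index.setdefault(match["convo"], {})
            speakers.setdefault(match["speaker"], {})[kind] = path
    return index


def count_speakers(index):
    """Count speaker slots, treating each (conversation, speaker) pair as one."""
    return sum(
        len(speakers)
        for part in index.values()
        for speakers in part.values()
    )
```

Calling `index_corpus("spjallromur")` on an extracted copy would give, for each
conversation, the audio, transcript, and demographics paths per speaker, which
is a convenient starting point for pairing audio with transcripts. Note that
`count_speakers` counts each speaker once per conversation; whether the same
person appears in more than one conversation cannot be told from file names.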