MSPKA corpus


Multi-SPeaKing-style Articulatory corpus


Description

MSPKA is an Italian corpus of simultaneous recordings of continous speech and trajectories of important speech articulators (i.e. tongue, lips, incisors) tracked by Electromagnetic Articulography in different speaking styles (e.g. read speech, hyperarticulated speech, hypoarticulated speech). If you use this corpus please reference [1].

Authors

Claudia Canevari (claudia.canevari@gmail.com), Leonardo Badino (leonardo.badino@iit.it), Luciano Fadiga (luciano.fadiga@iit.it)

References

[1] Canevari C., Badino L., Fadiga L., "A new Italian dataset of parallel acoustic and articulatory data", In Proc. of InterSpeech, Dresden, Germany, 2015.


Session 1

Session 1 includes more than 500 sentences uttered by three different speakers, one male (cnz) and two females (lls, olm), in citation condition for approximately 2 hours of speech materials.

Voice Version Date Size Down. Counter
cnz 1.0.0 March 2015 245 MB Download 0
lls 1.0.0 March 2015 217 MB Download 0
olm 1.0.0 March 2015 206 MB Download 0

Session 2

Session 2 includes more than 500 sentences uttered by three different speakers, one male (cnz) and two females (lls, olm), over a continuum of ten descending articulation degrees, from hyperarticulated to hypoarticulated forms of speech, for approximately 3 hours of speech materials.

Voice Version Date Size Down. Counter
cnz 1.0.0 -- 226 MB Download 0
lls 1.0.0 -- 220 MB Download 0
olm 1.0.0 -- 211 MB Download 0