Multi-SPeaKing-style Articulatory corpus

Description

MSPKA is an Italian corpus of simultaneous recordings of continous speech and trajectories of important speech articulators (i.e. tongue, lips, incisors) tracked by Electromagnetic Articulography in different speaking styles (e.g. read speech, hyperarticulated speech, hypoarticulated speech). If you use this corpus please reference [1].

Authors

Claudia Canevari (claudia.canevari@gmail.com), Leonardo Badino (leonardo.badino@iit.it), Luciano Fadiga (luciano.fadiga@iit.it)

References

[1] Canevari C., Badino L., Fadiga L., "A new Italian dataset of parallel acoustic and articulatory data", In Proc. of InterSpeech, Dresden, Germany, 2015.

Session 1

Session 1 includes more than 500 sentences uttered by three different speakers, one male (cnz) and two females (lls, olm), in citation condition for approximately 2 hours of speech materials.

Voice	Version	Date	Size
cnz	1.0.0	March 2015	245 MB	Download
lls	1.0.0	March 2015	217 MB	Download
olm	1.0.0	March 2015	206 MB	Download

Session 2

Session 2 includes more than 500 sentences uttered by three different speakers, one male (cnz) and two females (lls, olm), over a continuum of ten descending articulation degrees, from hyperarticulated to hypoarticulated forms of speech, for approximately 3 hours of speech materials.

Voice	Version	Date	Size
cnz	1.0.0	--	226 MB	Download
lls	1.0.0	--	220 MB	Download
olm	1.0.0	--	211 MB	Download

MSPKA corpus