You are here
The overarching question underlying most of Hertz's research is what role universal principles of human speech perception play in the organization of speech/sound patterns in language. She is currently developing a theory of how listeners extract phonological structure and other information, such as speaker identity, from the continuous speech signal. Hertz is testing various hypotheses related to this theory through perceptual experiments involving utterances that contain mixes of natural and synthetic speech segments. More information about her research as well as its applications to speech synthesis and other areas of speech processing can be found at www.novaspeech.com.
- Speech synthesis
- phonetics-phonology interface
- Hertz, S. R. and Goldhor, R (2004). When can segments serve as surrogates? From Sound to Sense: 50+ Years of Discovery in Speech Communications, MIT, Cambridge.
- Hertz, S. R., Spencer, I.C., Church, T.F., and Goldhor, R. (2004). Perceptual consequences of nasal surrogates in English: implications for speech synthesis, Proc. 147th Acoust. Soc. Amer.
- Hertz, S. R. (2002) Integration of rule-based formant synthesis and waveform concatenation: a hybrid approach to text-to-speech synthesis, Proceedings IEEE 2002 Workshop On Speech Synthesis.
- Hertz, S. R., Younes, R. J., and Zinovieva, N. (1999) Language-universal and language-specific components in the multi-language ETI-Eloquence text-to-speech system. Proc. 14th ICPhS 2283-2286.
- Clements, G.N., Hertz, S.R. (1996) An integrated approach to phonology and phonetics, in J. Durand and B. Laks (eds.), Current Trends in Phonology: Models and Methods, CNRS, Paris X and Univ. of Salford Pub.
- Hertz, S. R. and Huffman, M. K. (1992) A nucleus-based timing model applied to multi-dialect speech synthesis by rule, Proc. International Conference on Spoken Language Processing 2, 1171-1174.