VILTS: A Tale of Two Technologies

Authors

  • Marikka Elizabeth Rypa
  • Patti Price

DOI:

https://doi.org/10.1558/cj.v16i3.385-404

Keywords:

Voice Interactive Training System (VILTS), Natural Language Processing, Speech Recognition, Evaluation

Abstract

The Voice Interactive Training System (VILTS) is a language-training prototype developed to help improve comprehension and speaking skills. The system incorporates two related technologies: speech recognition and pronunciation scoring. Speech recognition allows students to navigate through units by using oral communication skills. Pronunciation scoring, validated through correlation with expert raters, provides assessment of speaking skills. We discuss the motivation for the program, the interdisciplinary efforts involved, and the resulting system architecture. We also describe challenges and trade-offs in designing activities using unscripted material and in integrating new speech technology. Finally, we discuss system evaluation and opportunities for future directions.

Published

2013-01-14

Section

Articles

How to Cite

Rypa, M. E., & Price, P. (2013). VILTS: A Tale of Two Technologies. CALICO Journal, 16(3), 385-404. https://doi.org/10.1558/cj.v16i3.385-404
