Corpora Processing and Computational Scaffolding for a Web-based English Learning Environment
The CANDLE Project
DOI:
https://doi.org/10.1558/cj.v24i1.77-95Keywords:
Bilingual Concordancer, Chinese-English Parallel Corpora, Computational Scaffolding, Natural Language ProcessingAbstract
This paper describes the development of an innovative web-based environment for English language learning with advanced data-driven and statistical approaches. The project uses various corpora, including a Chinese-English parallel corpus (Sinorama) and various natural language processing (NLP) tools to construct effective English learning tasks for college learners with adaptive computational scaffolding. It integrates the expertise of a group of researchers in four areas: (a) advances in NLP technologies and applications, (b) construction of a self-access reading environment, (c) exploration of English language learning through written exercises and translations, and (d) use of bilingual corpora for culture-based English learning. In this paper, the conceptualization of the system and its various reference tools (e.g., a bilingual concordancer) for English learning and pilot testing on various modules (e.g., a reading module, Text Grader, and Collocation Practice) are reported.
References
Braun, S. (2005). From pedagogically relevant corpora to authentic language learning contents. ReCall, 17 (1), 47-64.
Chan, T. W., Hue, C. W., Chou, C. Y., & Tzeng, O. J. L. (2001). Four spaces of network learning models. Computers and Education, 37 (2), 141-161.
Chan, T. P., & Liou, H. C. (2005). Effects of online concordancing instruction on EFL students’ learning of verb-noun collocations. Computer Assisted Language Learning, 18 (3), 231-251.
Ellis, R. (1998). SLA research and language teaching. Oxford: Oxford University Press.
Gibbons, P. (2002). Scaffolding language, scaffolding learning: Teaching second language learners in the mainstream classroom. Portsmouth, NH: Henemann.
Ghadirian, S. (2003). Providing controlled exposure to target vocabulary through the screening and arranging of texts. Language Learning & Technology, 6 (1), 147164. Retrieved April 24, 2006, from http://llt.msu.edu/vol6num1/GHADIRIAN/ default.html
Hunston, S. (2002). Corpora in applied linguistics. Cambridge: Cambridge University Press.
Hwu, F. (2003). Learners’ behaviors in computer-based input activities elicited through tracking technologies. Computer Assisted Language Learning, 16 (1), 5-29.
Johns, T. (1988). Whence and whither classroom concordancing. In T. Bongaerts, P. De Haan, S. Lobbe, & H. Wekker (Eds.), Computer applications in language learning (pp. 9-27). Dordrecht, The Netherlands: Foris.
Kennedy, C., & Miceli, T. (2001). An evaluation of intermediate students’ approaches to corpus investigation. Language Learning & Technology, 5 (3), 77-90. Retrieved April 24, 2006, from http://llt.msu.edu/vol5num3/kennedy
Krishnamurthy, R. (2001). Language corpora: How can teachers and students use these valuable new resources? In Selected Papers from the 10th International Symposium on English Teaching (pp. 59-65). Taipei: Crane.
Laufer, B. (1989). What percentage of lexis is essential for comprehension? In C. Lauren & M. Nordman (Eds.), Special language: From humans thinking to thinking machine (pp. 69-75). Clevedon, England: Multilingual Matters.
Leech, G. (1997). Teaching and language corpora: A convergence. In A. Wichmann, S. Flingelstone, T. McEnery, & G. Knowles (Eds.), Teaching and language corpora (pp. 1-23). London: Longman.
Liou, H. C. & Huang, H. T. (2004, June). Effects of graded texts on EFL college students’ incidental vocabulary learning: Issues of exposure amount and acquisition of productive and receptive vocabulary. Paper presented at CALICO 2004 Symposium, Pittsburgh, PA.
Liu, L. E. (2002). A corpus-based lexical semantic investigation of verb-noun miscollocations in Taiwan learners’ English. Unpublished master’s thesis, Tamkang University, Taipei.
Nation, P. (2001). Learning vocabulary in another language. New York: Cambridge University Press.
Nattinger, J. R., & DeCarrico, J. D. (1992). Lexical phrase and language teaching. Oxford: Oxford University Press.
Savignon, S. & Sysoyev, P. (2002). Sociocultural strategies for a dialog of culture. The Modern Language Journal, 86 (4), 508-524.
Nesselhauf, N. (2003). The use of collocations by advanced learners of English and some implications for teaching. Applied Linguistics, 24 (2), 223-242.
Stevens, V. (1991). Classroom concordancing: Vocabulary materials derived from relevant, authentic text. English for Specific Purposes, 10 (1), 10-15.
Teubert, W. (1996). Why corpus linguistics? International Journal of Corpus Linguistics, 1 (1), iii-x.
Wang, L. (2001). Exploring parallel concordancing in English and Chinese. Language Learning & Technology, 5 (3), 174-184. Retrieved April 24, 2006, from http://llt. msu.edu/vol5num3/wang/default.html
Wible, D., Kuo, C.-H., Chien, F.-Y., Liu, A., & Tsao, N.-L. (2001). A web-based EFL writing environment: Intelligent information for learners, teachers, and researchers. Computers and Education, 37 (3-4), 297-315.
Wichmann, A., Fligelstone, S., McEnery, T., & Knowles, G. (Eds.). (1997) Teaching and language corpora. London: Longman.
Wood, D., Bruner, J., & Ross, G. (1976). The role of tutoring in problem solving. Journal of Child Psychology and Psychiatry, 17 (2), 89-100.
Wu, J. C., Yeh, K. C., Chuang, T. C., Shei, W. C., & Chang, J. S. (2003, July). TOTALrecall: A bilingual concordance for computer assisted translation and language learning. In Proceedings of the 41st Association of Computational Linguistics Conference, Sappora, Japan.