Using Statistical Techniques and Web Search to Correct ESL Errors

doi:10.1558/cj.v26i3.491-511

CALICO Journal, Vol 26, No 3 (2009)

Michael Gamon, Claudia Leacock, Chris Brockett, William B. Dolan, Jianfeng Gao, Dmitriy Belenko, Alexandre Klementiev

Issued Date: 7 Aug 2014

Abstract

In this paper we present a system for automatic correction of errors made by learners of English. The system has two novel aspects. First, machine-learned classifiers trained on large amounts of native data are combined with a very large language model to optimize the precision of suggested corrections. Second, the user can access real-life web examples of both their original formulation and the suggested correction. We discuss technical details of the system, including the choice of classifier, feature sets, and language model. We also present results from an evaluation of the system on a set of corpora. We perform an automatic evaluation on native English data and a detailed manual analysis of performance on three corpora of nonnative writing: the Chinese Learner English Corpus (CLEC) and two corpora of web and email writing.
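
The abstract does not spell out how the classifiers and the language model are combined, so the following is only a minimal sketch of one plausible arrangement: a suggested correction is kept only if a classifier is confident enough and a language model also scores the corrected sentence higher than the original. Everything in it is an assumption made for illustration, including the toy corpus, the add-one smoothed bigram model standing in for the very large language model, the function names, and the two thresholds `min_prob` and `min_gain`; the classifier itself is not modeled and its probability is passed in as a plain number.

```python
import math
from collections import Counter

# Toy "native" corpus (hypothetical); a stand-in for large native data.
CORPUS = [
    "i am interested in music",
    "she is interested in art",
    "he is good at math",
    "they are interested in science",
]

def train_bigram_lm(sentences):
    """Count unigrams and bigrams, with sentence boundary markers."""
    unigrams, bigrams = Counter(), Counter()
    for s in sentences:
        tokens = ["<s>"] + s.split() + ["</s>"]
        unigrams.update(tokens[:-1])                  # history counts
        bigrams.update(zip(tokens[:-1], tokens[1:]))  # adjacent pairs
    vocab = {t for s in sentences for t in s.split()} | {"<s>", "</s>"}
    return unigrams, bigrams, len(vocab)

def log_prob(sentence, lm):
    """Add-one smoothed bigram log-probability of a whitespace-tokenized sentence."""
    unigrams, bigrams, vocab_size = lm
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    return sum(
        math.log((bigrams[(a, b)] + 1) / (unigrams[a] + vocab_size))
        for a, b in zip(tokens[:-1], tokens[1:])
    )

def accept_correction(original, corrected, classifier_prob, lm,
                      min_prob=0.8, min_gain=0.5):
    """Keep a suggestion only if both signals agree (assumed thresholds)."""
    if classifier_prob < min_prob:
        return False  # classifier not confident enough
    gain = log_prob(corrected, lm) - log_prob(original, lm)
    return gain >= min_gain  # language model must prefer the correction

LM = train_bigram_lm(CORPUS)
suggestion_kept = accept_correction(
    "i am interested on music", "i am interested in music", 0.9, LM
)
```

Requiring agreement from both signals trades recall for precision: a suggestion that either signal rejects is suppressed, which is in the spirit of the abstract's stated goal of optimizing the precision of suggested corrections.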

Equinox Publishing Ltd - 415 The Workstation 15 Paternoster Row, Sheffield, S1 2BX United Kingdom