Download | - View accepted manuscript: Feature space selection and combination for native language identification (PDF, 513 KiB)
|
---|
Author | Search for: Goutte, Cyril1; Search for: Léger, Serge1; Search for: Carpuat, Marine1 |
---|
Affiliation | - National Research Council of Canada. Information and Communication Technologies
|
---|
Format | Text, Article |
---|
Conference | 8th Workshop on Innovative Use of NLP for Building Educational Applications (BEA8), June 13, 2013, Atlanta, GA |
---|
Abstract | We decribe the submissions made by the National Research Council Canada to the Native Language Identification (NLI) shared task. Our submissions rely on a Support Vector Machine classifier, various feature spaces using a variety of lexical, spelling, and syntactic features, and on a simple model combination strategy relying on a majority vote between classifiers. Somewhat surprisingly, a classifier relying on purely lexical features performed very well and proved difficult to outperform significantly using various combinations of feature spaces. However, the combination of multiple predictors allowed to exploit their different strengths and provided a significant boost in performance. |
---|
Publication date | 2013-08-01 |
---|
In | |
---|
Language | English |
---|
Peer reviewed | Yes |
---|
NPARC number | 21270977 |
---|
Export citation | Export as RIS |
---|
Report a correction | Report a correction (opens in a new tab) |
---|
Record identifier | 68cbd15c-c2f6-45b1-8017-ded569f2e8e5 |
---|
Record created | 2014-02-20 |
---|
Record modified | 2020-06-04 |
---|