Download | - View accepted manuscript: Statistical phrase-based post-editing (PDF, 271 KiB)
|
---|
Author | Search for: Simard, Michel1; Search for: Goutte, Cyril1; Search for: Isabelle, Pierre1 |
---|
Affiliation | - National Research Council of Canada. NRC Institute for Information Technology
|
---|
Format | Text, Article |
---|
Conference | Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007), April 22-27, 2007, Rochester, New York, USA |
---|
Abstract | We propose to use a statistical phrase-based machine translation system in a post-editing task: the system takes as input raw machine translation output (from a commercial rule-based MT system), and produces post-edited target-language text. We report on experiments that were performed on data collected in precisely such a setting: pairs of raw MT output and their manually post-edited versions. In our evaluation, the output of our automatic post-editing (APE) system is not only better quality than the rule-based MT (both in terms of the BLEU and TER metrics), it is also better than the output of a state-of-the-art phrase-based MT System used in standalone translation mode. These results indicate that automatic post-editing constitutes a simple and efficient way of combining rule-based and statistical MT technologies. |
---|
Publication date | 2007 |
---|
In | |
---|
Language | English |
---|
NRC number | NRCC 49288 |
---|
NPARC number | 5764892 |
---|
Export citation | Export as RIS |
---|
Report a correction | Report a correction (opens in a new tab) |
---|
Record identifier | aaa8c0b3-9d7a-4db3-9b1d-c3deab67d09f |
---|
Record created | 2009-03-29 |
---|
Record modified | 2020-08-12 |
---|