Examining the Quality of Machine Translation
The first homework assignment does not involve any programming. Instead, you will take a closer look at the quality of today’s machine translation systems.
Translate a Web Page with Google Translate
- Pick a foreign language (preferable one that you have some understanding of - or an easy one like French or Spanish)
- Find a news site that publishes news stories in that language. You can also look at Wikipedia articles, if you prefer that.
- Pick a web page to translate and enter it with Google Translate
Analyze the Translations
Write a report about the quality of the machine translation.
Go over at least 20 sentences, manually correct each sentence, and report for each sentence:
- the source sentence
- the machine translation
- a correction of the machine translation
- an assessment of the error in the machine translation
You may do step 4 in any way you want. For instance, you could classify errors as “reordering errors”, “word sense error for a noun”, or any other type of error you can think of.
- Erst drei Tage ist der neue Ministerpräsident Griechenlands im Amt.
- Only three days is the new Prime Minister of Greece in office.
- The new Prime Minister of Greece has been in office for only three days.
- (1) Verb tense is wrong: is instead of has been. (2) Preposition for was missing in front of the time phrase only three days. (3) While the noun phrases and preprositional phrases are correct, the overall sentence structure on the clause level is scambled.
Conclude your report with a summary of your impression of the major quality problems in the machine translation system that you analysed.
What to Hand in
Write a written report titled report.tex that compiles without errors using XeLaTeX. Use the article class, and ensure that your report has a title and lists your name as the author.
Commit and push that file to your HW1 repo on Github prior to 11:59 PM on Monday 07 September 2020.