with 90 per cent confidence. Maybe one of the original lines said 'I would like an apple and a pear' and
another said 'I would like an apple and a banana'. Since the pear and banana are only mentioned once
each in the translations, the computer knows to leave them out. The computer doesn't have to
understand anything about language to do this. Given enough sample text, the computer could translate
Shakespeare into Dothraki, the language invented by the makers of Game of Thrones.
Google Translate was developed under the leadership of computer scientist Franz Och and originally
used documents created by the United Nations. The UN has a need to create a lot of written material
accurately in a number of languages, and all of this material is freely available, which made it a good
source to use. (We should point out that Google did not invent statistical translation.)
2.1 Try the exercises suggested in the case - how did Google Translate do?
[5]
2.2 Could this technology be used by a global firm or is the need for absolute accuracy so large that a
human translator would always be needed?
[5]
2.3 What are some of the problems with using Google Translate to translate a web page?
[5]
2.4. What would the computer need in order to translate Shakespeare into Dothraki? [5]/nQuestion Two
Translating Shakespeare into Dothraki
[20 Marks]
Many people using Google Translate for the first time are amazed by its abilities. If you haven't tried it,
visit your local site (it's https://translate.google.co.uk in the UK), enter some text, choose a language and
click translate. If you actually speak the language you translated into, how did it do? Does the translation
make sense and does it say what you meant it to say? If you don't speak that language, copy the
translated text and translate it back into the first language - it probably won't be exactly the same, but
does it still mean what you said in the first place? Try translating the text through several other languages
- English to German to French to Chinese and then back to English - what does it say now? This game
is actually popular fodder for YouTube videos - YouTubers translate the lyrics of a popular song (usually
a song in English) into another language, then back into English, and record themselves singing exactly
what they ended up with. Clearly, however, this technology has more serious applications.
So how does it work? It relies on the one thing that Google is enormously good at - processing lots and
lots of data, very very quickly. Sorry to disappoint you if you thought the computer could actually
understand what you are saying, but in fact it's just doing some statistics. There are many books that
have been translated by a human from one language into another, say Victor Hugo's Notre Dame de
Paris from French to English. To translate a French phrase, all the computer has to do is find the same
phrase in one of these books, then find the corresponding page and line in the English translation and
return that to the user. But how does the computer know that the page and line are the exact bits that
translate the exact phrase the user wrote? That's where the statistics and data processing come in. The
computer doesn't just search for the phrase in one book, it looks in say 100 books (the actual figure
Google uses is difficult to track down) and finds 100 possible translations. It then chooses the phrase
that appears the most in the sample of 100.
For instance, if the user writes 'Je voudrais une pomme pour mon petit déjeuner avec mamie', this line
might not appear in any book, but the phrase 'Je voudrais une pomme' probably does. If the computer
finds 100 books where this line appears, and in the translated book at around about the right point in the
Fig: 1
Fig: 2