The dark side of Google Translate
Posted by Stephen Whiteley
On 02/05/2011
Tags: dotcom Willy Wonka, Google Translate, privacy
The inner workings of Google Translate are shrouded in a characteristic mix of Willy Wonka bonhomie and mildly Orwellian obfuscation. Understandably, Google are (is?) keen to keep the details of their revolutionary approach to machine translation under wraps, and share only enough information to give a vague idea of what they are up to. This video is representative of their approach: fun, cute and colourful, it is only after it has finished that you realise you haven’t actually learned that much. The key concept here is, of course, ‘statistical machine translation’.
One of the most widely used machine translation tools is SYSTRAN, which was developed during the Cold War and still powers Yahoo’s Babel Fish, amongst others. SYSTRAN follows a rule-based logic, which is theoretically similar to the way that a human learns a language: the program is loaded with basic linguistic rules and vocabularies, which it then applies to translate texts.
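To make the rule-based idea concrete, here is a deliberately tiny sketch in Python. It is not SYSTRAN's method or code: the `LEXICON` table, the part-of-speech labels and the single reordering rule are all invented for illustration, and it assumes an English-to-Spanish direction with gender agreement simply hard-coded into the vocabulary.

```python
# Toy rule-based translator: a hand-written vocabulary plus one grammar rule.
# Everything here (LEXICON, tags, the rule) is a hypothetical illustration.

# Each English word maps to (Spanish word, part-of-speech tag).
LEXICON = {
    "the": ("la", "DET"),
    "white": ("blanca", "ADJ"),
    "house": ("casa", "NOUN"),
    "is": ("es", "VERB"),
    "big": ("grande", "ADJ"),
}


def translate_rule_based(sentence):
    """Translate word by word, then apply one reordering rule."""
    tokens = sentence.lower().split()
    # Unknown words are passed through unchanged with an "UNK" tag.
    tagged = [LEXICON.get(tok, (tok, "UNK")) for tok in tokens]

    output = []
    i = 0
    while i < len(tagged):
        word, tag = tagged[i]
        next_is_noun = i + 1 < len(tagged) and tagged[i + 1][1] == "NOUN"
        if tag == "ADJ" and next_is_noun:
            # Rule: English "adjective noun" becomes Spanish "noun adjective".
            output.extend([tagged[i + 1][0], word])
            i += 2
        else:
            output.append(word)
            i += 1
    return " ".join(output)


print(translate_rule_based("The white house is big"))
```

Every new word and every new grammatical pattern has to be added by hand, which is the "lot of work by linguists" that rule-based systems are known for.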
When Google moved away from SYSTRAN in 2007, Google research scientist Franz Och explained: “Most state-of-the-art commercial machine translation systems in use today have been developed using a rules-based approach and require a lot of work by linguists to define vocabularies and grammars. Several research systems, including ours, take a different approach: we feed the computer with billions of words of text, both monolingual text in the target language, and aligned text consisting of examples of human translations between the languages. We then apply statistical learning techniques to build a translation model.” Or, as the video puts it: “Once the computer finds a pattern, it can use this pattern to translate similar texts in the future. When you repeat this process billions of times you end up with billions of patterns and one very smart computer program.”
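What "finding patterns in aligned text" can mean is easiest to see on a toy scale. The sketch below is not Google's system, whose details are not public. It swaps in IBM Model 1, a classic textbook technique for learning word translation probabilities from sentence pairs. The three-sentence `CORPUS` and the function names are made up for illustration.

```python
from collections import defaultdict

# Hypothetical aligned corpus: (English sentence, German translation).
CORPUS = [
    ("the house".split(), "das haus".split()),
    ("the book".split(), "das buch".split()),
    ("a book".split(), "ein buch".split()),
]


def train_ibm_model1(pairs, iterations=20):
    """Estimate t[(target_word, source_word)] with expectation-maximisation."""
    # Start with the same score for every word pair that shares a sentence pair.
    t = {}
    for src, tgt in pairs:
        for tw in tgt:
            for sw in src:
                t[(tw, sw)] = 1.0

    for _ in range(iterations):
        count = defaultdict(float)  # expected co-translation counts
        total = defaultdict(float)  # expected counts per source word
        for src, tgt in pairs:
            for tw in tgt:
                # Share one unit of "credit" for tw among the source words,
                # in proportion to the current estimates.
                norm = sum(t[(tw, sw)] for sw in src)
                for sw in src:
                    frac = t[(tw, sw)] / norm
                    count[(tw, sw)] += frac
                    total[sw] += frac
        # Re-estimate the probabilities from the expected counts.
        t = {(tw, sw): c / total[sw] for (tw, sw), c in count.items()}
    return t


def best_translation(t, source_word):
    """Return the target word with the highest probability for source_word."""
    candidates = {tw: p for (tw, sw), p in t.items() if sw == source_word}
    return max(candidates, key=candidates.get)


model = train_ibm_model1(CORPUS)
for word in ["the", "a", "house", "book"]:
    print(word, "->", best_translation(model, word))
```

Nobody tells the program that "house" means "haus". It works that out because "das" also turns up alongside "book", so "haus" is the better explanation for "house". Scale the three sentences up to billions of words and you have the gist of the statistical approach.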
As with N-Grams, it is the staggering volume of data at Google’s disposal which grabs the attention. A significant step forward for Google Translate was the integration of reams of UN documentation, a ready-made six-language corpus. Unlike with N-Grams, however, the source of the rest of the data it uses remains worryingly unclear. Here are two extracts from Google’s Terms of Service:
“…By submitting, posting or displaying the content you give Google a perpetual, irrevocable, worldwide, royalty-free, and non-exclusive license to reproduce, adapt, modify, translate, publish, publicly perform, publicly display and distribute any Content which you submit, post or display on or through, the Services.”
“…You agree that this license includes a right for Google to make such Content available to other companies, organizations or individuals with whom Google has relationships for the provision of syndicated services, and to use such Content in connection with the provision of those services.”
It is hard to establish what the implications of all this are, not least because Google won’t divulge what use it makes of “a perpetual, irrevocable” right to do whatever it wants with anything you do online. What is clear, however, is that Google is much more than a sort of dotcom Willy Wonka.
@2015 QuickSilver Translate - by redywebs