Google's Computing Power Refines Translation 142
gollum123 sends an excerpt from the NY Times on how Google has taken a lead in language translation, in one of the company's few unqualified successes as it attempts to broaden its offerings beyond search. "...Google's quick rise to the top echelons of the translation business is a reminder of what can happen when Google unleashes its brute-force computing power on complex problems. The network of data centers that it built for Web searches may now be, when lashed together, the world's largest computer. Google is using that machine to push the limits on translation technology. Last month, for example, it said it was working to combine its translation tool with image analysis, allowing a person to, say, take a cellphone photo of a menu in German and get an instant English translation. ...in the mid-1990s, researchers began favoring a so-called statistical approach. They found that if they fed the computer thousands or millions of passages and their human-generated translations, it could learn to make accurate guesses about how to translate new texts. It turns out that this technique, which requires huge amounts of data and lots of computing horsepower, is right up Google's alley. ...Google's service is good enough to convey the essence of a news article, and it has become a quick source for translations for millions of people."
Converting that article from English to Chinese to (Score:5, Interesting)
English, with Google Translate:
--- ... In the mid-90s, researchers began to favor a so-called statistical methods. They found that if they ate the computer or hundreds of thousands of millions of paragraphs and the translation of humans, it can learn how to make an accurate translation of the new text of speculation. Facts have proved that this technology requires large amounts of data and a lot of computing power, is the right of Google's alley. ... Google's service is sufficient to convey the essence of news articles, it has become a quick translation of millions of people everywhere.
Google's rapid rise to the translation of business executives is a result of what Google released a complex problem, and its powerful computing power for reminding me. The data center, and its Web search, it may be now, when attacked with the network, is the world's largest computer. Google's machine translation technology is being used to push forward the limit. Last month, for example, it indicated that it was a combination of image analysis of the translation tools to enable a person, says that while walking in the German mobile phone menu, photos and immediately the English translation.
---
Okay, perhaps not spectacular... but compared to Babelfish:
--- ...Is anything the prompt possible to occur to the translation business's crown trapezoid's Google quick rise, when Google unties it when the complex question violence computing power. Perhaps the data central network it for the net search establishment now is, when attacks together, world large-scale computer. Google uses that machine to push in the translation technology limit. The previous month, for example, it said that it operates and the image analysis unifies its translation tool, allows the human to adopt a menu the handset picture and obtains one with German immediately English translation. ... in the mid-1990s, researcher started to favor the so-called statistical method. They have discovered that if they have fed the translation which the computer thousands or the tens of thousands of paragraphs and their person cause, its possibly academic society does about what kind of guesses translator accurately the new text. _ it this technology, requests the huge large amount data finally and completely the calculated horsepower, is correct Google the alley. ... The Google service is enough good expresses the news article the essence, and it has become translation quick origin tens of thousands of people
---
Re:Converting that article from English to Chinese (Score:3, Interesting)
Re:Converting that article from English to Chinese (Score:3, Interesting)
Plus, round-trip translation at least doubles the error compared to an actual application which would involve one-way translation (and probably more, since the "return-trip" translation is starting with a poor quality input). A much more fair test would be comparing a one-way translation, man vs. machine.
For western languages... (Score:3, Interesting)
Just not now. It still needs a lot of work.
I'm in the translation business, and the general trend in internet communications such as websites, etc. at least, is to simplify the language being used.
For specialized text, we're a long way off yet.
Re:Similar languages (Score:5, Interesting)
This seems like the ideal opportunity to mention Translation Party [translationparty.com]. You give it English, and it translates it to and back from Japanese until the input and output English are the same.
It can be a ton of fun.
Re:Converting that article from English to Chinese (Score:5, Interesting)
Der Spiegel offers version of some of its stories in English. They aren't direct translations, but quite similar.
Here's part of a story published in english [spiegel.de]:
And the same story, published in German, [spiegel.de] translated to English by google:
And babelfish translation of the same story:
I do think the google version is significantly better.
I noticed that they were using my web site (Score:2, Interesting)
I have a web site where every page is available in English and German. When I tested Google's translation with it, I noticed that Google reliably translated one sentence in the opposite direction, i.e. from English to German when I had asked for a German to English translation: On every page in German, there is one sentence in English which leads to the corresponding page in English. Google's translator appeared to pick the translation right from that page, which of course has that sentence in German (leading to the German version of that page). Google doesn't do this anymore, but when I saw it, I realized that Google's translator did not at all "understand" what it was translating.
Their search parsing tech probably helps too (Score:3, Interesting)
An exerpt from the article:
"People change words in their queries. So someone would say, 'pictures of dogs,' and then they'd say, 'pictures of puppies.' So that told us that maybe 'dogs' and 'puppies' were interchangeable. We also learned that when you boil water, it's hot water. We were relearning semantics from humans, and that was a great advance." But there were obstacles. Google's synonym system understood that a dog was similar to a puppy and that boiling water was hot. But it also concluded that a hot dog was the same as a boiling puppy.
Re:Similar languages (Score:3, Interesting)
Are you sure that the error messages are even meaningful in Korean?
Asian languages and vastly different grammar (Score:5, Interesting)
Several others have noted this as well - for Asian languages, Google has a lot of work to do. The Chinese translation near the top is impressive, but while Chinese and Japanese translations are probably pretty good on Google, other Asian languages suffer greatly.
I've been translating a lot of Thai lately, and initially I thought Google was great - the interface is really slick, and it seemed to give a decent result. Passing the translation back through often gave me really weird stuff, but I was expecting that. So it was great, until I tried using it to communicate with someone in Thai - even for really, really basic stuff, often they had absolutely no idea. It was just way off.
While you can feed western languages through it and get great, usable results, for Asian languages besides Chinese and Japanese it's next to useless. I'm guessing there isn't much of an incentive for Google to focus on other Asian languages - for example, in Android 2.1 on the Nexus One there is no way to even install fonts for less-popular Asian scripts like Thai, much less inputting text in those scripts - despite this capability being available on certain other Android phones (you can install it on the Nexus One if you root it, of course).
Based on what their technique for learning translation is, though, hopefully this will improve over time. It's an impressive system as it is, but very much limited to "popular" languages and those very similar to English.
Re:Converting that article from English to Chinese (Score:2, Interesting)
In Philip K. Dick's obscure 1969 novel Galactic Pot-Healer [wikipedia.org], the characters play a game based on this very idea. They take common sayings and figures of speech, and feed them through several language-translation computers. The results are then sent to a friend, who attempts to figure out what the original phrase was.
Sometimes when you're reading PKD you get the uncomfortable feeling he really could see into the future.
How different is this from AI research? (Score:1, Interesting)
Obligatory Chinese Room [wikipedia.org] mention.
If a translation engine grows strong enough to adequately translate the phrases "give us our daily bread," "sharks are predatory carnivores," and "the loan shark wants his bread," that implies a significant ability to contextually infer meaning. Could someone opine on (or point to a work exploring) how similar the task of building an accurate translator is to the task of building a competent, world-aware (if perhaps not absolutely Turing-quality) AI?
Re:Similar languages (Score:3, Interesting)
That is fun. Your sig breaks it [translationparty.com].