Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
×
Software Android Google IOS Technology

Google Brings Offline Neural Machine Translations For 59 Languages To Its Translate App (techcrunch.com) 46

Google is rolling out offline Neural Machine Translation (NMT) support for 59 languages in the Translate apps. Some of the supported languages include Arabic, Chinese, English, German, Japanese, Spanish, French, and Korean (TechCrunch has a full list of the languages in their report). From the report: In the past, running these deep learning models on a mobile device wasn't really an option since mobile phones didn't have the right hardware to efficiently run them. Now, thanks to both advances in hardware and software, that's less of an issue and Google, Microsoft and others have also found ways to compress these models to a manageable size. In Google's case, that's about 30 to 40 megabytes per language. Users will see the updated offline translations within the next few weeks.
This discussion has been archived. No new comments can be posted.

Google Brings Offline Neural Machine Translations For 59 Languages To Its Translate App

Comments Filter:
  • Will google allow to save this NMT translations (@40Meg a pop) on the SD card?

    Currently, they DO NOT allo to save these translations to the SD card in android, while Microsoft Translator does.

    Is the only reason I use the Microsoft app and not both (for the languages I care about both are about the same, so I'd be delighted to have both).

  • by epine ( 68316 ) on Tuesday June 12, 2018 @10:14PM (#56775032)

    Is English considered to be the pivot language, or do all of these models product the same intermediate representation?

    Rather useless article, with no shred of a deep understanding, whatsoever.

    I'm guessing you run the input model from language to IR, and the output model from IR back to language, so you need to have at least two models to use this app. (I suppose you could translate from English to IR and back to English again, for perverse joy.)

    Only I haven't read anything about training multiple machine translation models with a shared IR. That strikes me as technically difficult, and I would have thought I'd have seen some loud crowing out there, had it been achieved (it's now been a couple of months since I gave the Internet a good shake on machine learning, and things move fast).

    • Re:pivot language? (Score:4, Informative)

      by Lenbok ( 22992 ) on Wednesday June 13, 2018 @01:12AM (#56775506)

      I'm not sure to what extent it relates to the specific offline translation modules in the translate app, but a while back the Google Research blog had a post on multi-lingual machine translation models (and that let them do translation between two languages for which they didn't have direct translation training corpus). So at least in that case, there is just a single translation model rather than separate input and output models that go to and from an IR.

      https://ai.googleblog.com/2016... [googleblog.com]

    • Re:pivot language? (Score:5, Interesting)

      by AmiMoJo ( 196126 ) on Wednesday June 13, 2018 @03:59AM (#56775818) Homepage Journal

      I don't have that detail but I can tell you my experience as a user of machine translation for 15+ years.

      Originally it only really worked on formal documents, and even then only produced something you could barely understand. The biggest issue seemed to be that it didn't understand context at all.

      Google made some early improvements in making the translated text sound more natural. They also managed to fix a lot of common phrases that didn't quite fit the standard grammar model and thus didn't used to get translated properly. Apparently they did that by using the web as a resource for natural language and by allowing users to submit corrections.

      Then AI started to be used. Baidu were the first I think and their Chinese/English translation was a huge improvement over everything else. It seemed to work slightly better going from English to Chinese though, and when Google released their AI updates not long after Chinese to English became nearly perfect.

      It's actually incredible how good it is now. Often the resulting translation is not only accurate and seemingly context aware, it sounds like something a person might actually say. You don't have to think about what you are writing either. Before you had to be careful to phrase things so that the software could understand it, but not any more.

      There are still some issues, like the way Japanese newspaper headlines often get translated as if it was a person speaking about their own experience (e.g. some houses were flooded, but the translation is "my house was flooded" because the software assumes that context), but for conversations between two people it's like Star Trek or something.

      • by Anonymous Coward

        but for conversations between two people it's like Star Trek or something.

        I think you must be using a different Google Translate than I do.

        I do ja->en quite often, usually on informal (but not in any way slangy) text and dialogue and I honestly cannot think of the end result as anything more than source of amusement. That's for English; it's much worse to/from my native language...

        I'll have an example now (romanised, because slashdot). Just a random sentence I had lying around.

        Source text:

        ore ha mitame kara tekkiri aoyanagi no hou ga tosiue da to omotte ita kara, siturei wo syouti de tazuneta.

        Google:

        I thought Aoyagi was the older from the appearance, so I asked for rude and asked.

        Manual translation:

        I was certain that Aoyanagi was the older one of the two, so though I knew that asking would be rude, I decided to enquire:

        Luckily I am reasonably proficient in Japanese so I do not have

        • Re:pivot language? (Score:5, Informative)

          by AmiMoJo ( 196126 ) on Wednesday June 13, 2018 @08:04AM (#56776306) Homepage Journal

          Try using Google Translate and Bing Translate on a random story from srad.jp. Srad used to be Slashdot Japan before the name change, and the story summaries are written in an informal tone similar to Slashdot ones (but better edited!)

          Google:


          Ministry of Economy, Trade and Industry, Internet mail order order due to erroneous judgment of smart speaker will not be contracted

          First of all, about electronic commerce using smart speakers, smart speakers have the ability to order orders to net mail orderers by voice. However, when an order occurred due to misunderstanding or misunderstanding, guidelines on how to handle that order were not shown. In this revised bill, it is clearly stated that "contract through AI speaker has not been established" for misrecognized orders, and it is said that businesses must properly deal with these problems. Also, even if the ordering party makes a mistake, if the system is such that confirmation is not made for the order, the ordering party may be able to argue the invalidity of the contract.

          You can see some trivial mistakes, like how two different words in Japanese are translated into the same English word "misunderstanding", but the meaning is clear and things like the ministry name are correct and the sentences are actual English.

          Bing:

          Due to misjudgment of transdermal production Ministry said, smart speaker e-store purchase contract would suppose

          There are features for e-commerce using the first source to speaker smart speaker audio to Internet mail order companies order allows. But the positives and say mistakes in order occurs, the order what to do of the guidelines was not shown. We're in this amendment, and describing "the agreement through the AI speakers has not been established" in probable order operators must respond properly on these issues. In addition, says is possible if the system check do not for the order if the buyer did mistakes, the officials can claim contract invalid.

          It's like something out of the 90s era Bablefish. Not only is the interpretation of the original Japanese poor, but the resulting English sentences are broken too.

      • Here the Google translator never, ever gets Brazilian Portuguese (or even Portuguese) right. he is unable to understand the portuguese verbal agreement, the correct order of verbs, and sometimes simply invents expressions that have nothing to do with the original text, to be left alone in the most obvious problems. As a Brazilian Portuguese speaker I have to first "translate" what I mean in the most basic and simple possible way or the translator will completely fail to execute the translation.
        • by AmiMoJo ( 196126 )

          I guess it's going to be different for every language, depending on how much time the have put into it, if they have staff who speak that language on the team etc. It seems to work okay for Spanish and French, but I don't know how different Portuguese is...

        • Here the Google translator never, ever gets Brazilian Portuguese (or even Portuguese) right. he is unable to understand the portuguese verbal agreement, the correct order of verbs, and sometimes simply invents expressions that have nothing to do with the original text, to be left alone in the most obvious problems. As a Brazilian Portuguese speaker I have to first "translate" what I mean in the most basic and simple possible way or the translator will completely fail to execute the translation.

          What do you expect? Look at how many words have multiple meanings! Is [i]canjica[/i] sweetened cream corn with cinnamon or a dessert made of hominy, coconut milk, shredded coconut, and cloves? Is [i]bomba[/i] bomb, pump, or a pastry in this context? How should the English [i]to be[/i] be translated in this context (ser, estar, ficar, passar)? Is [i]manga[/i] a sleave, mango, or Japanese art? Is yucca called [i]aipim[/i], [i]mandioca[/i], or [i]cassava[/i]? Are you talking about a computer mouse ([i]mouse[/i

      • by jebrick ( 164096 )

        I just have experience with the MS translation APIs but I am guessing that Google will be similar. With MS, you can "train" their cognitive hub in a language by putting paired documents into it. They need to be the same document that has already been translated. The Hub learns and is able to (eventually) pick up tone. It is useful in industries that can have more specialized language usages.

        Downside is that a company must have one "Hub" for each language pair. Takes a lot of training.

    • Is English considered to be the pivot language, or do all of these models product the same intermediate representation?

      Pragmatically speaking it's a good choice, since from the users' POV to & from English would be priorities anyway.

      Technically I'd say it's appalling. Irregular, words with multiple meanings, phrasal bloody verbs and did I mention it's irregular?

  • by phantomfive ( 622387 ) on Tuesday June 12, 2018 @11:01PM (#56775190) Journal
    I'm not sure the new model is actually better than the old model. In recent months, I've seen it make bizarre mistakes, like translate "man" as "woman" in contexts where there was no room for mistake. Also it translated 10,000 as a million. Something is wrong with it.
    • Google will do that 95% of the time. Even if it was 100% wrong consistently you would be able to work around it, but it's random. And really quite bad. If you didn't already know what number to expect you wouldn't even know it was wrong.

      Chinese have a concept of 10,000 being a standard division for counting [wikipedia.org], so things will be measured in 10k's but Google changes them to either just thousands or millions instead depending on how it feels that particular time.

      Not quite the same as Windy just making up numbers [slashdot.org]

    • Man as woman?

      Is it possible that it is just trying to be PC and substitute woman for man when the context isn't necessarily important?

      Like the phrase "Every man for himself" would occasionally translate to "Every woman for herself" - you know, for the sake of inclusiveness.

      Does it ever screw up the other way around?

  • Is this true?
  • The main reason I like these 'edge computing' developments is that they give access to advanced functionality without constantly reporting to the cloud what you are doing.

    Then again, this being Google, I suspect this opportunity has not been taken..
  • Uh? I had this enabled several months ago. Is this one of those features that gets rolled progressively, and I was lucky?

    Or may be it's because I'm not in the US and they launched it at countries with non-English languages before?

  • But the 1 remaining question on everyone's mind: have the fixed the "beep beep lettuce" translation?

"Protozoa are small, and bacteria are small, but viruses are smaller than the both put together."

Working...