Google Open-Sources Live Transcribe's Speech Engine (venturebeat.com) 14
Friday Google open-sourced "the speech engine that powers its Android speech recognition transcription tool Live Transcribe," reports Venture Beat:
The company hopes doing so will let any developer deliver captions for long-form conversations. The source code is available now on GitHub.
Google released Live Transcribe in February. The tool uses machine learning algorithms to turn audio into real-time captions. Unlike Android's upcoming Live Caption feature, Live Transcribe is a full-screen experience, uses your smartphone's microphone (or an external microphone), and relies on the Google Cloud Speech API. Live Transcribe can caption real-time spoken words in over 70 languages and dialects. You can also type back into it — Live Transcribe is really a communication tool. The other main difference: Live Transcribe is available on 1.8 billion Android devices. (When Live Caption arrives later this year, it will only work on select Android Q devices.)
Google released Live Transcribe in February. The tool uses machine learning algorithms to turn audio into real-time captions. Unlike Android's upcoming Live Caption feature, Live Transcribe is a full-screen experience, uses your smartphone's microphone (or an external microphone), and relies on the Google Cloud Speech API. Live Transcribe can caption real-time spoken words in over 70 languages and dialects. You can also type back into it — Live Transcribe is really a communication tool. The other main difference: Live Transcribe is available on 1.8 billion Android devices. (When Live Caption arrives later this year, it will only work on select Android Q devices.)
Summary is worded poorly (Score:5, Informative)
Re: (Score:1)
If it is free to use - great news. But I get the impression this is not the case from a glance at the readme.
Re: (Score:2)
That lie was clear from the summary when they said it relied on an API. That always would mean that it is just a client, not the "engine."
Re: (Score:1)
Reading the (linked, not Slashdot) article, I've come to the same conclusion. It appears to be only an open source client app, the heavy lifting is still done in google's cloud.
Re: (Score:1)
>came here to say this /thread.
a callback to their api != release the code
this is almost like a free ad for google
besides, you also gift them the data you send. this is horrible, not open source at all. change this headline for journalism integrity's sake
Open Source API, Closed Source product (Score:5, Insightful)
They have released the source code to the library that talks to their servers, as long as you have an API key. The engine that does all the hard work is still closed source, still requires Google's permission to use, still requires you to be on line and of course still allows Google to collect all your data.
This isn't about Google magnanimously releasing their code to the community so that others can build on their science and improve the state of the art. This is about Google making it possible for people building things other than Android applications to buy into Google's services.
Re: (Score:2)
The last slashdot editor who made it past "hello world" was the Taco himself.
Re: (Score:1)
Not a Speech Engine. Not even a Speech API (Score:2)
Programmers have gotten too dependent on remote servers for shit that used to be done locally. Heck, I didn't need a network connection to have my 4K Radio Shack turn the lights on an
Re: (Score:1)
Re: (Score:2)
And this right here, is why we ditched main frames. The sad reality is everyone forgets.
I myself keep my hands on my pc at all costs. I do not want to send all this info off to google to be mined. Im iritated enough I can't remove google assistant, that keeps telling me the where it thinks I want to eat.
I tested Live Transcription App for work (Score:3)
The quality of the voice recognition was remarkable, far better than anything I tested. The app lacked any feature to save the text making it unsuitable for my needs. It is intended for use by the hard of hearing in noisy environments and I wanted it to automate note taking.
Re: (Score:1)
Hey look, we have open-sourced curl (Score:1)
That is why we have open-sourced a specially crafted version of curl so that you can offer us your data for our proprietary stuff...