Live Caption for iOS

To start, press the microphone button, speak, and text results will appear. It also works with Bluetooth devices like headsets and standalone microphones. Get streaming transcriptions for the world around you, only with the Live Caption app.
Live Caption is NOT for transcribing phone calls. 

Instructions:

  • Live Caption for iOS listens to spoken words and translates that to text in real time. It is best for in-person conversation, not for use with incoming phone calls.

  • To begin captioning, press the microphone button on the keyboard in the lower left corner. Speak normally, and text results will appear as recognized. The microphone will stay open until long periods of no speech.

  • To respond with typed text, tap "Clear All" and all previous messages on screen will be removed. Type with keyboard and the typed text will appear at the same size.

  • You may edit any text as you would in any other text editing software. Tap into your text results and the keyboard will appear to edit your text.

  • You may tap and hold to select large sections of text. This is helpful to delete or copy and paste sections of text into another app if desired.

  • iOS 8 on iPhone allows you to install third party keyboards. If you have your phone set to use a non-standard keyboard, you may not be able to access the microphone and voice recognition feature.

  • Your keyboard must have the microphone button showing and enabled for use. To change your keyboard within the app, tap and hold the world icon (or icon of your third party keyboard) key in the lower left spot on your keyboard to switch between enabled keyboards. Make sure it is set to the world icon keyboard.

Tips:

  • Pair with Bluetooth devices like headsets and standalone microphones for greater distances. Good for use in a classroom, in church or at the office.

  • Stream the app to a Television by connecting an Apple TV for a larger display

  • Captioning may not be as accurate with strong accents or children's voices.

  • Live Caption does not record or store voice or text results. Once the "Clear All" button is tapped results cannot be retrieved.

  • If you change the language on your phone, Live Caption will return results in that language.


Live Caption for Android

We are working to turn this into an experience that will be easy to use, whenever you need it. The current voice recognition we use is Google's and is extremely advanced, however it is not perfect yet. It has a harder time with accents and children's voices, but if you are the primary user of the device, you can turn on your phone's speech personalization settings to make it learn your voice as you use it (more instruction below). This greatly improves the results you will get over time. The best part is, that the VR technology is getting better and better as more people use it, and Google is constantly updating the service and accuracy they offer.

Instructions:

  1. Tap on the hint text to initiate keyboard.
  2. Tap on microphone icon in top right of keyboard.
  3. When microphone initiates, begin speaking normally near the device.
  4. Connect a bluetooth microphone for app to caption at longer distances.
  5. "X" button at top, clears all text on screen
  6. The "A" size buttons increase or decrease the size of captions
  7. Keyboard icon closes the keyboard to show captions at full screen.
  8. To edit text, tap anywhere and use like any other text editor.

Tips:

  • Changing the language on your phone will change the returned results to that language as well. 
  • Works with every language that Google supports
  • Live Caption for Android should work with most Bluetooth voice input devices such as headsets and in-ear microphones.

Note to Samsung Users

If you have a newer Samsung device, you need to change the following setting to make Live Caption work in its helpful streaming fashion:

  • Go to: SETTINGS >> LANGUAGE AND INPUT >> TEXT-to-SPEECH OPTIONS,
  • Change: SAMSUNG TEXT-to-SPEECH ENGINE
    to: GOOGLE TEXT-to-SPEECH ENGINE

Offline Speech Capabilities

Android supports offline speech recognition. You must download your preferred language file to your device for this to work on your phone or tablet. Go to Language & Input Settings, Select Google Voice Input and Download offline speech recognition for offline use.

Settings > Language Input > Google Voice Typing > Offline Speech Recognition > Offline Speech Recognition > English


Screenshot_2013-06-06-17-45-08.png

What is the benefit of personalized speech recognition?
Speech recognition is based on statistical modeling. To recognize spoken words, we compare the input speech to a statistical model of the language and try to find the closest match - the system's best guess at what the user said. The statistical model is huge - it must cover all of the fundamental sounds of the language (phonemes), all of the words, and all of the different ways that the words can be strung together in the spoken language. Furthermore, it must capture all of the variations among users that happen when a language is spoken, for example all of the different dialects and accents and individual differences in the sound of the voice (e.g., male vs. female, young vs. old).Knowing what you said in the past allows us to build specialized models that are designed to match your voice and your words. Over time, this allows us to improve the speech recognition accuracy for you. But to do that, we must keep track of what you said in the past.
http://googlemobile.blogspot.com/2010/12/voice-search-gets-personal.html


Google's Personalized Voice Recognition and your privacy

Why do we need to keep a link between you and your voice recordings?
Normally, we keep all the spoken utterances anonymous. In other words, we have millions upon millions of voice recordings with no way of telling who spoke any particular one of them. When you sign up for personalized speech recognition we keep an ‘electronic key’ that links the utterances that you said with your Google account. Using this key we can access your voice recordings and use them to refine your personalized models that are custom tailored for you.

How do we protect the data?
The ‘electronic keys’ are designed to be accessed by machines. Very few people within Google, who passed careful vetting, will have access to them. The personalized acoustic and language models are binary files designed for use by machines.What if you change your mind?

At any time you can go back to the account dashboard and instruct us to break that electronic link. We immediately destroy the electronic keys and delete the personalized models that we built for you.

https://support.google.com/websearch/answer/186263?hl=en