Misplaced Pages

Speech Recognition & Synthesis

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
(Redirected from Google Text-to-Speech) Screen reader application by Google
Speech Recognition & Synthesis
Developer(s)Google
Initial release10 October 2013; 11 years ago (2013-10-10)
Stable release20241030.02/p3 (Build 702043126) / 2 December 2024; 32 days ago (2024-12-02)
Operating systemAndroid
TypeScreen reader

Speech Recognition & Synthesis, formerly known as Speech Services, is a screen reader application developed by Google for its Android operating system. It powers applications to read aloud (speak) the text on the screen, with support for many languages. Text-to-Speech may be used by apps such as Google Play Books for reading books aloud, Google Translate for reading aloud translations for the pronunciation of words, Google TalkBack, and other spoken feedback accessibility-based applications, as well as by third-party apps. Users must install voice data for each language.

Supported languages

  • Afrikaans (South Africa)
  • Albanian (Albania)
  • Amharic (Ethiopia)
  • Arabic (Saudi Arabia)
  • Assamese (India)
  • Basque (Spain)
  • Bengali (Bangladesh)
  • Bengali (India)
  • Bodo (India)
  • Bosnian (Bosnia and Herzegovina)
  • Bulgarian (Bulgaria)
  • Burmese (Myanmar)
  • Cantonese (Hong Kong)
  • Catalan (Spain)
  • Chinese (China)
  • Chinese (Taiwan)
  • Croatian (Croatia)
  • Czech (Czech Republic)
  • Danish (Denmark)
  • Dogri (India)
  • Dutch (Belgium)
  • Dutch (Netherlands)
  • English (Australia)
  • English (Nigeria)
  • English (India)
  • English (United Kingdom)
  • English (United States)
  • Estonian (Estonia)
  • Filipino (Philippines)
  • Finnish (Finland)
  • French (Canada)
  • French (France)
  • Galician (Spain)
  • German (Germany)
  • Greek (Greece)
  • Gujarati (India)
  • Hausa (Nigeria)
  • Hebrew (Israel)
  • Hindi (India)
  • Hungarian (Hungary)
  • Icelandic (Iceland)
  • Indonesian (Indonesia)
  • Italian (Italy)
  • Japanese (Japan)
  • Javanese (Indonesia)
  • Kannada (India)
  • Kashmiri (India)
  • Khmer (Cambodia)
  • Konkani (India)
  • Korean (South Korea)
  • Latin (Vatican City)
  • Latvian (Latvia)
  • Lithuanian (Lithuania)
  • Maithili (India)
  • Malay (Malaysia)
  • Malayalam (India)
  • Manipuri (India)
  • Marathi (India)
  • Nepali (Nepal)
  • Norwegian (Norway)
  • Odia (India)
  • Polish (Poland)
  • Portuguese (Brazil)
  • Portuguese (Portugal)
  • Punjabi (India)
  • Romanian (Romania)
  • Russian (Russia)
  • Sanskrit (India)
  • Santali (India)
  • Serbian (Serbia)
  • Sindhi (India)
  • Sinhala (Sri Lanka)
  • Slovak (Slovakia)
  • Slovenian (Slovenia)
  • Spanish (Spain)
  • Spanish (United States)
  • Sundanese (Indonesia)
  • Swahili (Kenya)
  • Swedish (Sweden)
  • Tamil (India)
  • Telugu (India)
  • Thai (Thailand)
  • Turkish (Turkey)
  • Ukrainian (Ukraine)
  • Urdu (Pakistan)
  • Urdu (India)
  • Vietnamese (Vietnam)
  • Welsh (United Kingdom)

History

This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "Speech Recognition & Synthesis" – news · newspapers · books · scholar · JSTOR (November 2023) (Learn how and when to remove this message)

Some app developers have started adapting and tweaking their Android Auto apps to include Text-to-Speech, such as Hyundai in 2015. Apps such as textPlus and WhatsApp use Text-to-Speech to read notifications aloud and provide voice-reply functionality.

Google Cloud Text-to-Speech is powered by WaveNet, software created by Google's UK-based AI subsidiary DeepMind, which was bought by Google in 2014. It tries to distinguish from its competitors, Amazon and Microsoft.

Most voice synthesizers (including Apple's Siri) use concatenative synthesis, in which a program stores individual phonemes and then pieces them together to form words and sentences. WaveNet synthesizes speech with human-like emphasis and inflection on syllables, phonemes, and words. Unlike most other text-to-speech systems, a WaveNet model creates raw audio waveforms from scratch. The model uses a neural network that has been trained using a large volume of speech samples. During training, the network extracts the underlying structure of the speech, such as which tones follow each other and what a realistic speech waveform looks like. When given a text input, the trained WaveNet model can generate the corresponding speech waveforms from scratch, one sample at a time, with up to 24,000 samples per second and smooth transitions between the individual sounds.

The service was renamed Speech Recognition & Synthesis in 2023.

See also

References

  1. "Speech Recognition & Synthesis". Google Play. Retrieved 2024-12-11.
  2. "Speech Recognition & Synthesis googletts.google-speech-apk_20241125.02_p2.702443970". APKMirror. 2024-12-11. Retrieved 2024-12-11.
  3. Wang, Jules (November 8, 2021). "You'll never guess the latest Google app to cross 10 billion installs (seriously)". Android Police. Archived from the original on November 8, 2021. Retrieved November 18, 2021.
  4. "Google, Hyundai show off new third-party Android Auto apps". CNET. CBS Interactive. Retrieved 17 January 2015.
  5. ^ "WaveNet". www.deepmind.com. Retrieved 2023-06-22.
  6. Gibbs, Samuel (2014-01-27). "Google buys UK artificial intelligence startup Deepmind for £400m". The Guardian. ISSN 0261-3077. Retrieved 2023-06-22.
  7. "Text-to-Speech AI: Lifelike Speech Synthesis". Google Cloud. Retrieved 2023-06-22.

External links

Google
a subsidiary of Alphabet
Company
Divisions
Subsidiaries
Active
Defunct
Programs
Events
Infrastructure
People
Current
Former
Criticism
General
Incidents
Other
Development
Software
A–C
D–N
O–Z
Operating systems
Language models
Neural networks
Computer programs
Formats and codecs
Programming languages
Search algorithms
Domain names
Typefaces
Products (software and services)
Defunct or discontinued
Hardware
Pixel
Smartphones
Smartwatches
Tablets
Laptops
Other
Nexus
Smartphones
Tablets
Other
Other
Litigation
Advertising
Antitrust
Intellectual property
Privacy
Other
Related
Concepts
Products
Android
Street View coverage
YouTube
Other
Documentaries
Books
Popular culture
Other
Italics denote discontinued products.
Android
Software
development
Development tools
Official
Other
Integrated
development
environments
(IDE)
Languages, databases
Virtual reality (VR)
Events, communities
Releases
Derivatives
Devices
Pixel
Nexus
Play edition
Custom
distributions
Booting and
recovery
APIs
Alternative UIs
Rooting
Lists
Related topics
Categories: