Generate Human-like Audio From Text

Republished By Plato


Summary

Using Node.js and React components, create a web app that generates human-like audio from text. The app uses IBM® Watson™ Text to Speech to provide a selection of voices with support for multiple languages and genders. Watson Text to Speech is available on IBM Cloud and with the Watson API Kit on IBM Cloud Pak™ for Data.

Description

Built with React components and a Node.js server, the text-to-speech web app takes text input and sends it to the Watson Text to Speech service to be spoken in the voice you choose. Various voices (male and female) are available, covering many languages and regions.

By adding Speech Synthesis Markup Language (SSML) elements to the input text, you can adjust how the voice sounds. SSML can control timing, expressiveness, pitch, breathiness, rate, pronunciation, and more.
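As a rough illustration of what SSML input can look like, the sketch below wraps plain sentences in standard `<speak>`, `<prosody>`, and `<break>` elements. The helper names `escapeXml` and `toSsml` are hypothetical and not taken from the pattern's code, and which attributes a given voice honors is an assumption to check against the service documentation.

```javascript
// Escape characters that have special meaning in XML so user text
// cannot break the SSML markup.
function escapeXml(text) {
  return text
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;')
    .replace(/"/g, '&quot;')
    .replace(/'/g, '&apos;');
}

// Hypothetical helper: join sentences with a pause between them and
// apply one speaking rate and pitch to the whole passage.
function toSsml(sentences, { rate = 'medium', pitch = 'medium', pauseMs = 300 } = {}) {
  const pause = `<break time="${pauseMs}ms"/>`;
  const body = sentences.map(escapeXml).join(pause);
  return `<speak><prosody rate="${rate}" pitch="${pitch}">${body}</prosody></speak>`;
}

// Example: slower speech with a half-second pause between sentences.
const ssml = toSsml(['Hello there.', 'Salt & pepper, please.'], { rate: 'slow', pauseMs: 500 });
console.log(ssml);
```

The resulting string can be supplied wherever the app accepts input text, in place of plain text.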

This app is intended to get you started. A text-to-speech app is a fun example, but the real payoff comes when you use this code to give your own application a voice.

Watson Text to Speech is available on IBM Cloud and with the Watson API Kit on IBM Cloud Pak for Data. With IBM Cloud Pak for Data, you can provision Watson Text to Speech on your own private cloud or wherever Red Hat OpenShift runs.

When you have completed this code pattern, you will understand how to:

Retrieve and play audio from the Watson Text to Speech service using a REST API
Integrate the Watson Text to Speech service in a web app
Use React components and a Node.js server
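To make the REST interaction more concrete, here is a minimal sketch of how a Node.js server might build a synthesize request. It assumes the publicly documented `POST /v1/synthesize` endpoint with basic authentication using the user name `apikey`. The function names, the default voice, and the `serviceUrl` and `apiKey` values are placeholders for illustration, not the pattern's actual code.

```javascript
// Build (but do not send) a request for the Text to Speech synthesize endpoint.
// serviceUrl and apiKey stand in for your own service credentials.
function buildSynthesizeRequest({
  serviceUrl,
  apiKey,
  text,
  voice = 'en-US_MichaelV3Voice', // assumed voice name; pick one your instance offers
  accept = 'audio/wav',
}) {
  // Trim trailing slashes so the path joins cleanly.
  const url = new URL(serviceUrl.replace(/\/+$/, '') + '/v1/synthesize');
  url.searchParams.set('voice', voice);
  return {
    url: url.toString(),
    options: {
      method: 'POST',
      headers: {
        Authorization: 'Basic ' + Buffer.from('apikey:' + apiKey).toString('base64'),
        'Content-Type': 'application/json',
        Accept: accept,
      },
      body: JSON.stringify({ text }),
    },
  };
}

// Sending the request needs real credentials and Node.js 18+ (built-in fetch).
// Resolves to a Buffer holding the returned audio bytes.
async function synthesize(params) {
  const { url, options } = buildSynthesizeRequest(params);
  const response = await fetch(url, options);
  if (!response.ok) {
    throw new Error(`Synthesis failed with HTTP ${response.status}`);
  }
  return Buffer.from(await response.arrayBuffer());
}
```

Keeping request construction separate from the network call makes the request shape easy to inspect and test without credentials.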

Flow


The user supplies some text as input to the application (running locally, in IBM Cloud, or in IBM Cloud Pak for Data).
The application sends the text to the Text to Speech service.
As the data is processed, the Text to Speech service returns audio information to the HTML5 audio element for playback.
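On the browser side, the flow above could be sketched as building a URL for the app's own server and handing it to an HTML5 audio element. The route `/api/synthesize`, its `voice` and `text` query parameters, and the function name `buildAudioSrc` are assumptions for illustration; the pattern's real route may differ.

```javascript
// Hypothetical helper: build the URL that an <audio> element would load.
// URLSearchParams takes care of encoding spaces and special characters.
function buildAudioSrc(text, voice, route = '/api/synthesize') {
  const params = new URLSearchParams({ voice, text });
  return `${route}?${params.toString()}`;
}

// In a browser, the URL would be assigned to the audio element, which
// fetches the audio from the server and plays it:
//
//   const audio = document.querySelector('audio');
//   audio.src = buildAudioSrc('Hello world', 'en-US_MichaelV3Voice');
//   audio.play();

console.log(buildAudioSrc('Hello world', 'en-US_MichaelV3Voice'));
```

Pointing the audio element at a server route, rather than at the service directly, keeps the service credentials out of the browser.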

Instructions

Find the detailed steps for this pattern in the readme file. The steps show you how to:

Provision the Watson Text to Speech service.
Deploy the server.
Use the web app.

Source: https://developer.ibm.com/patterns/generate-human-like-audio-from-text/

Time Stamp: July 9, 2020



