Extract insights from videos

Summary

In this code pattern, learn how to extract speaker-diarized notes and meaningful insights reports from any given video using IBM® Watson™ Speech to Text, Watson Natural Language Understanding, and Watson Tone Analyzer.

Description

In a virtually connected world, staying focused on work or education is very important. Studies suggest that many people lose their focus in live virtual meetings or virtual classroom sessions after approximately 20 minutes. Therefore, many meetings and virtual classrooms are recorded so that individuals can watch them later.

It might help if these recordings could be analyzed and a detailed report of the meeting or class generated by using artificial intelligence (AI). This code pattern explains how to do that. Given a video recording of a virtual meeting or virtual classroom, it shows how to extract the audio from the video file with the FFmpeg open source library, transcribe that audio into speaker-diarized notes with custom-trained Watson Speech to Text language and acoustic models, and generate a natural language understanding report that covers the category, concepts, emotion, entities, keywords, sentiment, top positive sentences, and word clouds, all served from a Python Flask runtime.
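As a concrete illustration of the audio-extraction step, here is a minimal sketch that shells out to the FFmpeg command-line tool. The file names and audio parameters are illustrative choices, not taken from the pattern's code; 16 kHz mono WAV is simply a format that Watson Speech to Text accepts.

```python
# Minimal sketch: pull a mono 16 kHz WAV track out of a video with FFmpeg.
# Assumes the ffmpeg binary is on PATH; paths are placeholders.
import subprocess

def extract_audio(video_path: str, audio_path: str = "meeting.wav") -> str:
    subprocess.run(
        [
            "ffmpeg", "-y",          # overwrite the output file if it exists
            "-i", video_path,        # input video, e.g. an .mp4 recording
            "-vn",                   # drop the video stream
            "-acodec", "pcm_s16le",  # encode audio as 16-bit PCM (WAV)
            "-ar", "16000",          # resample to 16 kHz
            "-ac", "1",              # downmix to mono
            audio_path,
        ],
        check=True,                  # raise if ffmpeg exits nonzero
    )
    return audio_path
```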

After completing this code pattern, you will understand how to:

  • Use the Watson Speech to Text service to convert the human voice into the written word (a transcription sketch follows this list)
  • Use advanced natural language processing to analyze text and extract metadata from content such as concepts, entities, keywords, categories, sentiment, and emotion
  • Leverage Watson Tone Analyzer cognitive linguistic analysis to identify a variety of tones at both the sentence and document level
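For the first of these, a minimal sketch of speaker-diarized transcription with the ibm-watson Python SDK might look as follows. The API key, service URL, and file name are placeholders, and the stock broadband model stands in for the custom-trained language and acoustic models the pattern builds.

```python
# Minimal sketch: diarized transcription with the ibm-watson SDK
# (pip install ibm-watson). Credentials below are placeholders.
from ibm_watson import SpeechToTextV1
from ibm_cloud_sdk_core.authenticators import IAMAuthenticator

stt = SpeechToTextV1(authenticator=IAMAuthenticator("YOUR_STT_API_KEY"))
stt.set_service_url("YOUR_STT_SERVICE_URL")

with open("meeting.wav", "rb") as audio:
    result = stt.recognize(
        audio=audio,
        content_type="audio/wav",
        model="en-US_BroadbandModel",  # the pattern custom-trains its own models
        speaker_labels=True,           # enables speaker diarization
    ).get_result()

# Each speaker label maps a time range to a speaker id; joining these with
# the word timestamps in the results yields speaker-diarized notes.
for label in result.get("speaker_labels", []):
    print(label["speaker"], label["from"], label["to"])
```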

Flow

  1. The user uploads a recorded video file of the virtual meeting or virtual classroom.
  2. The FFmpeg library extracts audio from the video file.
  3. The Watson Speech to Text service transcribes the audio into speaker-diarized text.
  4. (Optional) The Watson Language Translator service translates transcripts in other languages into English.
  5. Watson Tone Analyzer analyzes the transcript and picks out the top positive statements.
  6. Watson Natural Language Understanding reads the transcript to identify key pointers and to extract sentiment and emotion (steps 5 and 6 are sketched after this list).
  7. The key pointers and summary of the video are presented to the user in the application.
  8. The user can download the textual insights.
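A hedged sketch of steps 5 and 6 with the ibm-watson SDK follows; the credentials, version dates, and feature limits are illustrative assumptions, and filtering sentence tones for high-scoring "joy" is one plausible way to surface the top positive statements.

```python
# Minimal sketch: run Tone Analyzer and Natural Language Understanding
# over the transcript. Credentials and version dates are placeholders.
from ibm_watson import NaturalLanguageUnderstandingV1, ToneAnalyzerV3
from ibm_watson.natural_language_understanding_v1 import (
    CategoriesOptions, ConceptsOptions, EmotionOptions, EntitiesOptions,
    Features, KeywordsOptions, SentimentOptions,
)
from ibm_cloud_sdk_core.authenticators import IAMAuthenticator

transcript = "..."  # diarized text produced by Speech to Text

# Step 5: per-sentence tones; high-scoring "joy" sentences are one way
# to pick out the top positive statements.
tone = ToneAnalyzerV3(
    version="2017-09-21",
    authenticator=IAMAuthenticator("YOUR_TONE_API_KEY"),
)
tone.set_service_url("YOUR_TONE_SERVICE_URL")
tones = tone.tone(tone_input=transcript, content_type="text/plain").get_result()

# Step 6: categories, concepts, emotion, entities, keywords, and sentiment
# for the insights report.
nlu = NaturalLanguageUnderstandingV1(
    version="2022-04-07",
    authenticator=IAMAuthenticator("YOUR_NLU_API_KEY"),
)
nlu.set_service_url("YOUR_NLU_SERVICE_URL")
insights = nlu.analyze(
    text=transcript,
    features=Features(
        categories=CategoriesOptions(limit=3),
        concepts=ConceptsOptions(limit=5),
        emotion=EmotionOptions(),
        entities=EntitiesOptions(limit=10),
        keywords=KeywordsOptions(limit=10),
        sentiment=SentimentOptions(),
    ),
).get_result()
```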

Instructions

Find the detailed steps in the README file. Those steps explain how to:

  1. Clone the GitHub repository.
  2. Add the credentials to the application (one possible shape is sketched after this list).
  3. Deploy the application.
  4. Run the application.
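The README is the authority on where the credentials go and how the app is started; purely as a hypothetical illustration of steps 2 and 4, a Flask application might read service credentials from environment variables at startup, as below. All variable names here are invented for this sketch.

```python
# Hypothetical sketch: load Watson service credentials from environment
# variables and start the Flask app. Names are illustrative only.
import os
from flask import Flask

app = Flask(__name__)
for key in ("STT_API_KEY", "STT_URL", "NLU_API_KEY", "NLU_URL",
            "TONE_API_KEY", "TONE_URL"):
    app.config[key] = os.environ[key]  # fails fast if a credential is missing

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=8080)  # step 4: run the application
```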

This code pattern is part of the Extracting insights from videos with IBM Watson use case series, which showcases a solution for extracting meaningful insights from videos by using the Watson Speech to Text, Watson Natural Language Understanding, and Watson Tone Analyzer services.

Source: https://developer.ibm.com/patterns/extract-textual-insights-from-a-given-video/
