A college professor is looking to transcribe some audio files as text and publish them online. He wrote – “We have some old lectures recorded on reel-to-reel tapes. We have digitized the audio lectures using Audacity and would now like to transcribe the audio and publish the lectures as text. What is the best way to proceed?”
A quick Google search will return a list of paid transcription services where you can hire people who will accurately transcribe and convert the audio content of your digital files into text. However, if you are looking for an inexpensive and automated option, YouTube can help.
When you upload a video file to YouTube, it will automatically generate subtitles or closed captions for that video. Google uses speech recognition to transform the speech portion of your video into closed captions that are displayed in the video player when the viewer hits the CC button (see screenshot).
How we use IBM Watson speech-to-text to transcribe our classes TLDR; In this step by step guide we’ll show you how to transcribe an audio file using IBM Watson speech-to-text API and a little bit of Python.
If your video has decent audio quality and there not too many people speaking in the video at the same time, YouTube will automatically make a text transcript that may not be as accurate as human transcription but would do the job. The transcript is hidden inside obfuscated JavaScript but there’s a way to download it as plain text file.
Download Audio Transcriptions from YouTube
Here’s a quick guide on how to transcribe audio or video files to text with the help of YouTube.
Go to youtube.com/upload and upload your video file. If you have an MP3 audio file, you may use a tool like Windows Movie Maker, iMovie on Mac or FFMpeg to convert the audio into a video file before uploading to YouTube.
Wait for YouTube to completely process the video. The machine transcriptions may not immediately become available after uploading the video.
Open the YouTube video page in Chrome and look for the CC button in the player. If it exists, the transcribed audio can be downloaded as text.
Press F12 on Windows, or Option+Cmd+J on Mac, to open the JavaScript console inside Chrome Developer tools and paste this code:
It will open the transcribed text of the uploaded video in the current browser tab as shown in this short video. Save the file with a .html extension and double-click to view the transcription in plain text.
The same trick can help you download the closed captions of any video on YouTube even if you are not the uploader. And you can replace “en” in the URL with “fr” or “es” to download the transcriptions in another language.
You'll also like:
POSTED IN: Building Bridges from R to IBM WatsonNovember 05, 2015
REDBOOKS - Building Cognitive Applications with IBM Watson Services series is a seven-volume collection that introduces IBM Watson cognitive computing services. http://www.redbooks.ibm.com/redbooks.nsf/pages/cognitiveapps
Create Shiny Web App Dashboard in R https://medium.com/ibm-data-science-experience/create-r-shiny-web-apps-with-data-science-experience-and-bluemix-bbf51c0bf4db
CODE HOME: Collection REST APIs and SDKs that use cognitive computing to solve complex problems: https://github.com/watson-developer-cloud
GETTING STARTED WITH BLUEMIX - https://www.youtube.com/watch?v=u0Tx2QzBZqA
ZERO TO COGNITIVE - building first app https://www.youtube.com/watch?v=c5nmbF72lCM
GENTLE INTRO / BEGINNER TO BOTS & DESIGN THINKING - t https://developer.ibm.com/courses/all/chatbots-for-good-empathetic-chatbots/
ARCHITECTURES & PATTERNS - IBM's architectures provide the best practices for building applications in the Cloud. https://www.ibm.com/devops/method/category/architectures/
Cloud Patterns, AI Patterns, Data Patterns https://developer.ibm.com/code/
CUSTOMER CARE - Sample application demonstrating how the Watson APIs can be used to support customer care on Twitter - https://github.com/watson-developer-cloud/social-customer-care
VIsion - Sample ASP.Net Core Application for the IBM Watson Visual Recognition Service https://github.com/watson-developer-cloud/visual-recognition-aspnet
A Deploy-To-Bluemix enabled instance of Node-RED that can be forked, customized and reused. https://github.com/watson-developer-cloud/node-red-bluemix-starter
Bot Builder - A simple sample application demonstrating the conversation api - https://github.com/watson-developer-cloud/conversation-simple
SPEECH - Sample Node.js Application for the IBM Watson Speech to Text Service - https://github.com/watson-developer-cloud/speech-to-text-nodejs
PYTHON to Twitter API to fetch a user's timeline and then calls Watson Personality Insights to estimate their personality traits. https://github.com/watson-developer-cloud/personality-insights-twitter-python
PYTHON - Josh Bloom Heaps of info https://github.com/profjsb/python-seminar
SPEECH Python - Python client that interacts with the IBM Watson Speech To Text service through its WebSockets interface - https://github.com/watson-developer-cloud/speech-to-text-websockets-python
Data Refinery in Watson Studio - SHape Data - https://medium.com/ibm-watson/shaping-your-unstructured-text-for-machine-learning-9b60b889b30e
Series of webinars called 'Building with Watson' that we run biweekly. It's a technical web series for developers who want to or are beginning to build with Watson APIs http://www.ibm.com/watson/building-with-watson-webinar.html
Study Guide - https://github.com/havasnewyork/IBM-Watson-Developer-Certification-Study-Guide
IBM WDC Services for the Non-Technical (IBM lunch and learn) https://ibm.box.com/s/d2zmhv48uxpl0t70m85jj68gljj7720r
Tradeoff Analytics to help choose better: http://tradeoff-analytics-demo.mybluemix.net/
Analyzing movie review with 'IBM Insights for Twitter': https://www.youtube.com/watch?v=9yVNwOs9L4c
Golden State Warriors - http://www.nustory.com/special-project-golden-state-warriors-social-media-investigation/
TRAINABLE Visual Classification Model https://visual-recognition-demo-v2.mybluemix.net/
RETRIEVE AND RANK - Chris Ackerson Developing with IBM Watson Retrieve and Rank: Part 1 Solr Configuration https://medium.com/machine-learning-with-ibm-watson/developing-with-ibm-watson-retrieve-and-rank-part-1-solr-configuration-29c18e52966f#.25jegq21i
SEVENTY DAYS - 3 steps for building a cognitive solution in 70 days https://developer.ibm.com/dwblog/2016/cognitive-solution-70-days-joe-kozhaya/
IBM IX - https://www.youtube.com/watch?v=iamSHeHDe7o
Building with Watson, many links below are from list of articles from https://medium.com/@snrubnomis/purple-brain-2eb1f93fce5 from Simon Burns
Simon Says: Simon Burns' Multiple Assets for Assistants and Chatbots https://medium.com/@snrubnomis and https://medium.com/@snrubnomis/conversational-directory-5a5531749295
Getting Chatty with IBM Watson https://medium.com/@snrubnomis/getting-chatty-with-ibm-watson-1075c549ee9e
This is a simple conversation to demonstrate how a user's emotional tone can be used to provide more tailored and empathetic responses by integrating Watson Conversation and Watson Tone Analyzer. http://food-coach.mybluemix.net/
WEARABLE TONE - A cognitive computing experiment to analyze audience's collective tone of voice http://tone-led-pin.mybluemix.net/
IOT Slides and some nice embedded video http://stt-iot-slides.mybluemix.net/3 - IOT - General Architecture for Voice Interaction Quickstart guide https://developer.ibm.com/recipes/tutorials/general-architecture-for-voice-interaction-quickstart-guide/
How To Build a Candy Machine With Feelings https://medium.com/@joshzheng/how-to-build-a-candy-machine-with-feelings-922285a475c8#.84xx9zdel
Cognitive Wingman - Never Walk Alone - http://cognitivewingman.com/
Igor Ramos - cool demo of 'Cognitive + IoT + Blockchain' put together as a talking candy machine. https://vimeo.com/174682798/a4c41a5eb8 http://cognitivecandy.io/free-candy
BEER - Come and taste it - https://www.ibm.com/innovation/milab/work/ibm-sxsw-tasting-experience/
Voice Controlled Robot Arm ($1) - https://dreamtolearn.com/ryan/r_journey_to_watson/40
WINE Selection with your Food: STT NLC TTS - https://www.youtube.com/watch?v=KPOANTGm3hQ
COFFEE Selection by taste https://www.youtube.com/watch?v=9PtkXj07-4U
IBM Blue Voice - Playing with Amazon Echo/Alexa - very light test https://www.youtube.com/watch?v=QyXDUxkNj_k
Audio Analysis Dashboard - TedTalks and Youtube Vids TRANSCRIPTS gen https://audio-analysis-application-starter-kit.mybluemix.net/
MACKENZIE - an IBM Watson Powered Speech / Text / NLC interface for R-Studio https://www.youtube.com/watch?v=imNjZKtyx0s
Sensemaking Systems in Call Centers - https://dreamtolearn.com/ryan/r_journey_to_watson/55
Boaty Mc Boatface - https://dreamtolearn.com/ryan/r_journey_to_watson/36
IBM Skylink connects drones to the IBM Cloud in real time https://techcrunch.com/video/ibm-skylink-connects-drones-to-the-ibm-cloud-in-real-time/57727f125095493da6cdaa7c/
Call for Code: Introducing DroneAid, Puerto Rico - DRONES post disaster https://www.youtube.com/watch?v=9fRcis-5Zuc
create the workings for a multilingual chat room using OpenWhisk, Watson Text to Speech and Watson Language Translator. https://github.com/IBM/serverless-language-translation
Unity Language Translation in some of these videos - https://www.youtube.com/watch?v=sZAlfx8X5kc&list=PLXQJDMm1VnWDHsdlNmDff6pouHa_jpMUw TTS expressive and STT
1. User Segmentation - Discussion Paper - Personality Insights > Jungian Archetypes https://ibm.box.com/s/jtnlgbvszf9bn1pakn6fvsiwvas4x8fq
2: PI Driven Segmentation: https://ibm.box.com/s/h1rslp5yzbobl7j2jvcndigpq8pmo798
3. IBM Sports - IX - https://www.youtube.com/watch?v=7zKLEyLTqNU - Toronto Raptors UX/Data Viz/Watson
4. Leveraging personality to predict consumption preferences https://www.ibm.com/blogs/watson/2016/10/leveraging-personality-predict-consumption-preferences/
5. Watson Ads https://watsonads.com
6. 10 Major Brands Using IBM Watson - http://www.topbots.com/10-major-fortune-500-brands-using-ibm-watson/
7. Equals3 Lucy - Powered by Watson - http://equals3.ai/
8. Influential - Powered by Watson (Social Listening and Social Messaging) - https://influential.co/
VERY GOOD: Localized IBM Watson Visual Recognition using image preprocessing (great blog and source code from Andrew Trice ) https://www.ibm.com/blogs/bluemix/2017/03/sharpen-watson-visual-recognition-results/
Slice and Dice Images (very good)
This is complete source code for visualizing localized results for a single class, within a single custom classifier using image preprocessing (tiling) techniques, and will run on Bluemix as-is. You can re-use as much as needed. You just need to specify your Watson VR key and the classifier ID inside of app.js. It can also be extended to support multiple classifiers and multiple classifications within a classifier https://github.com/IBM-Bluemix/Visual-Recognition-Tile-Localization
Visual Recognition Overview - http://www.ibm.com/watson/developercloud/visual-recognition.html Visual Recognition demo - https://visual-recognition-demo.mybluemix.net/ ('Try' = out-of-the-box tagging, 'Train' = custom classifiers) Similarity Search demo - https://similarity-search-demo.mybluemix.net/ Best practices for custom classifiers - http://ibm.co/2ehdy7P Guidelines for good training - http://ibm.co/2cpFS7i API Reference - bit.ly/VR-ref Documentation - bit.ly/VR-docs Fork on Github - bit.ly/VR-github
Visual Recognition Use Cases by Industry - http://ibm.co/2e1Frzr OmniEarth case study (satellite imagery) - http://ibm.co/2cldt3w Drones performing inspections: https://youtu.be/BWDfP_udMA0 Drones gathering live video data: use case -
http://ibm.co/2dLwa1yhttps://techcrunch.com/2016/06/29/ibm-drone/source code - https://github.com/IBM-Bluemix/skylink Drones performing unmanned inspections: http://bit.ly/2fCpgt9 Hail damage classification with Watson - https://youtu.be/VrZMQZSB_UE?t=1h8m6s Similarity search for retail: http://bit.ly/1sKkaRm Visual Recognition for videos: https://github.com/IBM-Bluemix/openwhisk-darkvisionapp iTrend case study - http://ecc.ibm.com/case-study/us-en/ECCF-ASC12419USEN US Open - https://www.ibm.com/blogs/bluemix/2016/09/us-open-2016-bluemix-delivering-cognitive/ Pokemon Go - https://www.ibm.com/blogs/internet-of-things/pokemon-go-watson/ Seafood Fraud (U.S. State Dept hackathon winner) - https://devpost.com/software/dory
Under Armour + IBM Watson - http://ibm.co/2jqOBKE YNAP Video (Retail): https://ibm.ent.box.com/s/kocj7yor2aol9g4ny6obdola0lio7m2f Tile Localization technique / code: https://github.com/IBM-Bluemix/Visual-Recognition-Tile-Localization
CRM magazine IBM's Augmented Shopping Advisor https://www.youtube.com/watch?v=nyM9Pfa-yOI IBM UK - IBM and Tesco test Augmented Reality mobile app https://www.youtube.com/watch?v=qJMyC9o08OM
ANDY TRICE'S GOODIES: Here is a CLI for the Watson Visual Recognition service:
https://developer.ibm.com/dwblog/2017/command-line-tools-watson-visual-recognition/ Best practices for custom classifiers - http://ibm.co/2ehdy7P Guidelines for good training - http://ibm.co/2cpFS7i
Andrew Trice demonstrates a solution for insurance adjustment utilizing drones and built on IBM Bluemix. Build cognitive solutions
Digital Twins and Asset Management
IBM Watson & Digital Twins - Holistic Asset Management https://www.linkedin.com/pulse/digital-twins-threads-holistic-asset-management-ryan-anderson
Designing Better Machines - PPT Deck from Sky on Cognitive Digital Twinhttps://www.slideshare.net/IBMIoT/hannover-messe-evolution-of-a-cognitive-digital-twin
What is Digital Twin? https://www.ibm.com/blogs/internet-of-things/digital-twin/ https://www.slideshare.net/IBMIoT/ibm-watson-internet-of-things-introducing-digital-twin (slides)
IBM launches new Digital Twin and Platform capabilities for Watson Internet of Things https://www.ibm.com/blogs/internet-of-things/new-digital-twin-capabilities/
Building a Digital Twin using Watson IoT Platform https://developer.ibm.com/iotplatform/2017/05/01/building-digital-twin-using-watson-iot-platform/
IOT & Digital Twin - Industry 4.0 https://www.enterpriseirregulars.com/113624/ibm-watson-iot-digital-twin-industry-4-0/
Digital Twin: Bridging the physical-digital divide https://www.ibm.com/internet-of-things/resources/digital-twin/digital-convergence/
Immersive Insights and International Space Station (Watson Verbal Interface) https://www.youtube.com/watch?v=6WRNqdOMXmc https://vimeo.com/216672088
Blogs
9. https://ibmpairs.mybluemix.net/ is a platform, specifically designed for massive geospatial-temporal data (maps, satellite, weather, drone, IoT), query and analytics services.