Public Preview - Video Indexer Multilingual identification and transcription

Posted on Wednesday, September 18, 2019

Some media assets like news, current affairs, and interviews contain audio with speakers using different languages. Most existing speech-to-text capabilities require the audio recognition language to be specified in advance, which is an obstacle to transcribing multilingual videos. Our new automatic spoken language identification for multiple content feature leverages machine learning technology to identify the different languages used in a media asset. Once detected, each language segment undergoes an automatic transcription process in the language identified, and all segments are integrated back together into one transcription file consisting of multiple languages.

Learn more about the new multilingual option

  • Media Services
  • Video Indexer
  • Features
  • Services